-
Download and extract the COCO 2014 Train images and 2014 Val images from here.
-
Download the Karpathy split for COCO from here.
-
Run
notebooks/preprocess_mscoco.ipynb
, updating paths at the top of the notebook. -
Update the
PATHS
variable at the top oflibs/datasets/utils.py
.
-
Download the Flickr30k images from here.
-
Download the Karpathy split for Flickr30k from here.
-
Run
notebooks/preprocess_flickr30k.ipynb
, updating paths at the top of the notebook. -
Update the
PATHS
variable at the top oflibs/datasets/utils.py
.
-
Download the MMIMDB dataset from here.
-
Run
notebooks/preprocess_mmimdb.ipynb
, updating paths at the top of the notebook. -
Update the
PATHS
variable at the top oflibs/datasets/utils.py
.
-
Obtain access to the MIMIC-CXR-JPG Database Database on PhysioNet and download the dataset.
-
Download and unzip the
mimic-cxr-reports.zip
file from this repository. -
Run
notebooks/preprocess_mimiccxr.ipynb
, updating paths at the top of the notebook. -
Update the
PATHS
variable at the top oflibs/datasets/utils.py
.