Download region_descriptions.json.zip and extract it.
Just for test, 'imgs' folder contains only first 200 images from the original dataset(images.zip)
-
'1_img_annotating.py' creates 'imgs_anno' folder to annotate the original images using the information of 'region_descriptions.json'.
-
'2_json_pruning.py' creates 'region_descriptions_pruned.json' file, which eliminates many unnecessary duplicated annotations.