- Python 3.6
- TensorFlow (1.11)
- Keras (2.2.4)
- keras-vis (latest version from GitHub):
pip install git+https://github.com/raghakot/keras-vis.git
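Before running anything, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and it only checks that each module can be found, not that the installed version matches the one listed. keras-vis is imported under the module name `vis`.

```python
import importlib.util

# Top-level module names for the requirements listed above.
# keras-vis is imported as "vis".
REQUIRED_MODULES = ["tensorflow", "keras", "vis"]


def missing_modules(names):
    """Return the names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Usage:
#   missing = missing_modules(REQUIRED_MODULES)
#   if missing:
#       print("Missing packages:", ", ".join(missing))
```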
- Look at some groups of images: http://chenlab.ece.cornell.edu/projects/touch-coseg/iCoseg_dataset.pdf;
- Download the dataset from: http://chenlab.ece.cornell.edu/downloads.html;
- Move dataset folders inside the
- Copy 80 images and their ground truths.
List of images to be copied: ./datasets/icoseg/subset_80/img_list.txt
Folder structure:
├── ...
├── datasets
│ ├── icoseg
│ │ ├── ground_truth
│ │ ├── image_download
│ │ ├── images
│ │ ├── images_subset5_cvpr10
│ │ ├── subset_80
│ │ │ ├── ground_truth
│ │ │ ├── images
│ │ │ ├── img_list.txt
│ │ │ ├── train.txt
│ │ │ ├── val.txt
│ │ │ └── test.txt
│ │ ├── readme.txt
│ │ └── ...
│ └── ...
└── ...
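The copy step for the 80-image subset can be scripted. This is a minimal sketch under stated assumptions: it assumes `img_list.txt` holds one relative file path per line (the actual format of the list is not documented here), and the helper name `copy_subset` is hypothetical. Entries that are not found in the source folder are returned rather than raising an error, so the list can be checked afterwards.

```python
import shutil
from pathlib import Path


def copy_subset(list_file, src_root, dst_root):
    """Copy every file named in `list_file` from `src_root` to `dst_root`.

    Assumes one relative path per line; blank lines are skipped.
    Returns the entries that were not found under `src_root`.
    """
    src_root, dst_root = Path(src_root), Path(dst_root)
    missing = []
    for line in Path(list_file).read_text().splitlines():
        rel = line.strip()
        if not rel:
            continue
        src = src_root / rel
        if not src.is_file():
            missing.append(rel)
            continue
        dst = dst_root / rel
        # Recreate any sub-folder structure below the destination root.
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(str(src), str(dst))
    return missing


# Usage (paths follow the folder structure above):
#   copy_subset("./datasets/icoseg/subset_80/img_list.txt",
#               "./datasets/icoseg/images",
#               "./datasets/icoseg/subset_80/images")
```

The ground truths would need a second call with `ground_truth` as source and destination. If the mask files use a different extension than the images, the list entries have to be adapted first.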
- Download the dataset from: https://project.inria.fr/aerialimagelabeling/download/;
- Select a subset of images;
- Use
to change the image format.
The final structure of ./datasets:
├── datasets
│ ├── icoseg
│ │ ├── subset_80
│ │ └── ...
│ ├── inria_aerial
│ │ └── subset_chicago
│ │ ├── test
│ │ │ ├── test_img_from_patches # images after merging 224x224 image patches (_h_w)
│ │ │ ├── test_img_patches # 512x512 image patches (+img_h+img_w) for unet
│ │ │ ├── test_img_patches_for_sal # 224x224 image patches (+img_h+img_w), don't use
│ │ │ ├── test_mask_from_patches # masks after merging 224x224 mask patches (_h_w)
│ │ │ ├── test_mask_patches # 512x512 mask patches (+img_h+img_w) for unet
│ │ │ ├── test_mask_patches_for_sal # 224x224 mask patches (+img_h+img_w), don't use
│ │ │ ├── test_None_sal_from_patches # images after merging 512x512 patches (None)
│ │ │ ├── test_None_sal_patches # 512x512 vgg16_sal_map_None patches for unet
│ │ │ ├── test_None_sal_patches_224_224 # 224x224 result patches from vgg16_sal_map_None
│ │ │ ├── test_guided_sal_from_patches # images after merging 512x512 patches (guided)
│ │ │ ├── test_guided_sal_patches # 512x512 vgg16_sal_map_guided patches for unet
│ │ │ └── test_guided_sal_patches_224_224 # 224x224 result patches from vgg16_sal_map_guided
│ │ ├── train
│ │ │ └── ...
│ │ └── val
│ │ └── ...
│ ├── playground
│ └── unseen_categories
│ └── test
└── ...
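The patch folders above imply that each aerial image is cut into fixed-size patches whose names carry their position. The sketch below shows one way to compute patch positions. It is an assumption, not the repository's method: the helper names `patch_origins` and `patch_name` are hypothetical, the `_h_w` suffix is modelled on the comments in the tree, and the handling of image sizes that are not a multiple of the patch size (shifting the last row and column back so they end at the border) may differ from what the project actually does.

```python
def patch_origins(height, width, patch_size):
    """Return (h, w) top-left corners of patches covering the whole image.

    Patches lie on a regular grid. If a side is not a multiple of
    `patch_size`, the last patch along that side is shifted back so it
    ends at the image border and overlaps its neighbour.
    """
    def starts(length):
        if length <= patch_size:
            return [0]
        last = length - patch_size
        values = list(range(0, last, patch_size))
        values.append(last)
        return values

    return [(h, w) for h in starts(height) for w in starts(width)]


def patch_name(stem, h, w):
    """Build a patch file stem with a hypothetical `_h_w` position suffix."""
    return "{}_{}_{}".format(stem, h, w)


# Usage: list the patch names for a 1024x1024 image cut into 512x512 patches.
#   [patch_name("tile", h, w) for h, w in patch_origins(1024, 1024, 512)]
```

Storing the origin in the file name is what makes the later merge step possible: each patch can be pasted back at `(h, w)` without any extra index file.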
- Visualizing and Understanding Convolutional Networks (Zeiler and Fergus, 2013)
- Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps (Simonyan et al., 2014)
- Striving for Simplicity: The All Convolutional Net (Springenberg et al., 2015)
- Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization (Selvaraju et al., 2016)
- iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance (Batra et al., 2010)
- U-Net: Convolutional Networks for Biomedical Image Segmentation (Ronneberger et al., 2015)
- keras-vis for neural network visualization
- https://machinelearningmastery.com/use-pre-trained-vgg-model-classify-objects-photographs/