Skip to content

nguyendinhson-kaist/ICCV23-Seg-AutoID

Repository files navigation

ICCV23-Seg-AutoID

Setup Environment

Step1: Create a conda environment with Python=3.9. Command:

conda create --name iccv23-py39 python=3.9

Step2: Install Pytorch with CUDA 11.8. Command:

conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia

Step3: Install mmdetection 3.1.0 (we will use it as a dependency). Command:

pip install -U openmim
mim install "mmengine==0.8.4"
mim install "mmcv>=2.0.0"
mim install "mmdet==3.1.0"

Step4: We also use some backbone from mmpretrain. Command:

mim install "mmpretrain[multimodal]>=1.0.0rc8"

Other packages can be install by pip or conda --forge. Please check requirements.txt for more detail

Data preparation

Step1: download here and extract dataset as follow:

data
├── train
│   ├── image1.png
│   └── ...
├── val
│   ├── image1.png
│   └── ...
├── test
│   ├── image1.png
│   └── ...
├── train.json
├── val.json
└── test.txt

Step2 (optional): Because mmdet supports both RLE format and Binary format, you can convert RLE annotation to Polygon annotation. Run bellow command:

python utils/rle2polygon.py data/train.json data/poly_train.json
python utils/rle2polygon.py data/val.json data/poly_val.json

Your datafolder should be like this:

data
├── train
│   ├── image1.png
│   └── ...
├── val
│   ├── image1.png
│   └── ...
├── test
│   ├── image1.png
│   └── ...
├── train.json
├── val.json
├── poly_train.json
├── poly_val.json
└── test.txt

Note that we add prefix "poly_" by convention which matchs with the dataset config (change annotation file names in config file).

Step3: You can train the model on train+val dataset to have higher accuracy. Run the bellow command:

python utils/merge_dataset.py data_root

Step4 (optional): we also use a specialized CopyPaste augmentation technique. To train models with our CopyPaste technique, you need to prepare a folder that contains all ground-truth instances from trainning set. Run the bellow command:

python utils/extract_objects.py data_root mode

A {mode}_cropped_objects folder will be created inside the data folder.

Step5 (optional): In case you want to visualize the augmented images, we provide a tool to help you. Run the below command:

python utils/browser_dataset.py data_config --output-dir data_sample --not-show

Learn about configs

Get ready with mmdet config

Everything in mmdet is about config. Before starting with mmdet, you should read their tutorial first. mmdetection (or mmdet) is built on top of a core package called mmengine. I highly recommend you to check their homepage and github for detail documentation and tutorials.

Or read their config explanation at least:

Our configs

Our dataset config can be found at:

./configs/_base_/datasets/vipriors_instance.py

Our model configs:

./configs/exp/{model_name}/*.py

Train a model

Single GPU training

Create a config for your experiment and save in ./configs folder. Then run below command:

CUDA_VISIBLE_DEVICES=0 python train.py config_path model_name

Training outputs (checkpoints, logs, merged config, tensorboard log, etc.) will be available in ./output/(model_name) folder

Distributed training

We also support distributed training. Run the below command:

CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 train.py config_path model_name --launcher python

Run SWA

To apply SWA (Stochastic Weight Averaging) after the main training is finished. Run the bellow command (you can use single GPU or Distributed training):

CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 swa_train.py config_path last_checkpoint model_name --launcher python

After that, use utils/get_swa_model.py to average all checkpoints exported by swa training process:

python train.py checkpoint_dir

Export submission result

After training a model, you can export submission result. Submission can be produced on val set or test set (if val set, you can see the evalution score). Run below commands:

CUDA_VISIBLE_DEVICES=0 python test.py config_path checkpoint_path [--valid] [--tta]

If you produce result on test set, omit --valid. If you want to apply TTA (test time augmentation), use --tta

All results should be in ./output folder. After running the test command, you can find the inference result as "result.segm.json". Rename it to "submission.json" then zip it. Now it is ready to submit to the test server.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published