Improving Training Strategies for SAR Image Classification

3rd place solution for the NTIRE 2021 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at CVPRW 2021
[Challenge Site] [Challenge Paper] [Challenge Award]

Overview

Recent advancemenets in deep learning have allowed for efficient and accurate classification of electro-optical (EO) images. However, classification of synthetic-aperature radar (SAR) images lag in accuracy. The objective of the challenge is to classify SAR image chips of vehicles into 10 classes, and this work showcases the strategies used to boost performance.

Methodology

Addressing low resolution inputs
The SAR image chips from the dataset have variable lengths of 54 to 60 pixels per side; small image sizes limit the amount of information that each image chip contains. Accuracy on small targets decreases when inputs are downsampled early on in networks, thus the stride of the convolutional layer within the stem block is changed from two to one.

Addressing limited dataset size
The dataset consists of samples that are alike, which limit the intraclass variation. To compensate, networks are pretrained on ImageNet. However, the stem block is excluded from pretraining due to the difference in stride and the inputs having a single channel as opposed to three.

A center-preserving Cutmix modification coined Central Cutmix is employed alongside traditional flipping and rotations to introduce variation during training.

Addressing class imbalance
The classes within the dataset exhibit a large imbalance, where the majority of samples are from the "sedan" class. To ensure a balanced performace, classes are under-sampled to approximately 1400 samples per class during training, which is approximately 80% of the number of "box truck" samples.

Results

MobileNet	Cutmix	Central cutmix	Stem stride=1	Cosine annealing	Accuracy valid data	Accuracy test data
V2					15.58
V2	x				16.88
V2		x			18.05
V3-Large		x			18.96
V3-Large		x	x		21.56
V3-Large		x	x	x	22.34	26.39

Usage

Dependencies

numpy
pandas
pillow
pytorch
pyyaml
torchvision
tqdm

Downloading and Preprocessing Data

cd ./setup/
bash setup.bash

Training

cd ./ntire2021/
python train.py

changing the save_dir path in config_train.yml allows for changing the location of the directory that the newly trained model will be saved to

Predicting

cd ./ntire2021/
python predict.py

changing the save_dir path in config_predict.yml allows for changing the location of the directory that the prediction csv file will be saved to

Citation

If you find this work useful in your research or publication, please cite this work:

@article{liu2021_nti,
  author={Liu, Jerrick and Inkawhich, Nathan and Nina, Oliver and Timofte, Radu and Jain, Sahil and Lee, Bob and Duan, Yuru and Wei, Wei and Zhang, Lei and Xu, Songzheng and Sun, Yuxuan and Tang, Jiaqi and Geng, Xueli and Ma, Mengru and Li, Gongzhe and Geng, Xueli and Cai, Huanqia and Cai, Chengxue and Cummings, Sol and Miron, Casian and Pasarica, Alexandru and Yang, Cheng-Yen and Hsu, Hung-Min and Cai, Jiarui and Mei, Jie and Yeh, Chia-Ying and Hwang, Jenq-Neng and Xin, Michael and Shangguan, Zhongkai and Zheng, Zihe and Yifei, Xu and Yang, Lehan and Xu, Kele and Feng, Min},
  title={NTIRE 2021 Multi-modal Aerial View Object Classification Challenge},
  publisher={arXiv},
  year={2021},
  doi={10.48550/ARXIV.2107.01189},
  url={https://arxiv.org/abs/2107.01189},
}

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
examples		examples
ntire2021		ntire2021
setup		setup
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving Training Strategies for SAR Image Classification

Overview

Methodology

Results

Usage

Dependencies

Downloading and Preprocessing Data

Training

Predicting

Citation

About

Languages

License

solcummings/ntire2021-sar

Folders and files

Latest commit

History

Repository files navigation

Improving Training Strategies for SAR Image Classification

Overview

Methodology

Results

Usage

Dependencies

Downloading and Preprocessing Data

Training

Predicting

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages