Video Classification on Indian-Roads using DeepLabV3Plus-Pytorch

We have used Segmentation Backbone of DeepLabv3+ model pre-trained on eMARG-15k(Good/Bad) and extended it for Binary Classification by adding simple Conv + FC layer combination layers.

Quick Overview

Architecture of DeeplabV3+ Fine-tuned for Binary Classification.

DeepLabV3	DeepLabV3+
deeplabv3_resnet50	deeplabv3plus_resnet50
deeplabv3_resnet101	deeplabv3plus_resnet101
deeplabv3_mobilenet	deeplabv3plus_mobilenet
deeplabv3_hrnetv2_48	deeplabv3plus_hrnetv2_48
deeplabv3_hrnetv2_32	deeplabv3plus_hrnetv2_32

All pretrained model checkpoints: Drive

1. Load the pretrained model:

model.load_state_dict( torch.load( CKPT_PATH )['model_state']  )

2. Prediction

Single image:

python predict.py --input datasets/data/eMARG/leftImg8bit/train/city0/PE-AR-7382-157_2_leftImg8bit  --dataset cityscapes --model deeplabv3plus_mobilenet --ckpt checkpoints/best_deeplabv3plus_mobilenet_cityscapes_os16.pth --save_val_results_to test_results

Image folder:

python predict.py --input datasets/data/eMARG/leftImg8bit/train/city0  --dataset cityscapes --model deeplabv3plus_mobilenet --ckpt checkpoints/best_deeplabv3plus_mobilenet_cityscapes_os16.pth --save_val_results_to test_results

Results

1. Performance on eMARG (6 classes, 512 x 384)

Training: 768x768 random crop
validation: 512x384

Model	Batch Size	Accuracy	Precision	Recall	F1-score	checkpoint_link
DeepLabV3Plus-ResNet101	4	0.884	0.8618	0.915	0.887	Download
DeepLabV3Plus-MobileNet	8	0.869	0.841	0.908	0.874	Download

GradCAM Results on eMARG (DeepLabv3Plus-MobileNet/ResNet-101)

eMARG Dataset

1. Requirements

pip install -r requirements.txt

2. Download eMARG and extract it likewise Cityscapes dataset in this format 'datasets/data/eMARG'

/datasets
    /data
        /eMARG
            /gtFine
            /leftImg8bit

3. Train your model on eMARG likewise Cityscapes.

python main.py --model deeplabv3plus_mobilenet --dataset cityscapes --enable_vis --vis_port 28333 --gpu_id 0  --lr 0.1  --crop_size 768 --batch_size 16 --output_stride 16 --data_root ./datasets/data/eMARG
python main.py --model deeplabv3plus_resnet101 --dataset cityscapes --enable_vis --vis_port 28333 --gpu_id 0  --lr 0.1  --crop_size 768 --batch_size 16 --output_stride 16 --data_root ./datasets/data/eMARG

4. Testing

Results will be saved at ./results.

python main.py --model deeplabv3plus_mobilenet --enable_vis --vis_port 28333 --gpu_id 0 --year 2012_aug --crop_val --lr 0.01 --crop_size 513 --batch_size 16 --output_stride 16 --ckpt checkpoints/best_deeplabv3plus_mobilenet_cityscapes_os16.pth --test_only --save_val_results
python main.py --model deeplabv3plus_mobilenet --enable_vis --vis_port 28333 --gpu_id 0 --year 2012_aug --crop_val --lr 0.01 --crop_size 513 --batch_size 16 --output_stride 16 --ckpt checkpoints/best_deeplabv3plus_mobilenet_cityscapes_os16.pth --test_only --save_val_results

Reference

[1] Rethinking Atrous Convolution for Semantic Image Segmentation

[2] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
datasets		datasets
images		images
metrics		metrics
network		network
utils		utils
video_1		video_1
video_1new		video_1new
.gitignore		.gitignore
LICENSE		LICENSE
OUT1_video_1.mp4		OUT1_video_1.mp4
README.md		README.md
fastai_video_inference.py		fastai_video_inference.py
get-pip.py		get-pip.py
main.py		main.py
out_video_1.mp4		out_video_1.mp4
predict.py		predict.py
requirements.txt		requirements.txt
run_save_maps.sh		run_save_maps.sh
video.py		video.py
video_inference.py		video_inference.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Classification on Indian-Roads using DeepLabV3Plus-Pytorch

Quick Overview

Architecture of DeeplabV3+ Fine-tuned for Binary Classification.

All pretrained model checkpoints: Drive

1. Load the pretrained model:

2. Prediction

Results

1. Performance on eMARG (6 classes, 512 x 384)

GradCAM Results on eMARG (DeepLabv3Plus-MobileNet/ResNet-101)

eMARG Dataset

1. Requirements

2. Download eMARG and extract it likewise Cityscapes dataset in this format 'datasets/data/eMARG'

3. Train your model on eMARG likewise Cityscapes.

4. Testing

Reference

About

Releases

Packages

Languages

License

shubhampundhir/dlv3plus_binaryClf_VideoAnalytics

Folders and files

Latest commit

History

Repository files navigation

Video Classification on Indian-Roads using DeepLabV3Plus-Pytorch

Quick Overview

Architecture of DeeplabV3+ Fine-tuned for Binary Classification.

All pretrained model checkpoints: Drive

1. Load the pretrained model:

2. Prediction

Results

1. Performance on eMARG (6 classes, 512 x 384)

GradCAM Results on eMARG (DeepLabv3Plus-MobileNet/ResNet-101)

eMARG Dataset

1. Requirements

2. Download eMARG and extract it likewise Cityscapes dataset in this format 'datasets/data/eMARG'

3. Train your model on eMARG likewise Cityscapes.

4. Testing

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages