Dilated VGG Networks (Custom Dataset for Image Classification task)

Author

Arpit Aggarwal

Introduction to the Project

In this project, different CNN Architectures like Dilated VGG-16, Dilated VGG-19, VGG-16 and VGG-19 were used for the task of Dog-Cat image classification. The input to the CNN networks was a (224 x 224 x 3) image and the number of classes were 2, where '0' was for a cat and '1' was for a dog. The CNN architectures were implemented in PyTorch and the loss function was Cross Entropy Loss. The hyperparameters to be tuned were: Number of epochs(e), Learning Rate(lr), momentum(m), weight decay(wd) and batch size(bs).

Data

The data for the task of Dog-Cat image classification can be downloaded from: https://drive.google.com/drive/folders/1EdVqRCT1NSYT6Ge-SvAIu7R5i9Og2tiO?usp=sharing. The dataset has been divided into three sets: Training data, Validation data and Testing data. The analysis of different CNN architectures for Dog-Cat image classification was done on comparing the Training Accuracy and Validation Accuracy values.

Results

The results after using different CNN architectures are given below:

VGG-16(pretrained on ImageNet dataset)

Training Accuracy = 99.27% and Validation Accuracy = 96.73% (e = 50, lr = 0.005, m = 0.9, bs = 32, wd = 0.001)

VGG-19(pretrained on ImageNet dataset)

Training Accuracy = 99.13% and Validation Accuracy = 97.25% (e = 50, lr = 0.005, m = 0.9, bs = 32, wd = 5e-4)

Dilated VGG-16

Training Accuracy = 99.17% and Validation Accuracy = 97.11% (e = 40, lr = 1e-3, m = 0.9, bs = 32, wd = 5e-4)

Dilated VGG-19

Training Accuracy = 98.81% and Validation Accuracy = 97.18% (e = 40, lr = 1e-3, m = 0.9, bs = 32, wd = 5e-4)

Software Required

To run the jupyter notebooks, use Python 3. Standard libraries like Numpy and PyTorch are used.

Credits

The following links were helpful for this project:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
models		models
notebooks		notebooks
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dilated VGG Networks (Custom Dataset for Image Classification task)

Author

Introduction to the Project

Data

Results

Software Required

Credits

About

Releases

Packages

Languages

License

arp95/dilated_vgg_networks

Folders and files

Latest commit

History

Repository files navigation

Dilated VGG Networks (Custom Dataset for Image Classification task)

Author

Introduction to the Project

Data

Results

Software Required

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages