Animal Image Classification using Deep Learning

Author: Tin Trung Nguyen

Project Overview

This project builds a deep learning pipeline to classify animal images into 10 categories using convolutional neural networks (CNNs) and transfer learning.

The goal is to explore the full machine learning workflow, including:

Data exploration and preprocessing
Model development (baseline CNN and pretrained models)
Training and evaluation
Model interpretation using Grad-CAM

Dataset

The dataset consists of approximately 26,000+ images across 10 animal classes:

dog
cat
horse
spider
butterfly
chicken
sheep
cow
squirrel
elephant

Images are collected from real-world sources and include variations which makes the dataset suitable for testing model robustness.

Dataset link on Kaggle: Animals-10

Project Pipeline

The project follows a structured machine learning workflow:

1. Dataset Setup

Organize dataset structure
Verify class labels and image counts

2. Exploratory Data Analysis (EDA)

Analyze class distribution
Inspect image sizes
Detect corrupted or small images
Visualize sample images

3. Data Preprocessing

Train / validation / test split
Image resizing (224 × 224)
Data augmentation (flip, rotation)
Normalization using ImageNet statistics
PyTorch Dataset and DataLoader creation

4. Model Training

Baseline CNN trained from scratch
Transfer learning using pretrained ResNet
Training and validation loops
Performance tracking

5. Model Evaluation

Generating predictions on unseen test data
Computing a confusion matrix to visualize class-wise performance
Producing a classification report (precision, recall, F1-score)
Identifying and visualizing misclassified examples

6. Model Explainability

Extracting feature maps from the final convolutional layer
Computing gradients of the predicted class
Generating heatmaps highlighting important regions
Overlaying heatmaps on original images for interpretation

Findings

The baseline CNN achieved moderate performance, reaching approximately 70% validation accuracy after training. While the model was able to learn meaningful features, its performance plateaued due to the limited capacity of a simple architecture.

In contrast, the ResNet model demonstrated significantly stronger performance, achieving approximately 95% validation accuracy. The model converged rapidly within the first few epochs, highlighting the effectiveness of transfer learning for image classification tasks.

Most classes are classified accurately, including:

dog
spider
chicken
horse

These errors are expected due to similarities in shape, texture, and visual context.

Analysis of misclassified images reveals that errors are often caused by:

Cluttered or complex backgrounds
Low image quality or lighting conditions
Small or partially visible objects
Visual similarity between animal classes

This suggests that the model occasionally relies on contextual or background cues in addition to object features.

Grad-CAM visualizations show that the model generally focuses on relevant regions of the image when making predictions.

In some cases, attention is partially directed toward background regions, indicating that contextual information may influence predictions.

Dependencies

torch
torchvision
numpy
pandas
matplotlib
seaborn
scikit-learn
pillow
opencv-python
jupyter

Project Structure

animals-classification/

├── archive/
│   └── raw-img/
│
├── notebooks/
│   ├── data/
│   │   ├── train/
│   │   ├── val/
│   │   └── test/
│   │
│   ├── models/
│   │   ├── simple_cnn.pth
│   │   └── resnet.pth
│   │
│   ├── dataset_setup.ipynb
│   ├── exploratory_data_analysis.ipynb
│   ├── data_preprocessing.ipynb
│   ├── model_training.ipynb
│   ├── model_evaluation.ipynb
│   └── model_explainability.ipynb
│
├── utils/
│   └── dataset.py
│
└── README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Animal Image Classification using Deep Learning

Project Overview

Dataset

Project Pipeline

1. Dataset Setup

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Model Training

5. Model Evaluation

6. Model Explainability

Findings

Dependencies

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
archive		archive
notebooks		notebooks
utils		utils
.gitattributes		.gitattributes
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Animal Image Classification using Deep Learning

Project Overview

Dataset

Project Pipeline

1. Dataset Setup

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Model Training

5. Model Evaluation

6. Model Explainability

Findings

Dependencies

Project Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages