This project aims to implement and compare two deep learning models—Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP)—for image classification on the MNIST and Fashion MNIST datasets. I experimented with different architectures, tuned hyperparameters, and analyzed model performance through visualizations and confusion matrices.
Problem Statement: The goal of this project is to:
- Implement a CNN and MLP architecture for image classification.
- Train the models on MNIST and Fashion MNIST datasets.
- Explore different hyperparameters and configurations to improve model performance.
- Compare the models in terms of accuracy, training time, and common misclassifications.
I loaded the MNIST and Fashion MNIST datasets using TensorFlow/Keras and visualized samples from both datasets to understand their image characteristics.
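Below is a minimal sketch of this loading-and-preview step using `tf.keras.datasets` (the exact notebook code may differ):

```python
import matplotlib.pyplot as plt
from tensorflow.keras.datasets import fashion_mnist  # mnist loads the same way

# Load the data and scale pixel values to [0, 1].
(x_train, y_train), (x_test, y_test) = fashion_mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

# Preview the first five training images with their labels.
fig, axes = plt.subplots(1, 5, figsize=(10, 2))
for ax, img, label in zip(axes, x_train, y_train):
    ax.imshow(img, cmap="gray")
    ax.set_title(int(label))
    ax.axis("off")
plt.show()
```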
I built a CNN architecture with two convolutional layers, max-pooling layers, and fully connected layers, and trained it on both datasets (a Keras sketch follows the hyperparameter list below):
- Convolutional Layer with 32 filters (3x3) and ReLU activation.
- MaxPooling Layer (2x2).
- Convolutional Layer with 32 filters (3x3) and ReLU activation.
- MaxPooling Layer (2x2).
- Flatten Layer.
- Dense Layer with 128 neurons and ReLU activation.
- Output Layer with 10 neurons (softmax activation for classification).

Training hyperparameters:
- Batch size: 64
- Epochs: 10
- Learning rate: 0.001
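A minimal Keras sketch of this baseline CNN, reusing the `x_train`/`y_train` arrays from the loading snippet above (the notebook code may differ in detail):

```python
from tensorflow import keras
from tensorflow.keras import layers

cnn = keras.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),
])

cnn.compile(optimizer=keras.optimizers.Adam(learning_rate=0.001),
            loss="sparse_categorical_crossentropy",
            metrics=["accuracy"])

# Conv2D expects a channel axis: (N, 28, 28) -> (N, 28, 28, 1).
history = cnn.fit(x_train[..., None], y_train, batch_size=64, epochs=10,
                  validation_data=(x_test[..., None], y_test))
```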
To further improve model performance, I experimented with the following hyperparameters (a sketch of the tuned model follows this list):
- Increased the number of filters to 64.
- Adjusted kernel size to (5x5) for better feature extraction.
- Added dropout layers (0.5 rate) to reduce overfitting.
- Lowered the learning rate to 0.0001 for more stable training.
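A sketch of the tuned variant with these changes applied, continuing from the imports above (placing the dropout after the dense layer is my assumption; the notebook may position it differently):

```python
tuned_cnn = keras.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(64, (5, 5), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (5, 5), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.5),  # reduces overfitting in the dense head
    layers.Dense(10, activation="softmax"),
])
tuned_cnn.compile(optimizer=keras.optimizers.Adam(learning_rate=0.0001),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
```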
The tuned CNN model improved performance, particularly on the Fashion MNIST dataset.
I implemented a Multi-Layer Perceptron (MLP) with the following architecture (a Keras sketch follows the hyperparameter list below):
- Flatten Layer to transform the image data.
- Dense Layer with 128 neurons and ReLU activation.
- Dropout Layer (0.5) for regularization.
- Dense Layer with 64 neurons and ReLU activation.
- Output Layer with 10 neurons (softmax activation for classification).

Training hyperparameters:
- Batch size: 64
- Epochs: 10
- Learning rate: 0.001
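A corresponding Keras sketch of the baseline MLP; the Flatten layer lets it consume the raw 28x28 images directly:

```python
mlp = keras.Sequential([
    layers.Input(shape=(28, 28)),
    layers.Flatten(),                       # 28x28 -> 784 features
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(64, activation="relu"),
    layers.Dense(10, activation="softmax"),
])
mlp.compile(optimizer=keras.optimizers.Adam(learning_rate=0.001),
            loss="sparse_categorical_crossentropy",
            metrics=["accuracy"])
mlp.fit(x_train, y_train, batch_size=64, epochs=10,
        validation_data=(x_test, y_test))
```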
To improve the MLP's performance, I adjusted the following hyperparameters (see the sketch after this list):
- Increased the number of neurons in the fully connected layers (256, 128).
- Reduced dropout rate to 0.3.
- Lowered learning rate to 0.0001.
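A sketch of the tuned MLP with those adjustments:

```python
tuned_mlp = keras.Sequential([
    layers.Input(shape=(28, 28)),
    layers.Flatten(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.3),                    # lighter regularization
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),
])
tuned_mlp.compile(optimizer=keras.optimizers.Adam(learning_rate=0.0001),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
```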
- I trained both models (CNN and MLP) on the MNIST and Fashion MNIST datasets.
- Generated training/validation accuracy and loss curves to visualize model performance over epochs.
- Generated confusion matrices to analyze common misclassifications in both models (a plotting sketch follows this list).
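One way to produce these plots, using the Keras `History` object and scikit-learn's confusion-matrix helper (the notebook's plotting code may differ):

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import ConfusionMatrixDisplay

# Accuracy curves over epochs, from model.fit()'s return value.
plt.plot(history.history["accuracy"], label="train")
plt.plot(history.history["val_accuracy"], label="validation")
plt.xlabel("Epoch")
plt.ylabel("Accuracy")
plt.legend()
plt.show()

# Confusion matrix from predicted class labels on the test set.
y_pred = np.argmax(cnn.predict(x_test[..., None]), axis=1)
ConfusionMatrixDisplay.from_predictions(y_test, y_pred)
plt.show()
```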
- CNN performed better on both datasets, achieving 99% accuracy on MNIST and 88% accuracy on Fashion MNIST.
- MLP performed well but was slightly behind the CNN, with 97% accuracy on MNIST and 87% accuracy on Fashion MNIST.
The CNN model outperformed the MLP model, particularly on the Fashion MNIST dataset, which contains more complex image patterns. Hyperparameter tuning further improved both models' performance, and the confusion matrices highlighted areas for improvement, such as reducing misclassifications between visually similar classes.
├── README.md
├── Image_Classification.ipynb
├── Problem Statement
├── Images
└── LICENSE
I. README.md: Contains the project overview and explanation.
II. Image_Classification.ipynb: Contains the implementation of and experiments with the CNN and MLP models, along with the code for dataset loading and visualization.
III. Problem Statement: Contains the details on how to approach the project.
IV. Images: Folder containing visualization outputs (accuracy/loss curves, confusion matrices).
V. LICENSE: A short and simple permissive license with conditions only requiring preservation of copyright and license notices.
This project was part of my Machine Learning internship at Skolar, which I successfully completed from April 2024 to June 2024. 🎉
During this internship, I had the amazing opportunity to work on an Image Classification project using CNN and MLP on the MNIST and Fashion MNIST datasets. This project helped me enhance my skills in deep learning, hyperparameter tuning, and model analysis. The experience was both challenging and incredibly rewarding.
💡 Key Takeaways:
- Mastered the application of CNNs and MLPs in image classification.
- Gained a deeper understanding of hyperparameter tuning and model optimization.
- Improved my ability to analyze and interpret model results using visualizations and confusion matrices.
📢 P.S. I have provided the internship certificate in the repository files.