This project implements a deep learning classifier for multimodal data. The dataset consists of MNIST image and audio data, each represented as 1D NumPy arrays, covering the digits 0 to 9. The model uses two convolutional neural networks (CNNs), one trained on the images and one on the audio, whose outputs are combined by late fusion in a multilayer neural network. After tuning and adding layers, the final model reached an accuracy of 0.987.
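The late-fusion design described above can be illustrated with a minimal forward-pass sketch in NumPy. It is not the repository's code: the weights are random and untrained, each CNN is reduced to a single convolution layer with global max pooling, and every name and size here (`IMAGE_DIM`, `AUDIO_DIM`, the kernel counts, the hidden width, `late_fusion_forward`) is an assumption chosen for illustration. It only shows how two per-modality feature vectors can be concatenated and passed to a small multilayer head that outputs ten class probabilities.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed sizes (not taken from the project): a flattened 28x28 image
# and a hypothetical 1D audio feature vector.
IMAGE_DIM, AUDIO_DIM, NUM_CLASSES = 784, 507, 10

def conv1d_relu(x, kernels):
    """Valid 1D cross-correlation of x (N, L) with kernels (C, K), then ReLU.

    Returns an array of shape (N, C, L - K + 1).
    """
    k = kernels.shape[1]
    # windows has shape (N, L - K + 1, K)
    windows = np.lib.stride_tricks.sliding_window_view(x, k, axis=1)
    out = np.einsum("nlk,ck->ncl", windows, kernels)
    return np.maximum(out, 0.0)

def branch(x, kernels):
    """Stand-in for one modality's CNN: conv + ReLU + global max pooling."""
    return conv1d_relu(x, kernels).max(axis=2)  # shape (N, C)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def late_fusion_forward(image, audio, params):
    """Run each modality through its own branch, then fuse the features."""
    img_feat = branch(image, params["img_kernels"])
    aud_feat = branch(audio, params["aud_kernels"])
    fused = np.concatenate([img_feat, aud_feat], axis=1)  # late fusion
    hidden = np.maximum(fused @ params["w1"] + params["b1"], 0.0)
    return softmax(hidden @ params["w2"] + params["b2"])

# Random, untrained parameters: 8 kernels per branch, 32 hidden units.
params = {
    "img_kernels": rng.normal(scale=0.1, size=(8, 5)),
    "aud_kernels": rng.normal(scale=0.1, size=(8, 9)),
    "w1": rng.normal(scale=0.1, size=(16, 32)),
    "b1": np.zeros(32),
    "w2": rng.normal(scale=0.1, size=(32, NUM_CLASSES)),
    "b2": np.zeros(NUM_CLASSES),
}

# A batch of 4 synthetic samples standing in for real data.
image_batch = rng.random((4, IMAGE_DIM))
audio_batch = rng.random((4, AUDIO_DIM))
probs = late_fusion_forward(image_batch, audio_batch, params)
predicted_digits = probs.argmax(axis=1)
```

Because fusion happens only after each branch has produced its own feature vector, either branch can be changed or retrained without altering the other; only the input width of the fusion head depends on both.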
PavanS01/Multimodal-MNIST-Dataset-Classifier