Classification of Handwritten Character Dataset and Consonant Vowel (CV) segment dataset

Objective

The aim of this project is to classify two sequence datasets using an RNN and an LSTM: a handwritten character dataset of Kannada/Telugu script stored in coordinate form, and a Consonant Vowel (CV) segment dataset drawn from conversational speech spoken in Hindi.

Handwritten Character Dataset

Data

There are five character classes: a, aI, bA, dA and lA. Each character is stored in a .txt file as a sequence of 2-dimensional points (x and y coordinates).
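
To make the coordinate format concrete, here is a minimal sketch of reading one character into a list of (x, y) points. The exact file layout is not specified here, so the sketch assumes whitespace-separated numbers in the order x1 y1 x2 y2 and so on; `parse_stroke` is a hypothetical helper name, not a function from this repository.

```python
def parse_stroke(text):
    """Parse a character's coordinates into a list of (x, y) tuples.

    Assumption (hypothetical layout): the text holds whitespace-separated
    numbers ordered as x1 y1 x2 y2 ... Adapt this to the real .txt files.
    """
    values = [float(token) for token in text.split()]
    if len(values) % 2 != 0:
        raise ValueError("expected an even number of coordinate values")
    # Pair every even-indexed value (x) with the following value (y).
    return list(zip(values[0::2], values[1::2]))


# Made-up sample text, for illustration only.
sample = "0.0 1.0 0.5 1.5 1.0 2.0"
points = parse_stroke(sample)
print(len(points), "points, first:", points[0])
```

Each character then becomes a variable-length sequence of 2-dimensional feature vectors, which is the shape of input a recurrent model consumes one step at a time.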

Model

RNN

Accuracy on Train Set: 0.96
Accuracy on Test Set: 0.94

LSTM

Accuracy on Train Set: 0.98
Accuracy on Test Set: 0.97

Confusion Matrix
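
A confusion matrix counts, for each true class (row), how many samples were predicted as each class (column). The following is a minimal sketch of that computation for the five character classes; the helper names are hypothetical and the labels are toy values for illustration, not results from this project.

```python
CLASSES = ["a", "aI", "bA", "dA", "lA"]


def confusion_matrix(y_true, y_pred, classes):
    """Return a matrix where entry [i][j] counts true class i predicted as j."""
    index = {name: i for i, name in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for true_label, pred_label in zip(y_true, y_pred):
        matrix[index[true_label]][index[pred_label]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of samples on the diagonal (correct predictions)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


# Toy labels for illustration only; NOT results from this project.
y_true = ["a", "aI", "bA", "dA", "lA", "bA"]
y_pred = ["a", "aI", "dA", "dA", "lA", "bA"]

cm = confusion_matrix(y_true, y_pred, CLASSES)
for name, row in zip(CLASSES, cm):
    print(name, row)
```

Off-diagonal entries show which pairs of classes the model confuses, which the overall accuracy figure alone does not reveal.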

Consonant Vowel (CV) segment dataset

Data

This dataset consists of a subset of CV segments from conversational speech data spoken in Hindi. Training and test data are provided separately inside the respective CV segment folder, where each class consists of 39-dimensional Mel frequency cepstral coefficient (MFCC) features.
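
Speech segments differ in length, so a batch of MFCC sequences usually has to be brought to a common length before it is fed to a recurrent model. The sketch below shows one common approach, zero-padding while keeping the true lengths; it is an assumption about preprocessing, not necessarily what this project does, and `pad_batch` is a hypothetical helper name.

```python
FEATURE_DIM = 39  # MFCC features per frame, as stated for this dataset


def pad_batch(sequences, feature_dim=FEATURE_DIM):
    """Zero-pad variable-length sequences of frames to the longest length.

    Returns (batch, lengths): batch has shape [n][max_len][feature_dim] as
    nested lists, and lengths holds each sequence's original frame count.
    """
    if not sequences:
        raise ValueError("need at least one sequence")
    lengths = [len(seq) for seq in sequences]
    max_len = max(lengths)
    batch = []
    for seq in sequences:
        for frame in seq:
            if len(frame) != feature_dim:
                raise ValueError("every frame must have feature_dim values")
        padding = [[0.0] * feature_dim for _ in range(max_len - len(seq))]
        batch.append([list(frame) for frame in seq] + padding)
    return batch, lengths


# Two made-up segments with 2 and 3 frames (values are placeholders).
short = [[0.1] * FEATURE_DIM for _ in range(2)]
longer = [[0.2] * FEATURE_DIM for _ in range(3)]
batch, lengths = pad_batch([short, longer])
print(lengths, len(batch[0]), len(batch[0][0]))
```

Keeping `lengths` alongside the padded batch lets the model or the loss ignore the padded frames.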

Model

RNN

Accuracy on Train Set: 0.988
Accuracy on Test Set: 0.899

LSTM

Accuracy on Train Set: 0.997
Accuracy on Test Set: 0.879

Confusion Matrix

Conclusion

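In both cases the data consists of long sequences. On the handwritten character dataset the LSTM outperforms the standard RNN on both the train set (0.98 vs 0.96) and the test set (0.97 vs 0.94). On the CV segment dataset the LSTM reaches a higher train accuracy (0.997 vs 0.988) but a lower test accuracy (0.879 vs 0.899), so these results confirm its advantage over the standard RNN only on the handwritten data.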
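
The two recurrent units compared here differ in how they carry information across time steps. The sketch below contrasts one step of a vanilla RNN with one step of a standard LSTM, using scalar states to keep it short; it illustrates the textbook update rules and is not the project's actual model code. The parameter layout (`params` as a dict of gate name to weights) is an assumption made for this example.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def rnn_step(x, h, w_x, w_h, b):
    """Vanilla RNN: h_t = tanh(w_x * x_t + w_h * h_{t-1} + b)."""
    return math.tanh(w_x * x + w_h * h + b)


def lstm_step(x, h, c, params):
    """One LSTM step with scalar hidden state h and cell state c.

    params maps each gate name ('i', 'f', 'o', 'g') to (w_x, w_h, b).
    """
    def pre_activation(gate):
        w_x, w_h, b = params[gate]
        return w_x * x + w_h * h + b

    i = sigmoid(pre_activation("i"))    # input gate
    f = sigmoid(pre_activation("f"))    # forget gate
    o = sigmoid(pre_activation("o"))    # output gate
    g = math.tanh(pre_activation("g"))  # candidate cell value
    c_new = f * c + i * g               # additive cell-state update
    h_new = o * math.tanh(c_new)
    return h_new, c_new


# Run both units over a short made-up input sequence.
params = {gate: (0.5, 0.5, 0.0) for gate in ("i", "f", "o", "g")}
h_rnn, h_lstm, c_lstm = 0.0, 0.0, 0.0
for x_t in [1.0, 0.5, -0.5]:
    h_rnn = rnn_step(x_t, h_rnn, 0.5, 0.5, 0.0)
    h_lstm, c_lstm = lstm_step(x_t, h_lstm, c_lstm, params)
print(h_rnn, h_lstm, c_lstm)
```

The additive update of the cell state `c` is what lets an LSTM retain information over many steps, whereas the vanilla RNN passes its whole state through `tanh` at every step.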

