Link to the Dataset Available: http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz
-
Notifications
You must be signed in to change notification settings - Fork 0
Developed a speech recognition system using TDNN, preprocessing audio, extracting MFCC features, and training the model. Fine-tuning with augmented data (19,000 rows) improved accuracy from 9% to 80% training and 40% validation. Data augmentation proved crucial for enhancing model performance and generalization. Still working to increase the acc.
Kavayk29/Speech-Recognition-using-TDNN-and-Data-Augmentation
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Developed a speech recognition system using TDNN, preprocessing audio, extracting MFCC features, and training the model. Fine-tuning with augmented data (19,000 rows) improved accuracy from 9% to 80% training and 40% validation. Data augmentation proved crucial for enhancing model performance and generalization. Still working to increase the acc.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published