Bidirectional LSTM network for speech emotion recognition.
Updated Mar 31, 2019 - Python
Given a voice recording that contains several voices and possibly music, this project counts the number of distinct speakers and the number of women and men who speak.
An algorithm that identifies human voice and segments it automatically. The result is compared against manual segmentation data, and an accuracy report is generated based on match rate, insertion rate, and omission rate.
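The description above does not define its three rates, so the following is only a minimal sketch of how such a report might be computed, not the repository's actual method. It assumes segments are `(start, end)` pairs in seconds, compares them frame by frame at a 10 ms step, and uses hypothetical definitions: match and omission rates are taken relative to the manually labelled voice frames, and the insertion rate relative to the automatically detected voice frames. The names `to_frames` and `segmentation_report` are illustrative.

```python
def to_frames(segments, n_frames, step=0.01):
    """Rasterize (start, end) segments in seconds into a per-frame voice mask."""
    mask = [False] * n_frames
    for start, end in segments:
        first = max(0, int(round(start / step)))
        last = min(n_frames, int(round(end / step)))
        for i in range(first, last):
            mask[i] = True
    return mask


def segmentation_report(reference, predicted, duration, step=0.01):
    """Compare automatic segments against manual ones, frame by frame.

    Assumed definitions (not taken from the original project):
      match_rate     = reference voice frames also detected / reference voice frames
      omission_rate  = reference voice frames missed        / reference voice frames
      insertion_rate = detected frames outside reference    / detected voice frames
    """
    n_frames = int(round(duration / step))
    ref = to_frames(reference, n_frames, step)
    hyp = to_frames(predicted, n_frames, step)

    matched = sum(r and h for r, h in zip(ref, hyp))
    omitted = sum(r and not h for r, h in zip(ref, hyp))
    inserted = sum(h and not r for r, h in zip(ref, hyp))

    ref_total = sum(ref)
    hyp_total = sum(hyp)
    return {
        "match_rate": matched / ref_total if ref_total else 0.0,
        "omission_rate": omitted / ref_total if ref_total else 0.0,
        "insertion_rate": inserted / hyp_total if hyp_total else 0.0,
    }


# Example: manual labels versus a detector that starts late and ends late.
manual = [(0.0, 1.0), (2.0, 3.0)]
automatic = [(0.5, 1.0), (2.0, 3.5)]
report = segmentation_report(manual, automatic, duration=4.0)
```

Frame-level comparison is chosen here because it avoids having to decide which automatic segment corresponds to which manual one; a segment-level report with a boundary tolerance would be an equally plausible reading of the description.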