This research project undertakes a comprehensive analysis of speech emotion recognition. Harmonizing the datasets involves reconciling disparate naming conventions and emotion label sets to establish a standardized format for cohesive analysis. The resulting distribution of emotions is well balanced, apart from the surprise, calm, and neutral classes. The subsequent focus is feature extraction: raw audio waveforms, frequency spectra (FFT), short-time Fourier transform (STFT) spectrograms, and mel spectrograms. Three distinct convolutional neural network (CNN) models (a Mel Spectrogram CNN, an MFCC CNN, and a Mel Spectrogram CRNN) are developed and evaluated. Results indicate 72% accuracy in classifying emotions, with mel spectrogram and MFCC features displaying complementary strengths. The study concludes by suggesting avenues for improvement: feature fusion, specialized deep learning architectures, and addressing the data imbalance. Future work involves real-life integration, applying sentiment analysis to predict stock market effects based on emotion-laden communications in S&P 500 earnings calls.
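As a rough illustration of the feature-extraction step described above (not the repository's actual code), the sketch below computes an STFT magnitude spectrogram and a mel-scaled, log-compressed version of it using only NumPy. The function names, filter-bank size, and the synthetic test tone standing in for a speech clip are all assumptions.

```python
import numpy as np

def stft_spectrogram(y, n_fft=512, hop=128):
    # Frame the signal, apply a Hann window, take the magnitude of the FFT.
    window = np.hanning(n_fft)
    n_frames = 1 + (len(y) - n_fft) // hop
    frames = np.stack([y[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)).T  # shape: (n_fft//2 + 1, n_frames)

def mel_filterbank(sr, n_fft, n_mels=40):
    # Triangular filters spaced evenly on the mel scale.
    def hz_to_mel(f): return 2595.0 * np.log10(1.0 + f / 700.0)
    def mel_to_hz(m): return 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):            # rising slope
            fb[m - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):           # falling slope
            fb[m - 1, k] = (right - k) / max(right - center, 1)
    return fb

# Synthetic 1-second, 440 Hz tone standing in for a speech clip (assumption).
sr = 16000
t = np.arange(sr) / sr
y = np.sin(2 * np.pi * 440 * t)

spec = stft_spectrogram(y)                   # STFT magnitude spectrogram
mel = mel_filterbank(sr, 512) @ spec ** 2    # mel-scaled power spectrogram
log_mel = 10 * np.log10(mel + 1e-10)         # log compression (dB-style mel spectrogram)
print(spec.shape, log_mel.shape)
```

In practice a library such as librosa would replace this hand-rolled version; the point is only to show how the STFT and mel-spectrogram features named above relate to each other.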
I harmonized diverse speech emotion datasets and developed convolutional neural network (CNN) models, including a Mel Spectrogram CNN, an MFCC CNN, and a Mel Spectrogram CRNN, achieving 72% accuracy in emotion classification.
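A minimal Keras sketch of what a mel-spectrogram CNN classifier of the kind listed above might look like. The layer sizes, the input shape (40 mel bands by 122 frames), and the seven-class output are illustrative assumptions, not the repository's actual architecture.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 7  # assumed number of emotion labels after harmonization

def build_mel_cnn(input_shape=(40, 122, 1), num_classes=NUM_CLASSES):
    # A small stack of conv/pool blocks, the usual shape of a spectrogram CNN.
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(32, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(2),
        layers.Conv2D(64, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(2),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dropout(0.3),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

model = build_mel_cnn()
# Forward pass on a dummy batch to check the output shape.
probs = model.predict(np.zeros((1, 40, 122, 1), dtype=np.float32))
print(probs.shape)  # (1, 7)
```

The CRNN variant would replace the `Flatten`/`Dense` head with a recurrent layer over the time axis; training on the harmonized features, not this toy forward pass, is what produces the reported 72% accuracy.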
arzuisiktopbas/Speech-Emotion-Recognition