1,440 audio files (.wav), i.e. speech files, from 24 actors that are categorized into 8 separate emotions.
-
Updated
Feb 11, 2019 - Python
1,440 audio files (.wav), i.e. speech files, from 24 actors that are categorized into 8 separate emotions.
Classifying Audio to Emotion
Use GANs to generate spectrogram of speech. The generated spectrogram will be conditioned on emotion
This repository is an import of the original repository that contains some of the models we had tested on the RAVDESS and TESS dataset for our research on Speech Emotion Recognition Models.
Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transformers, and everything in between
Pytorch speech emotion recognition for RAVDESS dataset with CNN.
Collaborated to create a Machine Learning model trained and tested with a Random Forest model to predict primary emotion based on input audio file. Data cleaned and trained in a Jupyter Notebook using Pandas and Librosa. Results visualized using Pandas, Tableau, and JavaScript functions with bootstrap in a dynamic HTML website.
This repository contains the source code for my final year project for my undergraduate degree in MTU.
Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"
CNN-LSTM based SER model using RAVDESS database
In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound files and stacking the resulting matrices in a one-dimensional array by taking the mean values along the time axis. Then this array is fed into a 1-D CNN model as input.
The SER model is capable of detecting eight different male/female emotions from audio speeches using MLP and RAVDESS model
Emotion is an intuitive feeling which can be determined from any person’s circumstances and surroundings. But in this project, we tried to identify the emotional state of a person using his voice as input.
[RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈
A convolutional neural network trained to classify emotions in singing voices.
Add a description, image, and links to the ravdess topic page so that developers can more easily learn about it.
To associate your repository with the ravdess topic, visit your repo's landing page and select "manage topics."