Emotion-Detection-from-Speech

Given a voice detecting emotion it holds. Such as - Angry, Happy, Sad, Disgust, Neutral, Surprise & Pleasant. Etc...

dhawal& Manish Mishra helped me in many ways. Heartily Thanks to both of them.

File name & Download

The speeches used for this project had some specific naming format that one need to put attention on. Our file has OAF_bite_neutral this name format. Since we have 2 speakers old and young, here this voice was uttered by old speaker and this voice YAF_back_fear a young speaker. One sould visit the website of data set to understand it well enough. https://tspace.library.utoronto.ca/handle/1807/24487

Since there is no way to download whole data set at one click, you have to click at least 3,000 times if you will go by conventional way of downloading. Don't waste your time in downloading one and one file. Just add an extention Download Master for Chrome users. it will help you to collect whole data in few clicks. of course it is a time consuming process. Very soon you can get whole data on my online drive. Be patient till then.

Librosa was used to extract the feature out of a given voice.

def extract_feature(file_name):
    X, sample_rate = librosa.load(file_name)
    stft = np.abs(librosa.stft(X))
    mfccs = np.mean(librosa.feature.mfcc(y=X, sr=sample_rate, n_mfcc=40).T, axis=0)
    chroma = np.mean(librosa.feature.chroma_stft(S=stft, sr=sample_rate).T, axis=0)
    mel = np.mean(librosa.feature.melspectrogram(X, sr=sample_rate).T, axis=0)
    contrast = np.mean(librosa.feature.spectral_contrast(S=stft, sr=sample_rate).T, axis=0)
    tonnetz = np.mean(librosa.feature.tonnetz(y=librosa.effects.harmonic(X),
                                              sr=sample_rate).T, axis=0)
    return mfccs, chroma, mel, contrast, tonnetz

the function take the files and return 5 features. Namely, MFCC,Chroma,Mel,Contrast & Tonnetz

thereafter, wee need to parse the files as follows

def parse_audio_files(path):
    features, labels = np.empty((0, 193)), np.empty(0)
    labels = []
    for fn in glob.glob(path):
        try:
            mfccs, chroma, mel, contrast, tonnetz = extract_feature(fn)
        except Exception as e:
            print("Error encountered while parsing file: ", fn)
            continue
        ext_features = np.hstack([mfccs, chroma, mel, contrast, tonnetz])
        features = np.vstack([features, ext_features])
        labels = np.append(labels, fn.split("_")[3].split(".")[0])
        print(fn)
    return np.array(features), np.array(labels)

For training

put audio files in folder name train_sounds

run the training script

it will save the trained model as xyz_Model_protocol2.sav

For prediction

put input files (for which you want to predict emotion) in folder name test_sounds

run the test script

here is what we achieved when training on Decision tree

During test

The best accuracy I got was from Keras (Neural Network) and with ExtraTreeClassifier

While training on `ExtraTreeClassifier`

During Test

It's preety good..!! Isn't it ??

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
Report		Report
results/images		results/images
speech_data		speech_data
Decision Tree Project.ipynb		Decision Tree Project.ipynb
ExtraTreeClassifier.ipynb		ExtraTreeClassifier.ipynb
KNN Emotion.ipynb		KNN Emotion.ipynb
Keras Emotion Copy1.ipynb		Keras Emotion Copy1.ipynb
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emotion-Detection-from-Speech

File name & Download

Librosa was used to extract the feature out of a given voice.

For training

For prediction

During test

While training on `ExtraTreeClassifier`

During Test

About

Releases

Packages

Languages

License

nirajdevpandey/Emotion-Detection-from-Speech

Folders and files

Latest commit

History

Repository files navigation

Emotion-Detection-from-Speech

File name & Download

Librosa was used to extract the feature out of a given voice.

For training

For prediction

During test

While training on ExtraTreeClassifier

During Test

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

While training on `ExtraTreeClassifier`

Packages