SPARTA-benchmark

SPARTA stands for Speech Profiling for ARabic TAlk. The purpose of this project is to identify the speaker’s gender, dialect, and emotion when speaking from a given utterance.

This research profiles the speaker based on their voice. From a given speech, our model predicts gender, emotion, and dialect in real-time.

Emotion [Sad, Happy, Angry, Surprise, Questioning, and Neutral].
Gender [Male and Female] in the Arabic language (as many other languages).
Dialect [Egypt, MSA, Levant, Gulf, North Africa].

Datasets

Qatar Computing Research Institute QCRI
King Saud University Emotions KSUEmotions
Arabic Natural Audio Dataset ANAD
Spoken Arabic Regional Archive SARA
King Saud University Arabic Speech Database KSU
Multi Dialect Arabic Speech MDAS

Data Preprocessing

To reproduce the dataset collected please follow the instructions:

Download Datasets from the links above.
Run collect_DATASETNAME.py for all datasets.
Make the Train/Dev/Test split by running the preprocessing_scripts/split.py
Extract features by running extract_FEATURENAME.py for all features

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
preprocessing_scripts		preprocessing_scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SPARTA-benchmark

Datasets

Data Preprocessing

About

Releases

Packages

Languages

mawdoo3/sparta-benchmark

Folders and files

Latest commit

History

Repository files navigation

SPARTA-benchmark

Datasets

Data Preprocessing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages