SPARTA stands for Speech Profiling for ARabic TAlk. The purpose of this project is to identify the speaker’s gender, dialect, and emotion when speaking from a given utterance.
This research profiles the speaker based on their voice. From a given speech, our model predicts gender, emotion, and dialect in real-time.
- Emotion [Sad, Happy, Angry, Surprise, Questioning, and Neutral].
- Gender [Male and Female] in the Arabic language (as many other languages).
- Dialect [Egypt, MSA, Levant, Gulf, North Africa].
- Qatar Computing Research Institute QCRI
- King Saud University Emotions KSUEmotions
- Arabic Natural Audio Dataset ANAD
- Spoken Arabic Regional Archive SARA
- King Saud University Arabic Speech Database KSU
- Multi Dialect Arabic Speech MDAS
To reproduce the dataset collected please follow the instructions:
- Download Datasets from the links above.
- Run
collect_DATASETNAME.py
for all datasets. - Make the Train/Dev/Test split by running the
preprocessing_scripts/split.py
- Extract features by running
extract_FEATURENAME.py
for all features