MF-Saudi: A Multimodal Framework for Bridging the Gap Between Audio and Textual Data for Saudi Dialect Detection
This is the source code for the paper: MF-Saudi: A Multimodal Framework for Bridging the Gap Between Audio and Textual Data for Saudi Dialect Detection, by Raed Alharbi, published in the Journal of King Saud University - Computer and Information Sciences (2024).
The repository contains the following files (a hedged usage sketch follows the list):
- `data_processing.py`: functions for loading and preprocessing data.
- `models.py`: the main models from the paper.
- `model_training.py`: the main training loop for the model.
- `plotting.py`: functions for plotting training metrics.
- `feature_extraction.py`: functions for extracting features from text and audio.
- `utils.py`: utility functions such as learning rate scheduling and downloading pre-trained models.
- `example.ipynb`: an example notebook for running the training script from Jupyter.
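As a rough orientation, the modules are meant to be used together; the sketch below shows one plausible wiring. Every function name in it is an assumption for illustration, not the actual API of this repository; consult `example.ipynb` for the real entry points.

```python
# Hypothetical end-to-end sketch; all imported function names are
# assumptions -- see example.ipynb for the authoritative workflow.
from data_processing import load_data            # assumed CSV loader
from feature_extraction import extract_features  # assumed text/audio featurizer
from models import build_model                   # assumed model constructor
from model_training import train                 # assumed training loop
from plotting import plot_metrics                # assumed metrics plotter

train_df = load_data("train.csv")
valid_df = load_data("valid.csv")

train_feats = extract_features(train_df)
valid_feats = extract_features(valid_df)

model = build_model()
history = train(model, train_feats, valid_feats)
plot_metrics(history)
```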
To run the code, you will need the following packages:
- tensorflow_io
- pytorch-pretrained-bert
- transformers==3.5.1
- sentencepiece
- keras-preprocessing
- librosa==0.9.2
- torchaudio
You can install the required packages using:
pip install -r requirements.txt
To train the model, run the following command (adjust the paths to your environment):
python /content/drive/MyDrive/journal_SADA/github_code/main.py --train_path /content/drive/MyDrive/journal_SADA/train.csv --valid_path /content/drive/MyDrive/journal_SADA/valid.csv --base /content/drive/MyDrive/journal_SADA/batch1/batch_1
Alternatively, if you prefer to use a Jupyter notebook, see `example.ipynb`.
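The command above suggests that `main.py` accepts three path arguments. The sketch below shows a minimal argparse setup consistent with those flags; this is an assumption about the script's interface (the actual `main.py` may define additional options), with illustrative help strings.

```python
import argparse

def parse_args():
    parser = argparse.ArgumentParser(
        description="Train the MF-Saudi multimodal dialect-detection model."
    )
    # The three flags below appear in the training command; the help
    # strings are illustrative assumptions.
    parser.add_argument("--train_path", required=True,
                        help="Path to the training CSV (e.g. train.csv).")
    parser.add_argument("--valid_path", required=True,
                        help="Path to the validation CSV (e.g. valid.csv).")
    parser.add_argument("--base", required=True,
                        help="Base path of the audio batch files (e.g. batch1/batch_1).")
    return parser.parse_args()

if __name__ == "__main__":
    args = parse_args()
    print(args.train_path, args.valid_path, args.base)
```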
Dataset

You can download the dataset used in the experiments from: https://www.kaggle.com/datasets/sdaiancai/sada2022
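Once downloaded, the CSV splits can be inspected and the referenced audio loaded with the packages listed above. A minimal sketch follows; the column name used for the audio path is a hypothetical placeholder, since the SADA 2022 schema should be checked by inspecting the CSV first.

```python
import pandas as pd
import librosa  # pinned to 0.9.2 in the requirements

train_df = pd.read_csv("train.csv")
print(train_df.head())  # inspect the actual column names

# "FileName" is a hypothetical column name; replace it with the real
# audio-path column from the SADA 2022 CSV.
audio_path = train_df.iloc[0]["FileName"]
waveform, sr = librosa.load(audio_path, sr=16000)  # 16 kHz resampling, a common choice
print(waveform.shape, sr)
```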