Spoken Language Identification Using Deep Learning

Bachelor Thesis Project

A Deep Learning-Based Approach for Spoken Language Identification

Dataset

Kaggle's spoken language identification with 73080 samples from English, Spanish, and German languages.
ShEMO a large-scale validated database for Persian speech emotion detection

Feature Extraction

Mel Spectrogram is used for feature extraction and results are saved into .npy files. The model reads them using a custom data generator.

Architecture

This project implemented two different architectures CNN and CRNN.

Website

There is also a website!

Presentation video

You can watch presentation in Persian Here.

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
explore		explore
images		images
utils		utils
web		web
.gitignore		.gitignore
CNN Model.ipynb		CNN Model.ipynb
CRNN.ipynb		CRNN.ipynb
README.md		README.md
Results.ipynb		Results.ipynb
Small Model.ipynb		Small Model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spoken Language Identification Using Deep Learning

Bachelor Thesis Project

Dataset

Feature Extraction

Architecture

Website

Presentation video

About

Releases

Packages

Languages

Mohammadreza-mz/Spoken-Language-Identification

Folders and files

Latest commit

History

Repository files navigation

Spoken Language Identification Using Deep Learning

Bachelor Thesis Project

Dataset

Feature Extraction

Architecture

Website

Presentation video

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages