Dysarthric Speech Recognition

Project Overview

This project classifies speech as dysarthric or non-dysarthric using deep learning. The dataset contains audio samples from dysarthric and non-dysarthric speakers, and the goal is a model that accurately distinguishes between the two classes.

Features

Speech waveplot visualization
Spectrogram analysis
Zero-crossing rate (ZCR) calculation and visualization
Feature extraction using MFCC
Deep learning model implementation using CNN
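
As an illustration of the ZCR feature listed above, here is a minimal sketch of a frame-wise zero-crossing rate in plain Python. It is not taken from the project's main.m. The function names, the frame and hop sizes, and the convention of treating a sample equal to zero as non-negative are all assumptions made for the example.

```python
def zero_crossing_rate(frame):
    """Return the fraction of adjacent sample pairs whose signs differ.

    Assumption: a sample equal to zero counts as non-negative.
    """
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1
        for prev, cur in zip(frame, frame[1:])
        if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(frame) - 1)


def framewise_zcr(signal, frame_length=2048, hop_length=512):
    """Return one ZCR value per full frame of `signal`.

    Frames start every `hop_length` samples; a trailing partial frame
    is dropped. The default sizes are illustrative, not project settings.
    """
    return [
        zero_crossing_rate(signal[start:start + frame_length])
        for start in range(0, len(signal) - frame_length + 1, hop_length)
    ]
```

A signal that alternates sign on every sample gives a rate of 1.0 per frame, while a constant-sign signal gives 0.0. Plotting the per-frame values over time would give a ZCR curve of the kind the feature list mentions.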

Data Information

The dataset contains 2000 audio samples divided into four categories:

Dysarthric females: 500 samples
Dysarthric males: 500 samples
Non-dysarthric females: 500 samples
Non-dysarthric males: 500 samples

Data CSV Information

The data.csv file contains the following columns:

filename: Path to the audio file
is_dysarthria: Class label for the sample, either dysarthria or non_dysarthria
gender: Gender of the speaker (male or female)
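
The column layout above can be checked with a short script before training. The following is a sketch in plain Python, not part of the project's code. It assumes data.csv is comma-delimited and has a header row with exactly the column names listed above; the helper names load_rows and count_groups are hypothetical.

```python
import csv
from collections import Counter


def load_rows(csv_path):
    """Read the CSV into a list of dicts keyed by the header row."""
    with open(csv_path, newline="") as handle:
        return list(csv.DictReader(handle))


def count_groups(rows):
    """Count samples per (is_dysarthria, gender) pair."""
    return Counter((row["is_dysarthria"], row["gender"]) for row in rows)


if __name__ == "__main__":
    # Assumed location: data.csv in the current working directory.
    for group, total in sorted(count_groups(load_rows("data.csv")).items()):
        print(group, total)
```

If the file matches the dataset description, each of the four label and gender combinations should report 500 samples. A different count points to missing or mislabeled rows.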

Usage

Clone the repository.
Ensure the dataset is located in the specified directory.
Run main.m to load data, visualize waveplots, spectrograms, and ZCRs, and train the CNN model.

Work in Progress

This project is a work in progress, and additional features and improvements are being actively developed.

License

This project is licensed under the MIT License.
