Speaker_Identification

The task is to determine the identity of a person by his/her voice. This is known as Automatic Speaker Recognition and falls in the category of biometric security systems, as it is related to human characteristics or individuality. The application is implemented in Python and is composed of two distinct phases, the training phase and testing phase. They can each be assigned to a task, namely the enroll and recognition task.

We use MFCC to extract the features from the speech signal, Mel-frequency cepstral coefficients (MFCCs) method is one of the most popular strategies for feature extraction in both audio and speech signal. We strive to make the correct identification of the speaker using the Gaussian mixture model (GMM). The features extracted are fed to GMM-based approaches that have the purpose to create speaker models for identification. To evaluate the performance of the speaker identification system we use a Confusion Matrix. This is a useful machine learning method that allows us to measure recall, precision and accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
models		models
test		test
train		train
README.md		README.md
Speaker Identification - TCSV.pdf		Speaker Identification - TCSV.pdf
accuracy.py		accuracy.py
feature_extraction.py		feature_extraction.py
parameters.py		parameters.py
refactAll.py		refactAll.py
refactOne.py		refactOne.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker_Identification

About

Releases

Packages

Languages

spanmartina/Speaker_Identification

Folders and files

Latest commit

History

Repository files navigation

Speaker_Identification

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages