Skip to content

The "AI Voice Recognize" project aims to develop a system capable of distinguishing between real human voices and spoofed audio files. Utilizing advanced techniques in machine learning and signal processing, our objective is to create a robust model that can effectively detect audio spoofing attempts.

Notifications You must be signed in to change notification settings

AshenPumpkin/AI-Voice-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI Voice Recognize (AVR) 🎙️

Usage Instructions

Download and Install:
    Download the latest release
    Extract the entirety of the release to a directory on your machine
    Simply run the .exe file to launch the application.

Using the Application:
    Open the executable file.
    Use the graphical interface to upload audio files.
    The system will process the audio and display whether it is genuine or spoofed.

Using the Library:
    Download the .tar.gz file or input "pip install AVR" into your CMD
    in a .py or .ipynb file add "import AVR"
    now you can use AVR.query_function() with the path to your file as input. and the function will return the prediction value, True for a bona-fide voice sample, and False for a spoofed voice.
    Remember to use AVR.flush() after use to free up the model memory from your system.

Overview

The AI Voice Recognize (AVR) project, developed by Guy Ben Ari and Ynon Friedman from Afeka College of Engineering, focuses on detecting spoofed audio files and distinguishing them from genuine human voices using advanced machine learning techniques. Dataset

ASVspoof2019 dataset is used, featuring a diverse range of real and spoofed audio files for model training and evaluation. Approach

Architecture: Utilizes LSTM networks for sequential audio data and Conv2D layers for feature extraction from MFCC images.
Model: Features are concatenated and classified using fully connected layers.

Development Environment

Prototyping and Training: Conducted using JupyterLab Notebook.
Version Control: Managed with Git.

Integration

The system is packaged as an executable file for ease of use, providing both real-time and batch processing capabilities. Testing Breakdown

Tests Conducted: Includes UI functionality, audio processing accuracy, and model integration.
Results: Includes test scripts, issues identified, and corrective actions taken.

Limitations and Solutions

Data Inconsistency: Resolved through normalization techniques.
Integration Issues: Fixed by integrating with HuggingFace.
Resource Constraints: Addressed by upgrading VRAM.
Performance: Enhanced with CUDA.
UI and Security: Improved based on feedback and added encryption.

Contact

For inquiries or collaboration:

Guy Ben Ari: gbenari2@gmail.com
Ynon Friedman: ynonfridman@gmaiL.com

About

The "AI Voice Recognize" project aims to develop a system capable of distinguishing between real human voices and spoofed audio files. Utilizing advanced techniques in machine learning and signal processing, our objective is to create a robust model that can effectively detect audio spoofing attempts.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages