AudioVision

AudioVision is a web app that supports people with hearing impairments by converting audio files into text and visual representations. It transcribes uploaded audio, renders the sound wave as a graph, and uses AI to summarize and analyze the transcribed content.

Access the App

You can try the app online: click here.

Features

Audio Transcription: Upload audio files in various formats, such as .ogg and .wav, to receive an accurate text transcription of the spoken content.
Waveform Visualization: View a graphical representation of sound waves, offering a visual way to understand the characteristics of the audio.
Content Analysis: Use AI to interpret the transcription and generate summaries and contextual insights that make the content easier to follow.
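
To illustrate what the waveform view has to compute, here is a minimal sketch that uses only the Python standard library. It is a stand-in for illustration, not the app's Librosa and Matplotlib code: it reads a mono 16-bit WAV file and reduces it to one peak amplitude per bucket, which is roughly the shape a waveform plot draws. The helper names (write_test_tone, read_samples, peak_envelope) and the bucket count are assumptions made for this example.

```python
import math
import struct
import wave


def write_test_tone(path, seconds=0.5, rate=8000, freq=440.0):
    """Write a mono 16-bit PCM sine tone so the sketch has something to read."""
    n = int(seconds * rate)
    frames = b"".join(
        struct.pack("<h", int(32767 * 0.5 * math.sin(2 * math.pi * freq * i / rate)))
        for i in range(n)
    )
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wf.setframerate(rate)
        wf.writeframes(frames)


def read_samples(path):
    """Return (samples scaled to [-1, 1], sample_rate) for a mono 16-bit WAV."""
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles mono 16-bit PCM")
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    ints = struct.unpack("<%dh" % (len(raw) // 2), raw)
    return [s / 32768.0 for s in ints], rate


def peak_envelope(samples, buckets=100):
    """Reduce samples to one peak amplitude per bucket (hypothetical helper)."""
    if not samples:
        return []
    size = max(1, math.ceil(len(samples) / buckets))
    return [
        max(abs(s) for s in samples[i:i + size])
        for i in range(0, len(samples), size)
    ]
```

Calling write_test_tone("tone.wav"), then read_samples("tone.wav"), then peak_envelope(samples) yields a short list of peaks that could be drawn as vertical bars. A real upload in another format, such as .ogg, would need a decoder first.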

Technologies Used

Flask for the application backend
Librosa and Matplotlib for waveform visualizations
SpeechRecognition for audio transcription
Google Generative AI for intelligent summaries and contextual analysis of audio content
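
The sketch below shows one way the libraries listed above could be called. It is not taken from main.py; the function names, the choice of recognize_google, the prompt wording, and the model name "gemini-1.5-flash" are all assumptions. Third-party imports sit inside the functions so the module can be loaded even when a library is missing.

```python
import os

# SpeechRecognition's AudioFile is documented to read WAV, AIFF, and FLAC.
# Other formats such as .ogg would need converting before transcription.
DIRECT_FORMATS = {".wav", ".aiff", ".aif", ".flac"}


def needs_conversion(filename):
    """Return True when the file extension is not one AudioFile reads directly."""
    ext = os.path.splitext(filename)[1].lower()
    return ext not in DIRECT_FORMATS


def transcribe(path, language="en-US"):
    """Transcribe an audio file with SpeechRecognition (assumed backend: Google Web Speech)."""
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)  # read the whole file into memory
    try:
        return recognizer.recognize_google(audio, language=language)
    except sr.UnknownValueError:
        return ""  # speech could not be understood
    except sr.RequestError as exc:
        raise RuntimeError("speech service unavailable") from exc


def save_waveform_png(path, out_png):
    """Plot the waveform with Librosa and Matplotlib and save it as a PNG."""
    import matplotlib
    matplotlib.use("Agg")  # render off-screen, as a web server has no display
    import matplotlib.pyplot as plt
    import librosa
    import librosa.display

    y, rate = librosa.load(path, sr=None)  # sr=None keeps the native sample rate
    fig, ax = plt.subplots(figsize=(10, 3))
    librosa.display.waveshow(y, sr=rate, ax=ax)  # waveshow requires librosa >= 0.9
    ax.set_xlabel("Time (s)")
    ax.set_ylabel("Amplitude")
    fig.savefig(out_png, bbox_inches="tight")
    plt.close(fig)
    return out_png


def build_prompt(transcript):
    """Build the analysis prompt (wording is hypothetical)."""
    return (
        "Summarize the following audio transcription and explain its context "
        "in plain language:\n\n" + transcript
    )


def summarize(transcript, api_key, model_name="gemini-1.5-flash"):
    """Ask Google Generative AI for a summary; the model name is an assumption."""
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(model_name)
    return model.generate_content(build_prompt(transcript)).text
```

Keeping each library behind its own small function makes it possible to swap the recognizer or the model without touching the rest of the app.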

How to Use

Upload an audio file on the main page.
The app converts the audio to text, displays the transcription, and shows a representative waveform.
Optionally, view an AI-driven analysis of the content for a broader understanding of the transcribed audio.
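
The steps above can be sketched as a small Flask handler. This is an illustrative outline rather than the app's actual code: the /upload route, the "audio" form field, and the process_upload and create_app names are assumptions. The transcription, waveform, and analysis steps are passed in as plain functions, which keeps the flow easy to follow and to test.

```python
import os
import tempfile


def process_upload(path, transcribe, render_waveform, analyze=None):
    """Run the steps in order: transcribe, render the waveform, optionally analyze."""
    transcript = transcribe(path)
    result = {
        "transcript": transcript,
        "waveform": render_waveform(path),
        "analysis": None,
    }
    # The AI analysis is optional and is skipped when there is nothing to analyze.
    if analyze is not None and transcript.strip():
        result["analysis"] = analyze(transcript)
    return result


def create_app(transcribe, render_waveform, analyze=None):
    """Build a Flask app exposing a hypothetical POST /upload endpoint."""
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.route("/upload", methods=["POST"])
    def upload():
        file = request.files.get("audio")  # form field name is an assumption
        if file is None or file.filename == "":
            return jsonify(error="no audio file provided"), 400
        suffix = os.path.splitext(file.filename)[1]
        # Save the upload to a temporary file so the audio libraries can open it.
        with tempfile.NamedTemporaryFile(suffix=suffix, delete=False) as tmp:
            file.save(tmp)
        try:
            return jsonify(process_upload(tmp.name, transcribe, render_waveform, analyze))
        finally:
            os.remove(tmp.name)

    return app
```

With stub functions in place of the real steps, process_upload can be exercised without any audio libraries installed, for example process_upload("clip.wav", lambda p: "hello", lambda p: "wave.png").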

AudioVision was created to make auditory information more accessible through visual and textual formats, fostering inclusion and accessibility for individuals with hearing loss.

vxncius-dev/AudioVision
