Skip to content

AudioVision is a web application that helps individuals with hearing impairments by transcribing and analyzing audio files, converting them to text and visual waveforms for easier comprehension of spoken content.

Notifications You must be signed in to change notification settings

vxncius-dev/AudioVision

Folders and files

NameName
Last commit message
Last commit date

Latest commit

d6ab8b2 · Nov 11, 2024

History

6 Commits
Nov 1, 2024
Nov 1, 2024
Nov 11, 2024
Oct 31, 2024
Oct 31, 2024

Repository files navigation

AudioVision

AudioVision is a web app designed to support individuals with hearing impairments by converting audio files into text and visual representations. Using audio processing and AI, the application not only transcribes audio but also generates graphical sound wave visualizations and provides insightful analyses of the transcribed content.

Access the App

You can access my app and test it at click here.

Captura de Tela (1)

Features

  • Audio Transcription: Upload audio files in various formats, such as .ogg and .wav, to receive an accurate text transcription of the spoken content.
  • Waveform Visualization: View a graphical representation of sound waves, offering a visual way to understand the characteristics of the audio.
  • Content Analysis: Leverage AI to interpret the transcription, generating summaries and contextual insights tailored for enhanced clarity and accessibility.

Technologies Used

  • Flask for the application backend
  • Librosa and Matplotlib for waveform visualizations
  • SpeechRecognition for audio transcription
  • Google Generative AI for intelligent summaries and contextual analysis of audio content

How to Use

  1. Upload an audio file on the main page.
  2. The app converts the audio to text, displays the transcription, and shows a representative waveform.
  3. Optionally, view an AI-driven analysis of the content for broader understanding of the transcribed audio.

AudioVision was created to make auditory information more accessible through visual and textual formats, fostering inclusion and accessibility for individuals with hearing loss.

About

AudioVision is a web application that helps individuals with hearing impairments by transcribing and analyzing audio files, converting them to text and visual waveforms for easier comprehension of spoken content.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published