WhisperBite is a tool built with Gradio that processes audio and video files to perform speaker diarization (identifying who spoke when) and transcription using OpenAI's Whisper model. It can handle various inputs, separate vocals, identify non-speech sounds, extract individual words, and refine results with a second pass.
-
Updated
May 11, 2025 - Python