Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
-
Updated
Sep 17, 2024 - Svelte
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.
Persian ASR dataset
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and accessibility. Easy to install and use on popular browsers.
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
core shell functions building blocks for advanced AI pipelines
A SwiftUI App For People Who Need To Take Down Important Information Quickly.
Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and sends to flask api for summarization.
Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.
AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.
Transcribe Audio to Text with node.js using the Whisper model from OpenAI.
An efficient desktop application for transcribing audio files into text using Vosk speech recognition.
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."