GitHub - shubhagarwal1/speech-to-text

Frontend (HTML + JavaScript) ┌────────────────────────────────────────┐ │ User clicks "Start Recognizing" button │ └──────────────────────────┬─────────────┘ │ ▼ ┌───────────────────────────────────┐ │ Browser requests microphone access│ └───────────────────────────────────┘ │ ▼ ┌──────────────────────────────────────┐ │ Browser starts recording audio using │ │ MediaRecorder API │ └──────────────────────────────────────┘ │ ▼ ┌──────────────────────────────────────┐ │ Audio data is collected in chunks and│ │ stored temporarily │ └──────────────────────────────────────┘ │ ▼ ┌──────────────────────────────────────┐ │ Recording stops after a fixed duration│ └──────────────────────────────────────┘ │ ▼ ┌──────────────────────────────────────┐ │ Audio chunks are combined into a Blob │ │ and sent to the Flask backend │ │ using Fetch API │ └──────────────────────────────────────┘ │ ▼

Backend (Flask + Python) ┌──────────────────────────────────────────┐ │ Flask endpoint receives the audio file │ │ and saves it temporarily │ └──────────────────────────┬───────────────┘ │ ▼ ┌────────────────────────────────────┐ │ The speech_recognition library │ │ processes the audio file: │ │ - Audio file is loaded │ │ - Speech recognition is performed │ │ using Google Web Speech API │ │ - Recognized text is extracted │ └────────────────────────────────────┘ │ ▼ ┌─────────────────────────────────────┐ │ Recognized text is sent back as a │ │ JSON response │ └─────────────────────────────────────┘ │ ▼ Frontend (HTML + JavaScript) ┌──────────────────────────────────────────┐ │ JavaScript updates the content of the │ │ <p id="output"></p> element with │ │ the recognized text │ └──────────────────────────────────────────┘

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
static		static
templates		templates
.DS_Store		.DS_Store
README.md		README.md
app.py		app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

shubhagarwal1/speech-to-text

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages