A simple web application for converting audio files to text using speech recognition. Built with Flask (Python) for the backend, and HTML/CSS for the frontend.
- Upload audio files for conversion.
- Select the language for speech recognition.
- Convert speech to text with real-time feedback.
- Loading animation during the conversion process.
- Python: For the backend server and speech recognition.
- Flask: Web framework for building the server.
- HTML: Structure of the web application.
- CSS: Styling and aesthetics of the web application.
- JavaScript: Handling file uploads and form submission.
- Python 3.x
- Flask
- SpeechRecognition library
- Other Python dependencies (see
requirements.txt
)
-
Clone the repository:
git clone https://github.com/yourusername/speech-to-text-converter.git cd speech-to-text-converter
-
Install the required Python packages:
pip install -r requirements.txt
-
Run the Flask application:
python app.py
-
Open your browser and go to
http://127.0.0.1:5000/
to use the application.
- app.py: Main Python file that runs the Flask server and handles the conversion logic.
- static/styles.css: CSS file for styling the web page.
- templates/index.html: HTML file that contains the structure of the web page.
- Upload an audio file by dragging and dropping it into the upload area or by clicking to select a file.
- Select the language for speech recognition from the dropdown menu.
- Click the "Convert" button to start the conversion process.
- Wait for the conversion: A loading spinner will appear during the process. The resulting text will be displayed once the conversion is complete.
A loading spinner is displayed during the conversion process to indicate that the application is working on the task.
This project is licensed under the MIT License. See the LICENSE file for details.
For any questions or comments, you can reach me at siddiqueearhaan@gmail.com