This project uses OpenAI's Whisper model to transcribe audio files from a directory and save the results as text files in another directory.
- Python 3.x
- `whisper` library (install via `pip install openai-whisper`)
- `ffmpeg` (required by Whisper; install via your package manager)
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/speech-to-text.git
  cd speech-to-text
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
- (Optional) Create and activate a virtual environment:

  ```bash
  python3 -m venv venv
  source venv/bin/activate  # On macOS/Linux
  venv\Scripts\activate     # On Windows
  ```
- Default directories and language:

  ```bash
  python3 src/transcribe_audio.py
  ```

  This will transcribe all `.ogg` and `.wav` files from the `voice_input` directory and save the results in the `text_output` directory. The default language is Russian (`ru`).
- Custom directories and language:

  ```bash
  python3 src/transcribe_audio.py --input_dir my_input_folder --output_dir my_output_folder --language en
  ```

  This will transcribe files from `my_input_folder` and save the results in `my_output_folder`. The language is set to English (`en`).
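The flags above suggest an `argparse` setup along these lines. This is a sketch, not the actual contents of `src/transcribe_audio.py`; the defaults are assumed from the documented behaviour (`voice_input`, `text_output`, Russian):

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    # Defaults mirror the documented behaviour: voice_input -> text_output, language ru.
    parser = argparse.ArgumentParser(description="Transcribe audio files with Whisper.")
    parser.add_argument("--input_dir", default="voice_input",
                        help="Directory containing .ogg/.wav files")
    parser.add_argument("--output_dir", default="text_output",
                        help="Directory where .txt transcripts are written")
    parser.add_argument("--language", default="ru",
                        help="Language code passed to Whisper (e.g. ru, en)")
    return parser
```

Running the script with no flags then falls back to the defaults, matching the first example.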
- Default directories and language:

  ```bash
  make transcribe
  ```

  This will transcribe all `.ogg` and `.wav` files from the `voice_input` directory and save the results in the `text_output` directory. The default language is Russian (`ru`).
- Custom directories and language:

  ```bash
  make transcribe INPUT_DIR=my_input_folder OUTPUT_DIR=my_output_folder LANGUAGE=en
  ```

  This will transcribe files from `my_input_folder` and save the results in `my_output_folder`. The language is set to English (`en`).
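The `make transcribe` target presumably just forwards these variables to the Python script. A minimal sketch of such a target (variable names taken from the invocation above; recipe lines must be tab-indented):

```make
INPUT_DIR ?= voice_input
OUTPUT_DIR ?= text_output
LANGUAGE ?= ru

transcribe:
	python3 src/transcribe_audio.py --input_dir $(INPUT_DIR) --output_dir $(OUTPUT_DIR) --language $(LANGUAGE)
```

The `?=` assignments let command-line overrides like `LANGUAGE=en` take precedence over the defaults.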
- Ensure that the `voice_input` directory exists and contains valid audio files.
- The `text_output` directory will be created automatically if it doesn't exist.
- Supported languages include `ru` (Russian), `en` (English), and others. Refer to the Whisper documentation for a full list.