Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
Can be run as Online mode ( connected to internet and make api calls using Google translate - No api keys are needed )
-
Translate from and to 17 Languages :
- The translator supports various languages, including English, Spanish, French, German, Dutch , Japanese, Korean, Turkish, Arabic, Russian, Hebrew, Hindi, Italian, Portuguese, Chinese, Czech and Hungarian.
-
File Menu available options:
-
Convert Audio file to MP3
-
Extract audio from Video
-
YouTube Downloader
-
Replace Audio in Video
-
Video Text Adder
-
Voice Recorder
-
PyTranscriber (shortcut)
-
Exit
Make sure you have the following dependencies installed:
- Python >= 3.10
- Pip (Python package installer)
- FFmpeg #Should be installed manually and added to sys env path
1- Clone the repository:
git clone https://github.com/overcrash66/OpenTranslator.git
2- Navigate to folder:
cd OpenTranslator
3- Create a vitrual env:
py -3.10 -m venv venv
venv\Scripts\activate
4- Install the required Python packages using:
If you would like to use CUDA 118 - GPU:
PY -3.10
pip install torch==2.1.2+cu118 torchaudio==2.1.2+cu118 --index-url https://download.pytorch.org/whl/cu118
PY -3.12
pip install torch==2.2.1+cu118 torchaudio==2.2.1+cu118 --index-url https://download.pytorch.org/whl/cu118
Install mecab https://github.com/ikegami-yukino/mecab/releases
pip install -r requirements_Py312.txt
OR by default you use CPU only:
pip install -r requirements.txt
5- Run the Script:
python OpenTranslator.py
Or Local mode (using a set of LLM's) for audio file translation only, using a WEB UI (Gradio)
python WebUI.py
- ar-test.mp3
- cn-test.mp3
- cs-test.mp3
- de-test.mp3
- en-test.mp3
- es-test.mp3
- fr-test.mp3
- hi-test.mp3
- hu-test.mp3
- it-test.mp3
- jp-test.mp3
- ko-test.mp3
- nl-test.mp3
- pl-test.mp3
- pt-test.mp3
- ru-test.mp3
- tr-test.mp3
- You can customize the translation models and other settings by modifying the script.
This project is licensed under the GPL License - see the LICENSE file for details.
Special thanks to: XTTS_V2 whisper v3 Large Llama2-13b-Language-translate autosub gTTS