linux-speech-typer

Hint: At this point I'd recommend using ideasman42/nerd-dictation

It was developed roughly at the same time and has a number of advantages, namely:

it's faster due to using streaming capabilities
when used with shortcut activation it avoids potential problems with voice activation
supports wayland

Hint 2: Currently this needs a patched version of Uberi/speech_recognition that includes this pull request: Vosk api: allow selecting different models and automatic model download #657

Hint 3: I switched to using ideasman42/nerd-dictation and I'm not maintaining or updating this project for the time being. I'm leaving it here for reference purposes.

Linux speech-to-text tool w/ system tray

Once running, spoken text will be typed at the current cursor location.

Limitation: Currently this is supporting X-Windows (not Wayland) due to limitation in pynput.

This is based on the python package Uberi/speech_recognition. It now uses Vosk API (default) and Google Speech Recognition service as backends. Other speech recognition backends are supported by Uberi/speech_recognition but currently not implemented here.

Currently the version including this pull request is used.

Installation

I recommend using a virtual environment. However, for the system tray tk has to be installed system-wide:

sudo apt-get install python3-tk

git clone the Repository or download the zip file and unzip it. And change into the directory.

git clone https://github.com/rebootl/linux-speech-typer.git
cd speech-typer-gui/

Setup and activate the virtual environment (optional):

python3 -m venv env --system-site-packages
. env/bin/activate

Install dependencies:

pip3 install -r requirements.txt

Start:

./speech-typer-gui.py

The terminal should say listening and you should see the tray icon in the system tray.

To launch it in one step use: /path-to-installation/env/bin/python /path-to-installation/speech-typer-gui.py

Configuration

Different language tags and replacements can be configured in config.py.

Find the relevant language codes here: https://cloud.google.com/speech-to-text/docs/languages

The words in the keys dictionary (list) will be executed as corresponding keystrokes.

Please adjust it to your liking.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
icons		icons
.gitignore		.gitignore
README.md		README.md
config.py		config.py
requirements.txt		requirements.txt
screenshot.png		screenshot.png
speech-typer-gui.py		speech-typer-gui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

linux-speech-typer

Installation

Configuration

About

Languages

rebootl/linux-speech-typer

Folders and files

Latest commit

History

Repository files navigation

linux-speech-typer

Installation

Configuration

About

Topics

Resources

Stars

Watchers

Forks

Languages