Skip to content

Linux speech-to-text tool w/ system tray

Notifications You must be signed in to change notification settings

rebootl/linux-speech-typer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

linux-speech-typer

Hint: At this point I'd recommend using ideasman42/nerd-dictation

It was developed roughly at the same time and has a number of advantages, namely:

  • it's faster due to using streaming capabilities
  • when used with shortcut activation it avoids potential problems with voice activation
  • supports wayland

Hint 2: Currently this needs a patched version of Uberi/speech_recognition that includes this pull request: Vosk api: allow selecting different models and automatic model download #657

Hint 3: I switched to using ideasman42/nerd-dictation and I'm not maintaining or updating this project for the time being. I'm leaving it here for reference purposes.


Linux speech-to-text tool w/ system tray

Once running, spoken text will be typed at the current cursor location.

Limitation: Currently this is supporting X-Windows (not Wayland) due to limitation in pynput.

screenshot

This is based on the python package Uberi/speech_recognition. It now uses Vosk API (default) and Google Speech Recognition service as backends. Other speech recognition backends are supported by Uberi/speech_recognition but currently not implemented here.

Currently the version including this pull request is used.

Installation

I recommend using a virtual environment. However, for the system tray tk has to be installed system-wide:

sudo apt-get install python3-tk

git clone the Repository or download the zip file and unzip it. And change into the directory.

git clone https://github.com/rebootl/linux-speech-typer.git
cd speech-typer-gui/

Setup and activate the virtual environment (optional):

python3 -m venv env --system-site-packages
. env/bin/activate

Install dependencies:

pip3 install -r requirements.txt

Start:

./speech-typer-gui.py

The terminal should say listening and you should see the tray icon in the system tray.

To launch it in one step use: /path-to-installation/env/bin/python /path-to-installation/speech-typer-gui.py

Configuration

Different language tags and replacements can be configured in config.py.

Find the relevant language codes here: https://cloud.google.com/speech-to-text/docs/languages

The words in the keys dictionary (list) will be executed as corresponding keystrokes.

Please adjust it to your liking.

About

Linux speech-to-text tool w/ system tray

Topics

Resources

Stars

Watchers

Forks

Languages