whisper as stt engine #240

royrogermcfreely · 2023-12-01T10:28:16Z

Is your feature request related to a problem? Please describe.
no

Describe the solution you'd like
Use the Whisper STT engine within SEPIA.

Additional context
Home Assistant had its "Year of the Voice", and there you can use Whisper on an RPi 4.
I tried it in a VM and got pretty good results.

It seems there are two versions; I didn't look much into the differences.

whisper: https://github.com/openai/whisper

faster-whisper: https://github.com/SYSTRAN/faster-whisper <- this is the one Home Assistant uses

I also found a Docker image from rhasspy: https://hub.docker.com/r/rhasspy/wyoming-whisper

fquirin · 2024-04-26T09:39:42Z

Sorry for the late reply. I'm currently still taking a little break from the project, but I'm determined to resume work later this year.

As for whisper I actually have a working beta version. Unfortunately I did not finish the release before I took a break, but it was already working pretty well. As soon as I resume work, this will be the first task.

fquirin · 2024-04-26T10:41:01Z

Two additional things I should mention.

Whisper is pretty demanding for STT. The smallest model will run fast enough on a Raspberry Pi 5 for an OK user experience, but it isn't very accurate. The larger models will require better hardware and tend to hallucinate quite a bit. Nevertheless, support will come for everyone to play around with their favorite service ^^.
I've also made a PoC for Nvidia NeMo. My hope is that their models will evolve pretty quickly with better support for custom vocabulary. We'll see.
