We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What are your thoughts on real-time speech-to-text conversion as a feature in Memento?
There are relatively lightweight open-source models like Kaldi, which use little memory in real time.
There's also Whisper, which is heavier and more accurate.
There are already examples of programs like LiveCaptions and vosk api that are lightweight. Vosk has a Japanese model
There are also Google and Azure STT, but they only work online.
The text was updated successfully, but these errors were encountered:
I'm not against it. I'm not too familiar with this stuff, so it will take research.
Sorry, something went wrong.
No branches or pull requests
What are your thoughts on real-time speech-to-text conversion as a feature in Memento?
There are relatively lightweight open-source models like Kaldi, which use little memory in real time.
There's also Whisper, which is heavier and more accurate.
There are already examples of programs like LiveCaptions and vosk api that are lightweight.
Vosk has a Japanese model
There are also Google and Azure STT, but they only work online.
The text was updated successfully, but these errors were encountered: