Real-time Stream Based AI Assistant #20
Comments
output2_added_subtitle.mp4
This is how it looks for now.
@lucasjinreal This sounds really interesting. Why are you limiting this to Rust only? Do you have any thoughts on running it in the browser using WASM / ONNX?
@devilankur18 Pleased to learn that you are interested in this topic. Why limit it to Rust? There are several reasons:
Regarding the model part, there may be more than one model, so they could run through ONNX or Candle.
@lucasjinreal I was trying to find some benchmarks in Rust using Candle / Burn and tried the examples in the browser, but I am not sure the gains are significant as of today. I am pretty new to Rust ML. Do you have any benchmarks for LLM models? Also, let me know where I can be of help.
@devilankur18 I think we could take the same model, such as Qwen2B, run it with both llama.cpp and Candle, and compare the time consumed.
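A minimal sketch of such a timing comparison, using only the Rust standard library. The `bench` helper and `BenchResult` type are hypothetical names introduced here for illustration. The closure passed in is assumed to wrap one backend (for example a Candle forward pass, or a call into llama.cpp) and to return the number of tokens it generated; no real model API is shown.

```rust
use std::time::{Duration, Instant};

/// Aggregate timing for one backend (hypothetical helper type).
pub struct BenchResult {
    /// Mean wall-clock time of a single generation run.
    pub mean_latency: Duration,
    /// Total generated tokens divided by total elapsed seconds.
    pub tokens_per_sec: f64,
}

/// Calls `generate` `runs` times and measures wall-clock time.
/// `generate` is an assumed wrapper around one backend; it must
/// return how many tokens that run produced.
pub fn bench<F>(runs: u32, mut generate: F) -> BenchResult
where
    F: FnMut() -> usize,
{
    assert!(runs > 0, "need at least one run");
    let mut total = Duration::ZERO;
    let mut tokens = 0usize;
    for _ in 0..runs {
        let start = Instant::now();
        tokens += generate();
        total += start.elapsed();
    }
    let secs = total.as_secs_f64();
    BenchResult {
        mean_latency: total / runs,
        tokens_per_sec: if secs > 0.0 { tokens as f64 / secs } else { 0.0 },
    }
}
```

Calling `bench` once per backend with the same prompt and the same model weights would give directly comparable numbers. A warm-up run before measuring is probably worth adding, since the first call often includes model loading.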
Hello, this is one of my initial proposals for implementing a real-time stream-based AI assistant powered by pure Rust. Given Kokoro's significant role in text-to-speech and the rapid evolution of Large Language Models (LLMs), here are my thoughts on how to achieve this. I will present model selection and the overall architecture. If you are interested, please comment below and share how you can contribute. Together, we can build it. The ultimate goal could be to implement a terminal voice AI assistant as a prototype.
Goal
A voice-based AI assistant (agents). It will possess voice understanding ability (ASR+) and Text-to-Speech (TTS) capabilities (currently mainly in Chinese, with stream mode). In addition to its perception (hearing and speaking), it can have the following abilities:
With these three main goals, I believe this will be an assistant that lives with you, understands you, and helps you with many daily tasks.
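One possible shape for the streaming part, sketched under assumptions: the ASR, LLM, and TTS stages each run on their own thread and pass data through standard-library channels. The functions `asr`, `llm`, and `tts` are hypothetical placeholders rather than real model calls, and the three-stage ordering is an assumption, not a settled design.

```rust
use std::sync::mpsc;
use std::thread;

/// Spawns a thread that applies `f` to every item received on `rx`
/// and forwards the result, so each stage processes items as they arrive.
fn stage<I, O, F>(rx: mpsc::Receiver<I>, f: F) -> mpsc::Receiver<O>
where
    I: Send + 'static,
    O: Send + 'static,
    F: Fn(I) -> O + Send + 'static,
{
    let (tx, out) = mpsc::channel();
    thread::spawn(move || {
        for item in rx {
            if tx.send(f(item)).is_err() {
                break; // downstream receiver was dropped
            }
        }
    });
    out
}

// Hypothetical placeholders standing in for the real models.
fn asr(chunk: Vec<f32>) -> String {
    format!("heard {} samples", chunk.len())
}
fn llm(text: String) -> String {
    format!("reply to: {}", text)
}
fn tts(text: String) -> Vec<f32> {
    vec![0.0; text.len()] // silent audio, one sample per character
}

/// Feeds audio chunks through ASR -> LLM -> TTS and collects the output audio.
pub fn run_pipeline(chunks: Vec<Vec<f32>>) -> Vec<Vec<f32>> {
    let (mic_tx, mic_rx) = mpsc::channel();
    let speech = stage(stage(stage(mic_rx, asr), llm), tts);
    for chunk in chunks {
        mic_tx.send(chunk).expect("ASR stage should be running");
    }
    drop(mic_tx); // closing the input lets every stage finish
    speech.iter().collect()
}
```

Because every stage owns its thread, TTS can start speaking the first reply while ASR is still transcribing later chunks. A real implementation would likely stream partial tokens rather than whole strings.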
On the engineering side, two rules should be followed:
Checkpoints
Leave comments below and let me see your ideas.
Useful links