codezjx
😱
Overtime

Stars

asr

11 repositories

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++, 4,477 stars, 511 forks, updated Jan 29, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python, 2,380 stars, 299 forks, updated Jan 7, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python, 75,343 stars, 9,008 forks, updated Jan 4, 2025

Port of OpenAI's Whisper model in C/C++

C++, 37,247 stars, 3,847 forks, updated Jan 21, 2025

Open source real-time translation app for Android that runs locally

C++, 7,232 stars, 558 forks, updated Jan 12, 2025

Efficient Inference of Transformer models

C++, 415 stars, 43 forks, updated Aug 7, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook, 11,256 stars, 1,105 forks, updated Nov 14, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell, 14,489 stars, 5,332 forks, updated Jan 28, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python, 10,062 stars, 972 forks, updated Jan 26, 2025

Multilingual Voice Understanding Model

Python, 4,205 stars, 371 forks, updated Jan 8, 2025

SOTA Open Source TTS

Python, 18,729 stars, 1,416 forks, updated Jan 26, 2025