跨平台、多任务、高度自定义的骰系开发框架。
-
Updated
Nov 21, 2025 - Python
跨平台、多任务、高度自定义的骰系开发框架。
Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper
Analyze an audio file and count words, sentences and timestamps, filler words
A machine learning solution for classifying emotions in speech audio using hybrid deep learning (CNN-LSTM) and gradient boosting (XGBoost).
Add a description, image, and links to the audio-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the audio-speech-recognition topic, visit your repo's landing page and select "manage topics."