audio-speech-recognition

Here are 4 public repositories matching this topic...

跨平台、多任务、高度自定义的骰系开发框架。

nlp dice text-to-speech framework ai cross-platform model artificial-intelligence tts webui dice-roller roll ner asr re dice-roller-library nature-language-processing hydroroll audio-speech-recognition

Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper

Analyze an audio file and count words, sentences and timestamps, filler words

A machine learning solution for classifying emotions in speech audio using hybrid deep learning (CNN-LSTM) and gradient boosting (XGBoost).

Add a description, image, and links to the audio-speech-recognition topic page so that developers can more easily learn about it.

To associate your repository with the audio-speech-recognition topic, visit your repo's landing page and select "manage topics."