speech-commands

Here are 20 public repositories matching this topic...

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

audio deep-learning pytorch representation-learning audio-classification keyword-spotting speech-commands speech-classification

Updated May 21, 2023
Jupyter Notebook

Audio-WestlakeU / audiossl

Star

A library built for easier audio self-supervised training, downstream tasks evaluation

pytorch audio-classification audioset nsynth speech-commands audio-datasets self-supervised-learning voxceleb1 urbansound8k pytorch-lightning audio-representation audio-self-supervised-learning audio-pretraining

Updated Sep 25, 2025
Python

dobby-seo / Wav2Keyword

Star

Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.

transfer-learning keyword-spotting fine-tuning state-of-the-art kws speech-commands

Updated Jan 11, 2023
Python

nyumaya / nyumaya_audio_recognition

Star

Classify audio with neural nets on embedded systems like the Raspberry Pi

raspberry-pi machine-learning embedded-systems hotword-detection keyword-spotting audio-recognition wake-word-detection speech-commands hotword

Updated Apr 10, 2024
Python

philsyn / DiffWave-unconditional

Star

Pytorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.

waveform speech pytorch speech-synthesis waveform-generation speech-commands waveform-generator diffwave

Updated Apr 13, 2021
Python

ace19-dev / tensorflow-speech-recognition-challenge

Star

Kaggle Competitions: TensorFlow Speech Recognition Challenge

audio tensorflow kaggle-competition speech-recognition speech-commands

Updated Mar 4, 2018
Python

htqin / BiFSMN

Star

Pytorch implementation of BiFSMN, IJCAI 2022

keyword-spotting binary-neural-networks speech-commands

Updated Feb 10, 2023
Python

isadrtdinov / kws-attention

Star

Attention-based model for keywords spotting

deep-learning pytorch attention-mechanism keyword-spotting speech-commands

Updated Aug 9, 2021
Python

shitian-ni / speech-recognition-transfer-learning

Star

Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow

tensorflow keras kaggle speech-recognition densenet transfer-learning dilatednet speech-commands

Updated Jan 19, 2018
Python

usc-sail / gen-dmcca

Star

Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations

multiview-learning speech-commands speech-command-recognition deep-multiset-cca speech-embeddings

Updated Apr 9, 2019
Python

danieleninni / small-footprint-keyword-spotting

Star

Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting

data-science machine-learning deep-learning cnn speech-recognition rnn resnet attention-mechanism audio-classification keyword-spotting conformer speech-commands

Updated Mar 2, 2023
Python

manojsvgit / Voice_Based_Email_For_Blind

Star

A Python-based application designed specifically for visually impaired users, enabling them to seamlessly send and receive emails using intuitive speech commands. This innovative solution enhances accessibility and independence by allowing users to manage their email communication effortlessly, utilizing voice recognition technology to ensure a us.

machine-learning natural-language-processing accessibility voice-recognition speech-to-text user-experience assistive-technology email-client command-line-interface python-development email-automation speech-commands voice-user-interface python-libraries project-for-visually-impaired

Updated Nov 24, 2024
Python

tuanio / audio-classification

Star

Audio Classification with AlexNet and Speech Commands dataset

pytorch speech-recognition alexnet audio-classification speech-commands pytorch-lightning

Updated May 5, 2022
Python

mryndzionek / kws_cli

Star

Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.

cli lightweight machine-learning voice-commands pytorch speech-recognition machinelearning hotword-detection keyword-spotting c-language wake-word-detection onnx kws speech-commands hotword-detector word-spotting tinyml wake-word edgeml

Updated Aug 4, 2024
C

epfluegel / TalkMaths

Star

A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX) by voice, using a ZOO interface (Zoomable Online Outliner) such as WorkFlowy or Dynalist.

latex voice-commands speech-recognition workflowy dynalist speech-commands spoken-digits vocola spoken-maths