Skip to content

Tool to transform data from Nemo/Deepspeech format to Kaldi as described here — https://kaldi-asr.org/doc/data_prep.html

License

Notifications You must be signed in to change notification settings

samespace/kaldi-data-preperation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kaldi Data Preparation

Tool to transform data from Nemo/Deepspeech format to Kaldi as described here — https://kaldi-asr.org/doc/data_prep.html

Requirements

python >= 3.5

Kenlm (for building LM)

Usage

git clone https://github.com/samespace/kaldi-data-preperation.git && cd kaldi-data-preperation
pip3 install -r requirements.txt

python3 data.py -h

Example

python3 data.py dicts/dict-en.ipa data.csv

License

Read LICENSE

This repo is licensed under GNU GPL V3 license

About

Tool to transform data from Nemo/Deepspeech format to Kaldi as described here — https://kaldi-asr.org/doc/data_prep.html

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages