A tool that converts speech datasets from the NeMo/DeepSpeech format into the Kaldi data directory layout described at https://kaldi-asr.org/doc/data_prep.html
Python >= 3.5
KenLM (for building the language model)
git clone https://github.com/samespace/kaldi-data-preperation.git && cd kaldi-data-preperation
pip3 install -r requirements.txt
python3 data.py -h
python3 data.py dicts/dict-en.ipa data.csv
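For orientation, here is a minimal sketch of the kind of output the Kaldi data preparation guide expects. It is not the code in data.py. It assumes a DeepSpeech-style CSV with the columns wav_filename, wav_filesize and transcript, derives each utterance id from the wav file name, and treats every utterance as its own speaker because that CSV carries no speaker labels. The function name csv_to_kaldi is hypothetical.

```python
import csv
import os


def csv_to_kaldi(csv_file, out_dir):
    """Write Kaldi-style wav.scp, text and utt2spk from a DeepSpeech-style CSV.

    csv_file is an open text file (or any iterable of lines) with a header row
    containing at least the columns wav_filename and transcript.
    """
    rows = {}
    for row in csv.DictReader(csv_file):
        # Assumption: the utterance id is the wav file name without extension.
        utt_id = os.path.splitext(os.path.basename(row["wav_filename"]))[0]
        rows[utt_id] = (row["wav_filename"], row["transcript"].strip())

    os.makedirs(out_dir, exist_ok=True)
    # Kaldi expects these files sorted by utterance id; Python's default
    # string sort orders by code point, which matches C-locale byte order.
    utt_ids = sorted(rows)

    wav_scp = open(os.path.join(out_dir, "wav.scp"), "w", encoding="utf-8")
    text = open(os.path.join(out_dir, "text"), "w", encoding="utf-8")
    utt2spk = open(os.path.join(out_dir, "utt2spk"), "w", encoding="utf-8")
    with wav_scp, text, utt2spk:
        for utt_id in utt_ids:
            wav_path, transcript = rows[utt_id]
            wav_scp.write("{} {}\n".format(utt_id, wav_path))
            text.write("{} {}\n".format(utt_id, transcript))
            # Assumption: no speaker labels, so speaker id == utterance id.
            utt2spk.write("{} {}\n".format(utt_id, utt_id))
    return utt_ids
```

Calling csv_to_kaldi(open("data.csv", newline="", encoding="utf-8"), "data/train") would write the three files into data/train. A full Kaldi setup also needs a lexicon and a language model, which this sketch does not cover.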
This repo is licensed under the GNU GPL v3. See the LICENSE file for the full text.