The digits recordings data has been taken from here.
- Download Kaldi using
git clone https://github.com/kaldi-asr/kaldi.git kaldi-trunk --origin golden
- Go to
kaldi-trunk/tools/extra/
and run the script./check_dependencies.sh
, follow the instructions and re-run it again till it outputs :
./check_dependencies.sh: all OK.
- Install Kaldi using by doing the following:
In
kaldi-trunk/tools/
do :-j 8 will run 8 jobs in parallel because it may take a while, you can change it to the number of processors you have. usemake -j 8; cd ../src; ./configure; make -j 8
nproc
to check how many logical processors your device have - Go to
kaldi-trunk/tools/
and install SRILM using the script./install_srilm.sh
Note that you may need to go to SRILM website and download the SRILM and put it in the directorykaldi-trunk/tools/
and rename it tosrilm.tgz
before running the script - Go to
kaldi-trunk/egs/
and clone this repo usinggit clone https://github.com/Hamahmi/kaldi-tut.git
- Source
tools/env.sh
inpath.sh
by adding
# Sourcing tools/env.sh ~H
. ../../tools/env.sh
in path.sh
- Change the paths in the files to be correct instead of home/hamahmi/kaldi-trunk/ use your own path
- Run the scripts
script_for_utt2spk.sh
,script_for_wav.scp.sh
, andscript_for_text.sh
in bothkaldi-trunk/egs/kaldi-tut/digits_audio/test/
andkaldi-trunk/egs/kaldi-tut/digits_audio/train
to generate the respective files. Completing these steps ensures that the data has been properly set up. - Run the script
./run.sh
:D
For further steps, kaldi-for-dummies can be followed from here.