Speech Recognition for Uyghur using deep learning

Training:

this model using CTC loss for training.

unzip results.7z and thuyg20_data.7z to the same folder where python source files located. then run:

python train.py

Recognition:

for recognition download only pretrained model(results.7z). then run:

python tonu.py test1.wav

result will be:

        Model loaded: results/UModel_last.pth
            Best CER: 7.21%
             Trained: 473 epochs
The model has 26,389,282 trainable parameters

======================
Recognizing file .\test2.wav
test2.wav -> bu öy eslide xotunining xush tebessumi oghlining omaq külküsi bilen güzel idi

This project using

A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
README.md		README.md
UModel.py		UModel.py
cafe.wav		cafe.wav
data.py		data.py
perlin.wav		perlin.wav
radionoise.wav		radionoise.wav
silence.wav		silence.wav
test1.wav		test1.wav
test2.wav		test2.wav
test3.wav		test3.wav
test4.wav		test4.wav
test5.wav		test5.wav
test6.wav		test6.wav
thuyg20_test.csv		thuyg20_test.csv
thuyg20_train.csv		thuyg20_train.csv
tonu.py		tonu.py
train.py		train.py
uyghur.py		uyghur.py
white.wav		white.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech Recognition for Uyghur using deep learning

About

Releases 1

Packages

Languages

gheyret/uyghur-asr-ctc

Folders and files

Latest commit

History

Repository files navigation

Speech Recognition for Uyghur using deep learning

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages