Implementation of Persian Isolated-Digits Recognition with Matlab
I recorded (0 to 9) digits two times (one for the test and the other one as a reference) in Persian (you can do that in your language!). Then extracted their features (5 different feature vectors, one for each question(i).m) and tried to recognize them with cepstral distance measure. (see image below)
Actually, cepstral distance is not a good measure for the speech recognition task because it ignores time warping. for more accuracy, you can use neural networks such as RNNs.
question1:
12 real cepstrum coefficients
question2:
12 MFCC (Mel-frequency cepstral coefficients)
question3:
12 MFCC + 1 Energy coefficient
question4:
12 MFCC + 1 Energy coefficient and their first derivatives
question5:
12 MFCC + 1 Energy coefficient and their first and second derivatives