- Feature Extraction:
- 39 MFCC features
- Acoustic Model:
- Lexicon/Pronunciation Model
- HMM: what phones can follow each other
- Language Model
- Decoder
- Viterbi Algorithm: dynamic programming for combining all these to get word squence from speech
To execute:
python main.py
To print results:
python summary.py
Objective: Find the most probable sequences of states that maximizes the posterior