-
/Netdata/2020/TTS/SpeakingScoringTTS
-
GAMMA TONE, LFCC, ...
-
Concatenate one-hot encoding to transformer input
-
Include one-hot encoding of the previous and the next phone in transformer input
-
Add duration (number of frames) to the input of transformer