Aligning latent space of speaking style with human perception using a re-embedding strategy
pytorch speech-synthesis vocoder fastspeech2 pytorch-distributeddataparallel hifi-gan speaking-style blizzard-challenge
Updated Jul 21, 2023 - Jupyter Notebook