Representation Mixing

This repo contains code and pretrained models supporting the paper "Representation Mixing for TTS Synthesis".

Try the demo! https://colab.research.google.com/github/kastnerkyle/representation_mixing/blob/master/pretrained/representation_mixing_text_to_speech_demo.ipynb

Samples site: https://s3.amazonaws.com/representation-mixing-site/index.html

Abstract

Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice between character or phoneme input can create serious limitations for practical deployment, as direct control of pronunciation is crucial in certain cases. We demonstrate a simple method for combining multiple types of linguistic information in a single encoder, named representation mixing, enabling flexible choice between character, phoneme, or mixed representations during inference. Experiments and user studies on a public audiobook corpus show the efficacy of our approach.

(Taken from the paper)
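To make the idea of choosing between character, phoneme, or mixed input more concrete, here is a minimal sketch of how a mixed token sequence could be prepared at the input level. It is not the code from this repo: the function name `mix_representations`, the toy lexicon, the `phoneme_prob` parameter, and the mask convention (0 for character tokens, 1 for phoneme tokens, spaces counted as characters) are all assumptions made for illustration.

```python
import random

# Hypothetical toy lexicon with ARPAbet-style phonemes (an assumption for
# this sketch; a real system would use a full pronunciation dictionary).
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def mix_representations(text, lexicon, phoneme_prob=0.5, rng=None):
    """Pick characters or phonemes independently for each word.

    Returns (tokens, mask), where mask[i] is 0 if tokens[i] is a character
    and 1 if it is a phoneme. Words missing from the lexicon always fall
    back to characters.
    """
    if rng is None:
        rng = random.Random()
    tokens, mask = [], []
    for i, word in enumerate(text.lower().split()):
        if i > 0:
            # Word separator, treated as a character token in this sketch.
            tokens.append(" ")
            mask.append(0)
        phones = lexicon.get(word)
        if phones is not None and rng.random() < phoneme_prob:
            tokens.extend(phones)
            mask.extend([1] * len(phones))
        else:
            tokens.extend(word)
            mask.extend([0] * len(word))
    return tokens, mask


if __name__ == "__main__":
    tokens, mask = mix_representations("hello world", TOY_LEXICON, rng=random.Random(0))
    print(tokens)
    print(mask)
```

In this sketch, setting `phoneme_prob=0.0` yields a pure character sequence and `phoneme_prob=1.0` yields phonemes for every word found in the lexicon, so the same function covers the character, phoneme, and mixed cases described in the abstract.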

Architecture Diagram

More Info

pretrained/ contains information and code for the pretrained models, as well as a Colab notebook for sampling from the pretrained model.

code/ (will) contain a NON-RUNNABLE code dump of my research library used for training the model. This is only for very, very interested people who want to see the model definition in code. If you just want sound, use the Colab.
