DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

This repository is the official implementation of DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability.

In this repository, we provide steps for running DEX-TTS and GeDEX-TTS.

🙏 We recommend you visit our demo site. 🙏

DEX-TTS is diffusion-based expressive TTS using reference speech. The overall architecture of DEX-TTS is as below:

GeDEX-TTS is the general version of DEX-TTS, which does not use reference speech. The overall architecture of GeDEX-TTS is as below:

Shortcuts

You can find codes, a demo site, and paper links below.

[👉 Demo] [📄 Paper] [💻 DEX-TTS Code] [💻 GeDEX-TTS Code]

ToDo

Citation

@article{park2024dex,
  title={DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability},
  author={Park, Hyun Joon and Kim, Jin Sob and Shin, Wooseok and Han, Sung Won},
  journal={arXiv preprint arXiv:2406.19135},
  year={2024}
}

License

This repository will be released under the MIT license.

Thanks to the open source codebases such as RetNet, FastSpeech2, Grad-TTS, DiT, MaskDiT, and EDM. This repository is built on them.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
DEX-TTS		DEX-TTS
GeDEX-TTS		GeDEX-TTS
images		images
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

Shortcuts

ToDo

Citation

License

About

Releases 1

Packages

Languages

License

winddori2002/DEX-TTS

Folders and files

Latest commit

History

Repository files navigation

DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

Shortcuts

ToDo

Citation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages