F0-DCTTS (F0 Deep Convolutional TTS)

Description

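DCTTS (Deep Convolutional TTS) extended with fundamental frequency (F0) information, so that F0 can be controlled at synthesis time.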

Prerequisite

Python 3.7
PyTorch 1.3
pysptk
librosa, scipy, tqdm, tensorboardX

Dataset

LJ Speech 1.1
KSS, Korean female single speaker speech dataset.

Samples

samples

Usage

Download a dataset listed above and set its path in config.py. Then run the command below to preprocess it.
```
python prepro.py
```
Train the baseline DCTTS first. It needs at least 100k steps.
```
python train.py <gpu_id>
```
After training the baseline, you can train F0-DCTTS. Set "f0_mode=True" and "pretrained_path=..." in config.py, then run the same command once more.
```
python train.py <gpu_id>
```
You can then synthesize speech with F0. The F0 is controlled with "f0_factor=..." in config.py.
```
python synthesize.py <gpu_id>
```
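
To give a feel for what an F0 factor can do, here is a minimal sketch that assumes "f0_factor" acts as a multiplicative scale on the voiced frames of an F0 contour. This is an assumption for illustration only: the function name scale_f0 is hypothetical and is not taken from this repository, and the actual handling in synthesize.py may differ.

```python
from typing import List


def scale_f0(f0_hz: List[float], f0_factor: float) -> List[float]:
    """Scale the voiced frames of an F0 contour by f0_factor.

    Hypothetical helper (not part of this repository). Frames with
    F0 <= 0 are treated as unvoiced and returned as 0.0.
    """
    if f0_factor <= 0:
        raise ValueError("f0_factor must be positive")
    return [f * f0_factor if f > 0 else 0.0 for f in f0_hz]


# Toy contour in Hz: unvoiced, voiced, voiced, unvoiced.
contour = [0.0, 110.0, 220.0, 0.0]
raised = scale_f0(contour, 1.5)
print(raised)
```

Under this assumption, a factor above 1 raises the pitch, a factor below 1 lowers it, and a factor of 1 leaves the contour unchanged.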

Notes

This method is simple and easy to implement, but it is a very naive approach.
SSRN is removed in this code, so you need a separate mel2wav vocoder. I recommend WaveGlow or Parallel WaveGAN.

Yangyangii/F0-DCTTS

Folders and files

Latest commit

History

Repository files navigation

F0-DCTTS (F0 Deep Convolutional TTS)

Description

Prerequisite

Dataset

Samples

Usage

Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages