Replies: 3 comments 5 replies
-
Also, when synthesizing speech all I can hear is noise (similar to pretrained hifigan and fastpitch). `CUDA_VISIBLE_DEVICES=0 tts --use_cuda true --text "This was the time when I was young and happy and what not used to go here and there, then every where I see my eyes never despise the horizon." --config_path fast_pitch_vctk-November-28-2021_02+38PM-0000000/config.json --model_path fast_pitch_vctk-November-28-2021_02+38PM-0000000/best_model.pth.tar --vocoder_path ../hifigan/coqui_tts-November-28-2021_01+58PM-0000000/checkpoint_240000.pth.tar --vocoder_config ../hifigan/config.json --out_path vctk_fp_hfgan.wav --speaker_idx VCTK_p226
|
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Apparently, I should have launched vocoder training through train_vocoder.py --config_path config.json instead of distributed training of train_hifigan.py. @erogol it's really not that apparent the different of two training options, considering the latter leads nowhere. |
Beta Was this translation helpful? Give feedback.
-
Describe the bug
Running for 240k steps no improvement is avg loss when training HiFiGan.
To Reproduce
Steps to reproduce the behavior:
--> STEP: 24/352 -- GLOBAL_STEP: 244300
| > G_l1_spec_loss: 0.36788 (0.35471)
| > G_gen_loss: 16.55468 (15.96176)
| > G_adv_loss: 0.00000 (0.00000)
| > loss_0: 16.55468 (15.96176)
| > grad_norm_0: 22.85464 (28.87626)
| > current_lr_0: 7.0524350586068e-111
| > current_lr_1: 0.00010
| > step_time: 0.29070 (0.29190)
| > loader_time: 0.00150 (0.00135)
3. Evaluation:
--> EVAL PERFORMANCE
| > avg_loader_time: 0.00034 (-0.00003)
| > avg_G_l1_spec_loss: 0.35621 (+0.00000)
| > avg_G_gen_loss: 16.02957 (+0.00000)
| > avg_G_adv_loss: 0.00000 (+0.00000)
| > avg_loss_0: 16.02957 (+0.00000)
Expected behavior
Improvement in loss during training.
Environment (please complete the following information):
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 20.04
PyTorch or TensorFlow version (use command below): pytorch 1.10.0
Python version: 3.8.11
CUDA/cuDNN version: py3.8_cuda11.3_cudnn8.2.0_0
GPU model and memory: 2xRTX 3090
Additional context
Add any other context about the problem here.
Script also generates config.json in the dir where train_hifigan.py resides as well as in the generated run dir.
Beta Was this translation helpful? Give feedback.
All reactions