Replies: 6 comments 13 replies
-
#13 Maybe you might have to train it from scratch. |
Beta Was this translation helpful? Give feedback.
-
Finally got it training, I think, the current version of VCTK (0.92) seems to be missing ~500 audio files which have been used in filelist |
Beta Was this translation helpful? Give feedback.
-
Only checked the tensorboard, but I think it's getting there: @p0p4k is it worth to go for a full train up to multiple 100k steps or rather wait for more stuff to be fixed/adjusted? |
Beta Was this translation helpful? Give feedback.
-
Example.zip This is at 76k steps now btw. |
Beta Was this translation helpful? Give feedback.
-
Unlike in the ljspeech discussion, I feel like my trained model if currently worse in terms of emphasis and pauses. |
Beta Was this translation helpful? Give feedback.
-
Checkpoints at 600k and 800kish: Examples with 3 speakers: Will train it to 1m and then likely stop as there doesn't seem to be much more improvement. |
Beta Was this translation helpful? Give feedback.
-
Has someone started the training of a vctk dataset yet?
Not sure how long training would take, but I have some spare GPU power every now and then (2x 3090), so I could train it on and off but that would take a while.
Beta Was this translation helpful? Give feedback.
All reactions