Replies: 7 comments 1 reply
-
Dear @synesthesiam can you please try to help me with this? |
Beta Was this translation helpful? Give feedback.
-
Looking more into the piper-phonemize I can see all the languages except of ukrainian are trained with espeak-ng as the phonemizer. |
Beta Was this translation helpful? Give feedback.
-
Okay, I think I have finally figured out why I am getting different results than you with the Lili slovak voice. Greetings Peter |
Beta Was this translation helpful? Give feedback.
-
Wow, Peter, Here was my original issue that I hoped someone would reply to earlier this year: #536 Thanks, |
Beta Was this translation helpful? Give feedback.
-
Hello @isolveit-aps Once I'll manage to get it polished I'll consider creating relevant pull requests. Thanks and greetings Peter |
Beta Was this translation helpful? Give feedback.
-
Hi Peter @pvagner, Anyways, I will try with a dataset that has more numbers coverage, so that the model will train those spoken numbers, hopefully learning these by triggering the phonemization rules for Faroese numbers. I will also try copying the fo_dic file to my espeak-phonemizer, to see if that helps me with this problem, also. Thanks again for your response - I will keep on trying :) |
Beta Was this translation helpful? Give feedback.
-
Dear All I don't know if this is useful or not, but when I started searching for TTS systems, I saw a very few metadata.csv files that had three columns. One was the names of the .wav file, the next was the text; just like normal. The third column was expanded, with numbers expanded into their word forms. I have tried to find this again and not got there yet. |
Beta Was this translation helpful? Give feedback.
-
Hello,
I am working on a slovak voice together with a friend. He has recorded some 3630 high quality samples of his voice.
The training is running and now after some 1000 steps the voice sounds verry promising however it does not correctly pronounce some numbers such as
4, 40, 41, 45, 5, 50 and so on.
When writing these numbers as words, it's being pronounced correctly.
I guess some of the eSpeak phonemes are not taken into account.
When preprocessing it looks like this.
Looking to the espeak dictsource/sk_list I can see entries such as:
When comparing these with numbers written as words, I would change these to:
I can then compile espeak sk and I will get an sk_dic file.
How do I update the piper_phonemize with these tweaks?
The existing slovak voice named lili does not exhibit this behaviour. Is this the proper way on how to address such an issue?
Also when I determine the correct solution to my issue, can I continue training with the fixed data or do I have to start from scratch?
Please note there are more occurences of these phonemes I have just posted these sk_list entries as an example.
Greetings
Peter
Beta Was this translation helpful? Give feedback.
All reactions