mixed languages sentence. #296
Replies: 2 comments 6 replies
-
Hi @shuanshuan Just so that we are on the same page and please correct me if I am wrong, but you want to be able to send over a single block of text to a TTS engine that is mixed languages and have it generate the speech as a 1x process. So, assuming that is the correct interpretation, the answer is typically a no.... BUT it may be possible with the XTTS model (I cant speak for other models atm). If you want to try this, you will need to be on AllTalk v2 and please understand this is my best guess after a quick little look at how the XTTS tokenizer/phonemizer is working. Please read all the below before you make any changes as it may not be exactly what you want. We would need to update the allowed API passthrough to allow square brackets So in
Once you have done that and saved the settings, reload the XTTS model. Once that is loaded, you can try something like:
and you should see the brackets/language shown As you will see, we are specifying the language in brackets e.g. To explain the tokenizers https://github.com/erew123/alltalk_tts/blob/main/system/ft_tokenizer/tokenizer.py or actually phonemization process, these are used by all TTS engines to break down specific languages and handle the language unique specific quirks. So many TTS engines use espeak-ng or gruut or something like that. When you tell it which language to use, it can then deal with the languages unique things like expanding abbreviations (Dr to Doctor), numbers (12 to twelve), symbols (% to percent), vocabulary mapping etc, basically things that are unique/special about that language. Hence, you always (currently) have to tell the TTS engine what language the text is in, so that it can correctly process the language and its unique characteristics, quirks and symbols then convert those to sound representations that match the language. So, in theory, the way I have shown above, to use the As far as I am aware, there is no TTS engine/model that will just identify a language without a character code being sent. So currently, you would always have to tag each section of language within the text. Hope that helps clarify. Thanks |
Beta Was this translation helpful? Give feedback.
-
[zh]我真的很喜欢[en]AllTalkTTS |
Beta Was this translation helpful? Give feedback.
-
Is there any way or model support mixed languages sentence。
It mainly speaks Chinese and pronounces English correctly.
Beta Was this translation helpful? Give feedback.
All reactions