We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
训练数据集720条,测试集80条,共2.5h左右,训练参数使用的是默认参数,训练了30轮看曲线,发现过拟合很严重
可否提供一些训练建议,另外选取了epoch1和epoch22的参数进行生成,用zeroshot方法可以正常生成(且音色相似度也会提高),用sft方法则会出现长间断的空白,声音也很奇怪
The text was updated successfully, but these errors were encountered:
sft是指针对单个说话人训练,需要准备一个人音频1h以上
Sorry, something went wrong.
2.5h左右音频是单个人的,来源是B站易中天演讲,人耳听音频质量OK
No branches or pull requests
训练数据集720条,测试集80条,共2.5h左右,训练参数使用的是默认参数,训练了30轮看曲线,发现过拟合很严重

可否提供一些训练建议,另外选取了epoch1和epoch22的参数进行生成,用zeroshot方法可以正常生成(且音色相似度也会提高),用sft方法则会出现长间断的空白,声音也很奇怪
The text was updated successfully, but these errors were encountered: