使用Cosyvoice2进行情感语音合成 #921

zhanghanweii · 2025-01-22T09:08:31Z

在按照论文中的方式合成情感音频时无效，论文中的样例如下：

你能用高兴的情感说吗？< |endofprompt| >今天真是太开心了，马上要放假了！I’m so happy, Spring Festival is coming!

我的代码如下：
import sys
sys.path.append("third_party/Matcha-TTS")
from cosyvoice.cli.cosyvoice import CosyVoice, CosyVoice2
from cosyvoice.utils.file_utils import load_wav
import torchaudio

cosyvoice = CosyVoice2('./CosyVoice2-0.5B', load_jit=False, load_trt=False)

cosyvoice = CosyVoice('./pretrained_models/CosyVoice-300M-Instruct')

prompt_speech_16k = load_wav('xiaojin_5s.wav', 16000)

for i, j in enumerate(cosyvoice.inference_cross_lingual('你能用安慰的情感说吗？同时语速放缓。< |endofprompt| >哎呀，没有抢到回家的票真的会让人很失落呢
，但希望你打起精神，一定还会有其他回家的方法的。', prompt_speech_16k, stream=False)):
torchaudio.save('zero_shot_{}.wav'.format(i), j['tts_speech'], cosyvoice.sample_rate)

上述代码会把整段txt文本进行合成，而没有使用情感描述，请问代码有哪些问题呢

Aydous · 2025-01-23T13:40:22Z

cosyvoice.inference_instruct2('哎呀，没有抢到回家的票真的会让人很失落呢，但希望你打起精神，一定还会有其他回家的方法的。', '你能用安慰的情感说吗？同时语速放缓', prompt_speech_16k, stream=False)
试试这个

github-actions · 2025-02-23T02:04:26Z

This issue is stale because it has been open for 30 days with no activity.

github-actions · 2025-03-10T01:52:09Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions bot added the stale label Feb 23, 2025

github-actions bot closed this as completed Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

使用Cosyvoice2进行情感语音合成 #921

使用Cosyvoice2进行情感语音合成 #921

zhanghanweii commented Jan 22, 2025

Aydous commented Jan 23, 2025

github-actions bot commented Feb 23, 2025

github-actions bot commented Mar 10, 2025

使用Cosyvoice2进行情感语音合成 #921

使用Cosyvoice2进行情感语音合成 #921

Comments

zhanghanweii commented Jan 22, 2025

cosyvoice = CosyVoice('./pretrained_models/CosyVoice-300M-Instruct')

Aydous commented Jan 23, 2025

github-actions bot commented Feb 23, 2025

github-actions bot commented Mar 10, 2025