How can I train a model with pure speech tokens and text tokens? #920

PoTaTo-Mika · 2025-01-22T07:20:15Z

The tool "make_parquet_list" is quite useful, but I wonder if I can just use the speech tokens and the text tokens to train the base model Qwen2.5-0.5B from scratch to make it able to generate semantic tokens.

aluminumbox · 2025-01-23T16:43:58Z

yes we will update cosyvoice2 llm training in the future

github-actions · 2025-02-23T02:04:27Z

This issue is stale because it has been open for 30 days with no activity.

github-actions · 2025-03-10T01:52:10Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions bot added the stale label Feb 23, 2025

github-actions bot closed this as completed Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I train a model with pure speech tokens and text tokens? #920

How can I train a model with pure speech tokens and text tokens? #920

PoTaTo-Mika commented Jan 22, 2025

aluminumbox commented Jan 23, 2025

github-actions bot commented Feb 23, 2025

github-actions bot commented Mar 10, 2025

How can I train a model with pure speech tokens and text tokens? #920

How can I train a model with pure speech tokens and text tokens? #920

Comments

PoTaTo-Mika commented Jan 22, 2025

aluminumbox commented Jan 23, 2025

github-actions bot commented Feb 23, 2025

github-actions bot commented Mar 10, 2025