A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech.
Updated Nov 28, 2025 - Python
Official source code for the paper "EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing".
A natural-language interface that goes beyond GPT limits: emotion control, parallel commands, and quantified inputs such as "remove 30% emotion".