Issues: modelscope/ms-swift

GRPO (R1) training discussion group
#3076 opened Feb 12, 2025 by Jintao-Huang
Open 3
Megatron-SWIFT training discussion group
#3604 opened Mar 21, 2025 by Jintao-Huang
Open
ms-swift3 Suggestion Box
#2217 opened Oct 10, 2024 by Jintao-Huang
Open 39

Issues list

Qwen2.5-Omni-7B: error during API inference after deployment
#3724 opened Mar 31, 2025 by a67793581
Are there plans to support the vLLM V1 engine?
#3720 opened Mar 30, 2025 by 1212wuhu
GRPO max_grad_norm doesn't seem to work
#3713 opened Mar 28, 2025 by sqc2290318555
Question about loss_scale
#3711 opened Mar 28, 2025 by Hello-Worldd
DeepSpeed ZeRO++ produces NaN
#3697 opened Mar 27, 2025 by MuyeHuang
multi-node grpo training hangs
#3695 opened Mar 27, 2025 by phoenixbai
Cache Inference Optimization
#3689 opened Mar 27, 2025 by Eduiskss