Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(gemma3): use GeGLU activation instead of SwiGLU
#1825 opened Apr 10, 2026 by leofan-lab Loading…
feat: add GLM-5 SFT loss mask support
#1824 opened Apr 10, 2026 by stevewx Contributor Loading…
4 tasks done
fix address already in use
#1819 opened Apr 8, 2026 by xutianming Contributor Loading…
Add Qwen3.5 VLM CI run-ci-changed
#1814 opened Apr 7, 2026 by zhuzilin Contributor Loading…
feat: delta compression for weight sync
#1806 opened Apr 5, 2026 by nanjiangwill Collaborator Draft
feat: add checkpoint retention limit to automatically clean up old checkpoints
#1798 opened Apr 2, 2026 by stevewx Contributor Loading…
4 tasks done
Add rollout sampling-mask support run-ci-short
#1795 opened Apr 2, 2026 by yitianlian Collaborator Loading…
Hook proposal
#1774 opened Mar 27, 2026 by andrija-s Draft
[kimi25 rl part4] Support K25 HF weight conversion between BF16\FP8\INT4
#1757 opened Mar 23, 2026 by Gao016 Contributor Loading…
[kimi25 rl part2] pass megatron bridge provider args from slime config
#1754 opened Mar 23, 2026 by GeLee-Q Contributor Loading…
[docker] fix qwen3_vl visual module loading
#1727 opened Mar 15, 2026 by ZHZisZZ Loading…
Add Mooncake Backend for Rollout Data Transfer run-ci-megatron
#1709 opened Mar 11, 2026 by zxpdemonio Loading…
6 tasks done
PipelineRL -- keep cache on weight update
#1694 opened Mar 9, 2026 by hari-hm Contributor Loading…
fix: normalize rewards per-group when sample counts are unequal
#1655 opened Mar 2, 2026 by dubin555 Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654 opened Mar 2, 2026 by tourzhao Loading…
3 tasks
[WIP] fix transforrmers api change at 5.2.0 run-ci-megatron
#1647 opened Feb 28, 2026 by UbeCc Member Loading…
Refactor code safety checks by removing patterns
#1643 opened Feb 28, 2026 by Rohan5commit Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.