Skip to content

[kimi25 rl part1.1] add weight conversion for kimi25 (weight update for train-infer disaggregation)#1532

Open
GeLee-Q wants to merge 6 commits intoTHUDM:mainfrom
GeLee-Q:kimi_vl_convert_weight
Open

[kimi25 rl part1.1] add weight conversion for kimi25 (weight update for train-infer disaggregation)#1532
GeLee-Q wants to merge 6 commits intoTHUDM:mainfrom
GeLee-Q:kimi_vl_convert_weight

Conversation

@GeLee-Q
Copy link
Copy Markdown
Contributor

@GeLee-Q GeLee-Q commented Feb 2, 2026

Thanks to the SGLang RL community members @yefei12 @Chokoyo @gxlvera for their help.

Thanks to the NVIDIA team @wplf for their guidance on Megatron Bridge.

Thanks to our colleagues on the AQ Infra team @Gao016 @sxl1993 and Algorithms team @Swayyyyy @yzlnew for their support.

[kimi25 rl part1.1] Add weight conversion for kimi25 (weight update for train-infer disaggregation)
#1532

[kimi25 rl part1.2] support kimi25 q-lora pairing in bridge update path (weight update for train-infer colocate)
#1753

[kimi25 rl part2] Pass Megatron Bridge provider arguments from the slime config
#1754

[kimi25 rl part3] Support the K25 VL rollout processor and train-time token expansion
#1755

[kimi25 rl part4] Support K25 HF weight conversion between BF16\FP8\INT4
#1757

[Megatron Bridge]
https://github.com/fzyzcjy/Megatron-Bridge/pull/7/commits

All of the code still requires further large-scale validation. The experimental results will be made public after validation is complete.

@GeLee-Q GeLee-Q closed this Feb 2, 2026
@GeLee-Q GeLee-Q reopened this Feb 2, 2026
@GeLee-Q GeLee-Q marked this pull request as draft February 2, 2026 08:36
@GeLee-Q GeLee-Q changed the title add weight convert for kimi_vl [kimi25 rl part1]: add weight convert for kimi25 (train-infer different gpus update weights) Mar 23, 2026
@GeLee-Q GeLee-Q changed the title [kimi25 rl part1]: add weight convert for kimi25 (train-infer different gpus update weights) [kimi25 rl part1] add weight conversion for kimi25 (weight update for train-infer disaggregation) Mar 23, 2026
@GeLee-Q GeLee-Q marked this pull request as ready for review March 23, 2026 07:00
Co-authored-by: Chokoyo <zcgu@qq.com>
Co-authored-by: sxl1993 <1218197792@qq.com>
Co-authored-by: Gao016 <yngao016@163.com>
Co-authored-by: yefei12 <xjtu_yefeichen@163.com>
@GeLee-Q GeLee-Q force-pushed the kimi_vl_convert_weight branch from 77c1f72 to 0cc8e37 Compare March 23, 2026 07:32
@GeLee-Q GeLee-Q changed the title [kimi25 rl part1] add weight conversion for kimi25 (weight update for train-infer disaggregation) [kimi25 rl part1.1] add weight conversion for kimi25 (weight update for train-infer disaggregation) Mar 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant