-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Pull requests: verl-project/verl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
build(deps): update transformers requirement from <5.0.0 to <6.0.0
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#5293
opened Feb 11, 2026 by
dependabot
bot
Loading…
build(deps): bump sglang[all] from 0.5.2 to 0.5.8.post1
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#5292
opened Feb 11, 2026 by
dependabot
bot
Loading…
[misc,trainer,rollout] feat: add Prometheus metrics logging to experiment tracking
#5291
opened Feb 11, 2026 by
guillemgt
Loading…
7 of 8 tasks
Update installation instruction to the actual repo link
#5290
opened Feb 11, 2026 by
Godofnothing
Loading…
[reward] fix: reward model args and reward_kwargs bug
#5289
opened Feb 11, 2026 by
yyDing1
Loading…
8 tasks
[doc] fix: update recipe link to fix 404 not found
#5286
opened Feb 11, 2026 by
tardis-key
Loading…
3 of 8 tasks
[doc] chore: gspo update config and add version with npu
#5279
opened Feb 11, 2026 by
chengminhua
Loading…
1 of 8 tasks
[vllm] feat: add model saving func to dump vllm state_dict before and after update weights
#5278
opened Feb 11, 2026 by
RobotGF
Loading…
8 tasks
[rollout] fix: prompt2text decoding for
SingleTurnAgentLoop
#5277
opened Feb 11, 2026 by
jnash10
Loading…
[veomni] feat: Add GRPO training scripts for Qwen3-VL-30B-MOE (VeOmni Backends)
#5275
opened Feb 11, 2026 by
phdddd
Loading…
8 tasks
[rollout] fix: use the same request_ids in each rollout turn for better tracking in rollout backend
#5271
opened Feb 10, 2026 by
PeterSH6
Loading…
8 tasks
[fsdp, vllm] feat: add NPU GRPO training scripts for Qwen3-VL-30B (FSDP/VLLM backends)
#5260
opened Feb 10, 2026 by
alwaysyiyu
Loading…
8 tasks
[algo] feat: add NPU SAPO training script for Qwen3-8B (FSDP/vLLM backends)
#5257
opened Feb 10, 2026 by
Vvictorrrr
Loading…
2 of 7 tasks
[fsdp, vllm] feat: add NPU GRPO training scripts for Qwen3-VL-8B (FSDP/VLLM backends)
#5250
opened Feb 9, 2026 by
zhihaofang1017
Loading…
8 tasks
[fsdp] feat: upcast MoE routing to FP32 for better accuracy
#5249
opened Feb 9, 2026 by
Shangwei-Li
Loading…
3 of 8 tasks
[megatron] feat: add script for qwen3_235b_grpo training on npu platform
#5242
opened Feb 9, 2026 by
wangshuyang31
•
Draft
8 tasks
[rollout,vllm] Fix DP args and local_rank for Ray NOSET_VISIBLE_DEVICES
#5233
opened Feb 7, 2026 by
JohnConnor123
Loading…
2 of 3 tasks
fix(agent_loop): handle batch size smaller than num_workers
#5231
opened Feb 7, 2026 by
aoshen524
Loading…
3 tasks
feat(vision): add Vision DP for parallel ViT computation across SP ranks
#5230
opened Feb 7, 2026 by
aoshen524
Loading…
3 of 4 tasks
[trainer] feat: add per-round logprob mismatch metrics for multi-turn training
#5229
opened Feb 7, 2026 by
aoshen524
Loading…
5 of 7 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-02-08.