
Pull requests: NVIDIA-NeMo/RL


Pull requests list

Create performance-summary.md for NeMo RL
Labels: documentation (Improvements or additions to documentation)
#1560 opened Nov 24, 2025 by snowmanwwg
test: LoRA support for DTensorV2 path for CI
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests)
#1559 opened Nov 24, 2025 by RayenTian Draft
4 tasks
feat: LoRA support for DTensorV2 path
#1556 opened Nov 21, 2025 by samodi-nv Draft
1 of 4 tasks
fix: remove sft-qwen2.5-fsdp2tp8sp from nightlies
Labels: CI:L0 (Run doctests and unit tests)
#1555 opened Nov 20, 2025 by ahmadki
chore: Improve checkpoint loading error messages with common issue and a fix
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#1554 opened Nov 20, 2025 by ahmadki
fix: add H200 TFLOPS
Labels: CI:L0 (Run doctests and unit tests), community-request
#1543 opened Nov 19, 2025 by clumsy
4 tasks done
feat: per-worker active/idle timeline + IFB size logging
Labels: CI:L1 (Run doctests, unit tests, and functional tests), enhancement (New feature or request), Performance (Related to improving performance)
#1534 opened Nov 18, 2025 by youngeunkwon0405
4 tasks
feat: Support qwen3-next, mcore path
#1530 opened Nov 17, 2025 by ahmadki
1 task
feat: force on-policy ratio to 1
#1529 opened Nov 17, 2025 by yfw Draft
4 tasks
feat: RL sampler [WIP]
#1522 opened Nov 14, 2025 by pjin-nvidia Draft
4 tasks
feat: Add moe load balancing metrics
#1520 opened Nov 13, 2025 by yfw Draft
4 tasks
feat: Automodel init for DTensorPolicyV2
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests)
#1509 opened Nov 12, 2025 by adil-a
refactor: refactor env and data processor & add nemotron super 49b recipes
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests), documentation (Improvements or additions to documentation)
#1506 opened Nov 11, 2025 by yuki-97
build: Use dynamic engine for generate.
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#1502 opened Nov 11, 2025 by shanmugamr1992
4 tasks
feat: pipeline-rl style # of inflight prompt regulation
Labels: CI:L1 (Run doctests, unit tests, and functional tests), documentation (Improvements or additions to documentation)
#1499 opened Nov 10, 2025 by youngeunkwon0405
4 tasks
feat: allow uv-less execution and fingerprint the environment
Labels: CI:L1 (Run doctests, unit tests, and functional tests), CI (Relating to CI), documentation (Improvements or additions to documentation)
#1491 opened Nov 9, 2025 by terrykong
fix: Megatron static inference and adapt to mcore engine API changes
Labels: CI:L1 (Run doctests, unit tests, and functional tests), r0.4.0
#1488 opened Nov 7, 2025 by shanmugamr1992
4 tasks
feat: Add AceMathRL recipe
#1484 opened Nov 6, 2025 by ffrujeri Draft
4 tasks
feat: fp16 for DTensor policies
#1474 opened Nov 5, 2025 by adil-a
Mmanohara/merge grpo helpsteer cp tp
Labels: community-request
#1472 opened Nov 4, 2025 by nv-mmanohara
4 tasks
feat: DTensorPolicyV2 GPT-OSS support
Labels: CI:L0 (Run doctests and unit tests)
#1470 opened Nov 4, 2025 by adil-a