Pull requests: pytorch/torchtitan

Pull requests list

[DRAFT] Consolidate simple_fsdp and compiler_toolkit experiments ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2360 opened Feb 10, 2026 by yiming0416 Draft
[Bugfix] Fix simple_rl_multiprocess.py to be runnable with recent vLLM version ciflow/8gpu CLA Signed
#2359 opened Feb 10, 2026 by Lucaskabela
[Bugfix] Fix bitwise determinism after vLLM SiluAndMul change ciflow/8gpu CLA Signed
#2358 opened Feb 9, 2026 by Lucaskabela
[SAC] Refactor activation checkpointing to use centralized policy-based approach ciflow/8gpu CLA Signed
#2357 opened Feb 9, 2026 by mori360 Draft
[RFC][DONT LAND] Support different state_dict for save and load ciflow/8gpu CLA Signed
#2351 opened Feb 9, 2026 by fegin Draft
[ci] Add DSv3 SimpleFSDP auto_bucketing to h100 ci jobs ciflow/8gpu CLA Signed
#2347 opened Feb 9, 2026 by IvanKobzarev
[simple_fsdp] Use schedule_overlap_bucketing_from_inductor_configs for overlap_passes ciflow/8gpu CLA Signed
#2346 opened Feb 9, 2026 by IvanKobzarev
[DRAFT] Optimize MoE Routing via torch.sort Indices DType Injection CLA Signed
#2343 opened Feb 8, 2026 by voidbag Draft
2 of 4 tasks
[dsv3] per-layer error when compile with MoE "HOP: Unsafe side effect" ciflow/8gpu CLA Signed
#2341 opened Feb 7, 2026 by weifengpy
Add run-to-run determinism testing to H100 CI ciflow/8gpu CLA Signed
#2339 opened Feb 6, 2026 by xmfan
random_experiment ciflow/8gpu CLA Signed
#2335 opened Feb 6, 2026 by anshul-si
[torchcomms] Simplify ParallelDims to use base class inheritance and mesh views ciflow/8gpu CLA Signed
#2334 opened Feb 6, 2026 by mori360
Torchtitan changes to integrate into Verl ciflow/8gpu CLA Signed
#2333 opened Feb 5, 2026 by acisseJZhong
Implement sharding and device mesh debug tool ciflow/8gpu CLA Signed
#2328 opened Feb 5, 2026 by fegin
Disable DDP averaging to avoid repeated gradient averaging bug CLA Signed fb-exported meta-exported
#2323 opened Feb 4, 2026 by Shagun-G
Apply #1895 only when really necessary CLA Signed
#2322 opened Feb 4, 2026 by ericschreiber
Fixed autoparallel integration tests on ROCm. CLA Signed module: rocm
#2321 opened Feb 4, 2026 by wenchenvincent
[No Merge] Debug autoparallel test ci CLA Signed
#2317 opened Feb 3, 2026 by wenchenvincent Draft
Register _ScaledPartial placement CLA Signed
#2313 opened Feb 2, 2026 by Aidyn-A
separate out training for fault tolerance ciflow/8gpu CLA Signed
#2311 opened Feb 2, 2026 by tushar00jain
Add Transformer-Engine Fused_Adam Optimizer Support CLA Signed
#2293 opened Jan 29, 2026 by vivekgoe Draft
[draft][lora] Apply LoraLinear as a wrapper of Linear ciflow/8gpu CLA Signed
#2288 opened Jan 28, 2026 by mori360 Draft
[FSDP2] enable per-param mesh FSDP2 for MoE ciflow/8gpu CLA Signed
#2281 opened Jan 28, 2026 by weifengpy Draft
[DeepEP Integration] Free cache after combine for forward only path. CLA Signed
#2274 opened Jan 25, 2026 by elfiegg Draft
Enable graph_pp for autoparallel in torchtitan ciflow/8gpu CLA Signed
#2271 opened Jan 23, 2026 by sanketpurandare Draft