Pull requests: NVIDIA/TransformerEngine

Pull requests list

[PyTorch] Add FA4 Support
#2432 opened Nov 28, 2025 by yaox12 Draft
1 of 16 tasks
[Pytorch][Bug]MXFP8 Split tensor Bug fix
#2427 opened Nov 26, 2025 by vthumbe1503
2 of 13 tasks
[PyTorch] Convert sample tuple to list in cudagraph input reuse
#2426 opened Nov 26, 2025 by buptzyb
13 tasks
Fix FusedAdam DTensor compatibility issue
#2425 opened Nov 26, 2025 by shjwudp
13 tasks
[JAX] Wrapper for Permutation Triton kernel MoE
#2419 opened Nov 25, 2025 by tdophung Draft
9 of 16 tasks
[Common] Add kFloat64 partial support
#2417 opened Nov 24, 2025 by phu0ngng
7 of 13 tasks
[Common] Persistent NVFP4 cast + transpose kernel 2.11.0
#2412 opened Nov 21, 2025 by Oleg-Goncharov
6 of 13 tasks
[Common] NVTEGroupedTensor class and helpers MoE
#2388 opened Nov 14, 2025 by phu0ngng
7 of 13 tasks
Enables specified cp rank slicing
#2387 opened Nov 14, 2025 by jomitchellnv
1 of 13 tasks
[JAX] Re-use RHT matrix constant
#2386 opened Nov 14, 2025 by jberchtold-nvidia Draft
8 of 13 tasks
[Draft] TopK Fusion to JAX MoE
#2385 opened Nov 14, 2025 by mingxu1067
5 of 13 tasks
Set RPATH for cuda libraries from python package
#2381 opened Nov 14, 2025 by take-cheeze Draft
4 of 13 tasks
[JAX] Add CP + THD + AG + Striped>1 + SWA support
#2379 opened Nov 13, 2025 by KshitijLakhani
8 of 13 tasks
[JAX] NVFP4 2D 1x1x for Weight
#2365 opened Nov 10, 2025 by phu0ngng Draft
13 tasks
[JAX] cuBlasMp integration for CollectiveGemm custom op
#2361 opened Nov 7, 2025 by denera
5 of 13 tasks
Add device-Initiated Grouped GEMM supporting m_splits on device MoE
#2360 opened Nov 7, 2025 by QiZhangNV
1 of 13 tasks
Add num_splits support for FA3 backend
#2357 opened Nov 6, 2025 by wdykas
13 tasks
More detailed documentation for recipes
#2343 opened Nov 4, 2025 by pggPL Draft
[Core] Fix inconsistent logic in C++ tensor class
#2330 opened Nov 1, 2025 by timmoon10
7 of 13 tasks
[Common] Added an optimized gated rowwise MXFP8 SwiGLU kernel
#2328 opened Oct 31, 2025 by Oleg-Goncharov
5 of 13 tasks
[Common] Persistent MXFP8 kernel
#2323 opened Oct 30, 2025 by Oleg-Goncharov Draft
13 tasks
[JAX] Make test_layer.py tolerances stricter
#2306 opened Oct 27, 2025 by jberchtold-nvidia
8 of 13 tasks