Pull requests: NVIDIA/TransformerEngine
Pull requests list
Add device-Initiated Grouped GEMM supporting m_splits on device
#2360
opened Nov 7, 2025 by
QiZhangNV
1 of 13 tasks
[JAX] Make all jax attention calls use non-packed common calls
#2358
opened Nov 6, 2025 by
pggPL
8 of 13 tasks
[JAX] Support for checkpointing quantizations
#2356
opened Nov 6, 2025 by
jberchtold-nvidia
8 of 13 tasks
[PyTorch] Fix amax computation using output_t data in normalization
#2355
opened Nov 6, 2025 by
negvet
1 of 13 tasks
[PyTorch][NVFP4][MOE] NVFP4 Grouped Hadamard Amax Kernel
#2351
opened Nov 6, 2025 by
zhongbozhu
4 of 17 tasks
[JAX] Fused layers argument default values changed
#2347
opened Nov 5, 2025 by
tdophung
6 of 13 tasks
[Core] Fix inconsistent logic in C++ tensor class
#2330
opened Nov 1, 2025 by
timmoon10
7 of 13 tasks
[Common] Added an optimized gated rowwise MXFP8 SwiGLU kernel
#2328
opened Oct 31, 2025 by
Oleg-Goncharov
5 of 13 tasks
[Pytorch] change fused cross entropy backward grad to fp32 and reduce one read/…
#2325
opened Oct 31, 2025 by
RandMist
8 of 13 tasks
[PyTorch] Implement Selective Activation Checkpointing for LayerNormMLP with checkpoint flag
#2311
opened Oct 28, 2025 by
jaimec00
7 of 13 tasks
[JAX] Make test tolerances stricter
#2306
opened Oct 27, 2025 by
jberchtold-nvidia
8 of 13 tasks
[common] Misc improvements for attention
2.10.0
#2272
opened Oct 15, 2025 by
cyanguwa
8 of 13 tasks