-
Notifications
You must be signed in to change notification settings - Fork 430
Open
Labels
Description
- [P0] Support emulated mode Support emulated mode for mxfp8 moe training to support non-sm100 CI or dev env #3598
- [P0] Fix torch reference for per group blocked layout for groups along K
- [P1] Add B200 to CI [BE] Add B200 runner to CI #2964 and add tests to workflow triggered by label
- [P0] Unify mxfp8 dense + moe code/configs
- [P0] Expose
wgrad_with_hprecipe in quantize_ api and torchtitan - [P1] Expose
use_triton_for_dim0_castknob in quantize_ api and torchtitan - [P1] Remove
use_cuda_kernel_for_blocked_layoutknob in quantize_ api, autoselect instead - [P1] Ship gb200 compatible wheels
- [P0] Add better documentation for how to use each building block individually, padding group sizes, etc
Reactions are currently unavailable