generated from amazon-archives/__template_Apache-2.0
Pull requests: aws-neuron/neuronx-distributed-inference
Pull requests list
Add Trinity model family (AfmoeForCausalLM) contrib
#55 opened Feb 27, 2026 by jimburtoft
11 of 12 tasks
feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models
#35 opened Feb 13, 2026 by lifelongeeek
8 of 10 tasks