Skip to content

Pull requests: NVIDIA/TensorRT-Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix: supporting gpt-oss HF eagle
#398 opened Oct 1, 2025 by h-guo18 Loading…
Remove Qwen tokenizer modification
#390 opened Sep 29, 2025 by cjluo-nv Loading…
Updated Prune-KD NeMo flow
#382 opened Sep 26, 2025 by AAnoosheh Loading…
megatron realquant FP8 WIP
#367 opened Sep 24, 2025 by cjluo-nv Draft
QLoRA DDP export
#353 opened Sep 22, 2025 by sugunav14 Loading…
4 of 6 tasks
add new trainer
#352 opened Sep 22, 2025 by h-guo18 Draft
Feat: Hardware-aware autoquant
#343 opened Sep 19, 2025 by h-guo18 Draft
support W4afp8 quant in v3.1
#337 opened Sep 18, 2025 by Bruce-x-1997 Loading…
FP8 Block quantize onnx export support
#324 opened Sep 15, 2025 by jingyu-ml Loading…
add end_process in deepseek ptq
#317 opened Sep 15, 2025 by Bruce-x-1997 Loading…
Update eagle notebook example with sglang
#316 opened Sep 14, 2025 by jamieliNVIDIA Loading…
Fix the bug in realquant
#301 opened Sep 8, 2025 by yeyu-nvidia Loading…
Fix speculative decoding example stale Not updated in a long time
#214 opened Jun 13, 2025 by Framartin Loading…
Bump the pip group across 3 directories with 1 update
#205 opened Jun 5, 2025 by dependabot bot Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.