Conversation

@dsikka dsikka commented Sep 8, 2025

Summary

  • More generic handling of compression format selection when running FP4 compression (a sketch of the resulting logic follows the list):
  1. If the quantization strategy is group / tensor_group / channel with NVFP4, use packed-quantized
  2. Otherwise, if running W4A4, use float-quantized
  3. Otherwise, use naive-quantized
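
For context, a minimal sketch of the selection logic described above. The function name, parameters, and returned strings are illustrative assumptions that mirror the three cases in the summary; they are not the library's actual API.

```python
# Illustrative sketch only -- not the actual implementation from this PR.
# Names and return values are hypothetical, matching the summary above.

def infer_fp4_compression_format(strategy, weight_type, weight_bits,
                                 input_type=None, input_bits=None):
    """Pick a compression format string for an FP4 quantization scheme."""
    is_fp4_weight = weight_type == "float" and weight_bits == 4
    is_fp4_input = input_type == "float" and input_bits == 4

    # 1. Group / tensor_group / channel FP4 (nvfp4) weights can be bit-packed.
    if is_fp4_weight and strategy in {"group", "tensor_group", "channel"}:
        return "packed-quantized"

    # 2. Otherwise, a full W4A4 float scheme is stored as float-quantized.
    if is_fp4_weight and is_fp4_input:
        return "float-quantized"

    # 3. Anything else falls back to the generic naive-quantized format.
    return "naive-quantized"


if __name__ == "__main__":
    # nvfp4 weights with the tensor_group strategy -> "packed-quantized"
    print(infer_fp4_compression_format("tensor_group", "float", 4, "float", 4))
```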

@dsikka changed the title from "[FP4] Update to make compression handling to be more generic for fp4" to "[FP4] Update to make compression handling more generic for fp4" on Sep 8, 2025
Base automatically changed from update_format to main September 8, 2025 22:32