
Fix activation gradient backprop in GPTQ #1197

Merged: 2 commits merged into main on Sep 8, 2024

Conversation


@irenaby (Collaborator) commented on Sep 1, 2024

Pull Request Description:

Add a freeze_quant_params flag to the base trainable quantizer, defaulting to False.
Implement quantization-parameter freezing for the STE activation quantizers.
Use trainable activation quantizers (with frozen quantization parameters) in GPTQ instead of inferable quantizers, so that gradients backpropagate through the activations; see the sketch below.
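
To make the mechanism concrete, here is a minimal PyTorch sketch of the idea, not the MCT implementation itself: names such as `STEActivationQuantizer` and `ste_round` are illustrative, and only the `freeze_quant_params` flag mirrors this PR. The point is that freezing stops gradient updates to the quantization parameters while activation gradients still flow through the straight-through estimator.

```python
import torch
import torch.nn as nn


def ste_round(t: torch.Tensor) -> torch.Tensor:
    # Round in the forward pass; behave as identity in the backward pass
    # (straight-through estimator), so gradients reach the input.
    return t + (torch.round(t) - t).detach()


class STEActivationQuantizer(nn.Module):
    """Illustrative unsigned uniform fake-quantizer with an STE backward pass."""

    def __init__(self, num_bits: int = 8, init_threshold: float = 1.0,
                 freeze_quant_params: bool = False):
        super().__init__()
        self.num_levels = 2 ** num_bits - 1
        # The quantization threshold is trainable by default; passing
        # freeze_quant_params=True excludes it from gradient updates while
        # activation gradients still backpropagate through the quantizer.
        self.threshold = nn.Parameter(torch.tensor(init_threshold),
                                      requires_grad=not freeze_quant_params)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        scale = self.threshold / self.num_levels
        # Fake-quantize: scale, round (STE), clamp to the grid, rescale.
        return torch.clamp(ste_round(x / scale), 0, self.num_levels) * scale
```

A quick check of the frozen behavior under these assumptions:

```python
q = STEActivationQuantizer(freeze_quant_params=True)
x = torch.randn(4, requires_grad=True)
q(x).sum().backward()
assert x.grad is not None        # activation gradients flow (STE)
assert q.threshold.grad is None  # frozen quant params receive no gradient
```
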

Checklist before requesting a review:

  • I set the appropriate labels on the pull request.
  • I have added/updated the release note draft (if necessary).
  • I have updated the documentation to reflect my changes (if necessary).
  • All functions and files are well documented.
  • All functions and classes have type hints.
  • There is a license header in all files.
  • The function and variable names are informative.
  • I have checked for code duplications.
  • I have added new unittest (if necessary).

@irenaby marked this pull request as ready for review on September 4, 2024 at 07:50
@irenaby merged commit 75e9f83 into main on Sep 8, 2024
32 checks passed