[PT2] MinMax #3166

AlexanderDokuchaev · 2024-12-23T21:46:37Z

Changes

Introduce the TORCH2 backend
Add MinMax algorithm support for the torch2 backend
Add handle_torch_function to the quantization function so that it can be traced via __torch_function__
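For readers unfamiliar with the term, the general min-max idea can be sketched in plain Python: collect the minimum and maximum of the calibration values, then derive a scale and zero point from that range. This is only an illustration under simplifying assumptions (per-tensor, asymmetric, 256 levels) and not the NNCF implementation; `QuantParams`, `collect_min_max`, `asymmetric_params`, and `fake_quantize` are hypothetical names used only here.

```python
from dataclasses import dataclass
from typing import Iterable, Sequence, Tuple


@dataclass
class QuantParams:
    """Hypothetical container for per-tensor quantization parameters."""
    scale: float
    zero_point: int


def collect_min_max(batches: Iterable[Sequence[float]]) -> Tuple[float, float]:
    """Track the running minimum and maximum over all calibration batches."""
    lo, hi = float("inf"), float("-inf")
    for batch in batches:
        for value in batch:
            lo = min(lo, value)
            hi = max(hi, value)
    if lo > hi:
        raise ValueError("no calibration data was provided")
    return lo, hi


def asymmetric_params(lo: float, hi: float, levels: int = 256) -> QuantParams:
    """Map the observed range [lo, hi] onto the integer grid 0..levels-1."""
    # Widen the range to include zero so that 0.0 is exactly representable.
    lo = min(lo, 0.0)
    hi = max(hi, 0.0)
    span = hi - lo
    if span == 0.0:
        # Degenerate case: every observed value was zero.
        return QuantParams(scale=1.0, zero_point=0)
    scale = span / (levels - 1)
    zero_point = round(-lo / scale)
    return QuantParams(scale=scale, zero_point=zero_point)


def fake_quantize(x: float, params: QuantParams, levels: int = 256) -> float:
    """Quantize to the integer grid, clamp, and map back to a float."""
    q = round(x / params.scale) + params.zero_point
    q = max(0, min(levels - 1, q))
    return (q - params.zero_point) * params.scale


if __name__ == "__main__":
    calibration = [[-1.0, 0.5], [2.0, 0.0]]
    lo, hi = collect_min_max(calibration)
    params = asymmetric_params(lo, hi)
    print(params, fake_quantize(0.7, params))
```

Values outside the calibrated range are clamped to its ends, which is why the quality of the collected statistics matters for the final accuracy.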

Related tickets

152996

Tests

test install

daniil-lyakhov

Great job!
It would be great if you could separate the small typo fixes from the min-max implementation; there are too many small changes for me to review (and probably for others as well).

nncf/experimental/torch2/engine.py

nncf/experimental/torch2/model_transformer.py

tests/torch2/utils.py

daniil-lyakhov · 2024-12-29T18:03:32Z

Are conformance test numbers available?

AlexanderDokuchaev · 2024-12-30T14:37:21Z

| Model | metric | FQ | int8 | metric (N) | FQ (N) | int8 (N) |
|---|---|---|---|---|---|---|
| torchvision/resnet18 | 0.6945 | 30 | 21 | 0.69488 | 30 | 21 |
| timm/mobilenetv3_small_050 | 0.4261 | 62 | 36 | 0.4192 | 62 | 36 |
| timm/deit3_small_patch16_224 | 0.8126 | 74 | 50 | 0.8126 | 74 | 50 |
| timm/crossvit_9_240 | 0.7275 | 112 | 88 | 0.72746 | 112 | 88 |
| hf/bert-base-uncased | | 74 | 77 | | 74 | 77 |

(N) - experimental tracing without F/BC

alexsu52

Do you have any performance numbers (speed and memory consumption) for the t2f tracing approach? I believe the PR would be more valuable if you added them to the PR description.

nncf/experimental/torch2/function_hook/nncf_graph/nncf_graph_builder.py

nncf/quantization/quantize_model.py

nncf/quantization/algorithms/min_max/torch_backend.py

nncf/torch/quantization/layers.py

nncf/experimental/torch2/commands.py

nncf/experimental/torch2/engine.py

AlexanderDokuchaev · 2025-01-27T16:37:30Z

| Model | MinMax time (develop) | MinMax time (PR) | Stat. collection time (develop) | Stat. collection time (PR) | RAM MiB (develop) | RAM MiB (PR) |
|---|---|---|---|---|---|---|
| bert-base-uncased | 0:00:56 | 0:00:52 | 0:00:53 | 0:00:49 | 535 | 553 |
| resnet18 | 0:00:14 | 0:00:14 | 0:00:12 | 0:00:13 | 1297 | 1359 |
| crossvit_9_240 | 0:01:03 | 0:00:56 | 0:00:59 | 0:00:53 | 1477 | 1579 |
| deit3_small_patch16_224 | 0:00:40 | 0:00:38 | 0:00:37 | 0:00:36 | 1746 | 1854 |
| mobilenetv3_small_050 | 0:00:24 | 0:00:24 | 0:00:22 | 0:00:22 | 1231 | 1221 |

nncf/quantization/quantize_model.py

nncf/common/factory.py

alexsu52

LGTM

daniil-lyakhov

Minor

nncf/experimental/torch2/commands.py

nncf/experimental/torch2/model_transformer.py

nncf/experimental/torch2/quantization/quantize_model.py

nncf/quantization/algorithms/min_max/torch_backend.py

tests/torch2/function_hook/quantization/test_quantized_graphs.py

github-actions bot added the NNCF PT, NNCF Common, experimental, NNCF PTQ, and API labels Dec 23, 2024

AlexanderDokuchaev added 7 commits December 24, 2024 01:32

init

1b2bf15

Merge branch 'develop' into ad/pt2_minmax

a4366ab

efficientnet_pytorch==0.7.1

516c716

addict

ab0ceec

dot

af939a4

mypy

873a1df

c

6c62051

AlexanderDokuchaev marked this pull request as ready for review December 25, 2024 09:58

AlexanderDokuchaev requested a review from a team as a code owner December 25, 2024 09:58

Merge branch 'develop' into ad/pt2_minmax

3301f7b

AlexanderDokuchaev requested review from daniil-lyakhov and alexsu52 December 25, 2024 12:34

AlexanderDokuchaev added 8 commits December 25, 2024 20:25

mypy

c3c809c

f

cc9e0fc

none

b18c9a2

no cache for gpu

40466e2

p

656ca73

revert

5d038c2

Merge branch 'develop' into ad/pt2_minmax

1f8ce90

rename

8693917

daniil-lyakhov reviewed Dec 29, 2024

nncf/experimental/torch2/engine.py (outdated)

nncf/experimental/torch2/model_transformer.py

tests/torch2/utils.py

Merge branch 'develop' into ad/pt2_minmax

e2f892e

AlexanderDokuchaev force-pushed the ad/pt2_minmax branch from ef93ff8 to 77cbceb Compare January 25, 2025 22:38

AlexanderDokuchaev force-pushed the ad/pt2_minmax branch from 77cbceb to b91cb9e Compare January 26, 2025 14:57

rm torch2 backend

b91cb9e

alexsu52 reviewed Jan 27, 2025

AlexanderDokuchaev added 3 commits January 27, 2025 13:26

c

2cf3eff

dub

4e39113

com

f9e8200

AlexanderDokuchaev requested a review from alexsu52 January 27, 2025 15:00

alexsu52 reviewed Jan 28, 2025

nncf/quantization/quantize_model.py (outdated)

nncf/common/factory.py

AlexanderDokuchaev added 4 commits January 28, 2025 19:53

c

9a8fc76

f

c1151b8

iter

287e624

Merge branch 'develop' into ad/pt2_minmax

2a33082

alexsu52 approved these changes Jan 29, 2025

daniil-lyakhov reviewed Jan 29, 2025

github-actions bot added the NNCF OpenVINO label Jan 29, 2025

daniil-lyakhov approved these changes Jan 29, 2025

AlexanderDokuchaev added 3 commits January 29, 2025 14:47

comments

33713a1

Merge branch 'develop' into ad/pt2_minmax

98030f0

mypy

2f36c9f

alexsu52 merged commit 5ad9bc4 into openvinotoolkit:develop Jan 29, 2025
17 checks passed
