
Introduce OVPipelineQuantizationConfig#1310

Merged
echarlaix merged 19 commits into huggingface:main from
nikita-savelyevv:ns/pipeline-quantization-config
May 28, 2025

Conversation

@nikita-savelyevv
Contributor

@nikita-savelyevv nikita-savelyevv commented May 15, 2025

What does this PR do?

Changes:

  • Introduced OVPipelineQuantizationConfig, which allows specifying quantization parameters per model component. Added corresponding tests.
  • Introduced more advanced logic for inferring the quantization config type from a dictionary, and moved it into a separate function: _quantization_config_from_dict().
  • Updated the default int4 config for phi4-multimodal. WWB similarity: 85.30%.

For example, the code below applies int8 PTQ to lm_model, int8 weight compression to text_embeddings_model, and no optimization to vision_embeddings_model.

from optimum.intel import OVModelForVisualCausalLM
from optimum.intel import OVPipelineQuantizationConfig, OVQuantizationConfig, OVWeightQuantizationConfig

model_id = "OpenGVLab/InternVL2-1B"
model = OVModelForVisualCausalLM.from_pretrained(
    model_id,
    export=True,
    trust_remote_code=True,
    quantization_config=OVPipelineQuantizationConfig(
        quantization_configs={
            "lm_model": OVQuantizationConfig(bits=8),
            "text_embeddings_model": OVWeightQuantizationConfig(bits=8),
        },
        dataset="contextual",
        trust_remote_code=True,
    )
)
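The change list above also mentions inferring the quantization config type from a plain dictionary via _quantization_config_from_dict(). A minimal sketch of what such dispatch logic could look like is shown below; the stand-in classes mimic the shape of the optimum-intel configs, and the `config_from_dict` helper and its `weight_only` heuristic are illustrative, not the PR's actual implementation:

```python
# Illustrative sketch of dispatching a dict to a quantization config type.
# These stand-in classes only mimic the real optimum-intel configs; the
# actual inference logic lives in _quantization_config_from_dict().

class OVWeightQuantizationConfig:
    def __init__(self, bits=8, **kwargs):
        self.bits = bits

class OVQuantizationConfig:
    def __init__(self, bits=8, **kwargs):
        self.bits = bits

class OVPipelineQuantizationConfig:
    def __init__(self, quantization_configs, **kwargs):
        self.quantization_configs = quantization_configs

def config_from_dict(d):
    """Hypothetical helper: pick a config class based on dict keys."""
    if "quantization_configs" in d:
        # Per-component sub-dicts are resolved recursively.
        sub = {k: config_from_dict(v) for k, v in d["quantization_configs"].items()}
        return OVPipelineQuantizationConfig(quantization_configs=sub)
    if d.get("weight_only", False):
        return OVWeightQuantizationConfig(**{k: v for k, v in d.items() if k != "weight_only"})
    return OVQuantizationConfig(**d)

cfg = config_from_dict({
    "quantization_configs": {
        "lm_model": {"bits": 8},
        "text_embeddings_model": {"bits": 8, "weight_only": True},
    }
})
print(type(cfg).__name__)  # OVPipelineQuantizationConfig
```

The key design point is that a pipeline-level dict nests one config dict per model component, so the inference has to recurse into each entry before choosing a leaf config type.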

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@nikita-savelyevv nikita-savelyevv marked this pull request as ready for review May 19, 2025 18:09
@nikita-savelyevv
Contributor Author

@l-bat Could you please review this PR?

@nikita-savelyevv nikita-savelyevv requested a review from eaidova May 20, 2025 13:55
     model,
     calibration_datasets["model"],
-    subset_size=quantization_config.num_samples,
+    subset_size=quantization_config.num_samples or 128,
Member

why is this value hidden here?

Contributor Author

128 was previously the default value for the num_samples argument in OVQuantizationConfig. In this PR I've removed it, so it is now None by default. That's why 128 appears here.

Ideally, we should transition to providing arguments via quantization_config.to_nncf_dict() here, as is done for the OV case. But I propose to do this in a separate PR.
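The default handling discussed in this thread can be sketched in isolation: num_samples previously defaulted to 128, now defaults to None, so the call site restores the old behavior with `num_samples or 128`. The to_nncf_dict() mapping below is a simplified, hypothetical version that centralizes the same fallback; the real method in optimum-intel covers more parameters:

```python
# Simplified stand-in for OVQuantizationConfig to illustrate the fallback.

class OVQuantizationConfig:
    def __init__(self, num_samples=None):
        # num_samples is None by default after this PR (was 128 before).
        self.num_samples = num_samples

    def to_nncf_dict(self):
        # Map config fields to nncf.quantize() keyword args, applying the
        # legacy default of 128 in one place instead of at each call site.
        return {"subset_size": self.num_samples or 128}

print(OVQuantizationConfig().to_nncf_dict())                # {'subset_size': 128}
print(OVQuantizationConfig(num_samples=32).to_nncf_dict())  # {'subset_size': 32}
```

Note that `x or 128` treats both None and 0 as "use the default", which is the usual intent for a sample-count parameter.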

Member

@IlyasMoutawwakil IlyasMoutawwakil left a comment

LGTM!

@nikita-savelyevv
Contributor Author

Hi @echarlaix! Do you mind us merging this, or would you like to take a look?

@echarlaix
Collaborator

apologies for the delay @nikita-savelyevv, taking a look right now!

Collaborator

@echarlaix echarlaix left a comment

Looks great, thanks @nikita-savelyevv!!

@echarlaix echarlaix merged commit 54b40e1 into huggingface:main May 28, 2025
18 checks passed