Add an initial warmup step to `IPEXModel`s #543

ofirzaf · 2024-01-31T03:34:08Z

What does this PR do?

The first 2 forwards of an IPEXModel after trace/load includes background optimizations steps that make the output of these forwards unpredictable and non consistent with the model after the optimizations. To fix that, an initial warmup step was added to the __init__ of IPEXModels

Depends on PR #542

@echarlaix

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2024-01-31T08:01:54Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

echarlaix

LGTM

optimum/intel/ipex/modeling_base.py

echarlaix · 2024-01-31T13:34:24Z

optimum/intel/ipex/modeling_base.py

+    @wraps(IPEXModel.forward)
    def forward(self, *args, **kwargs):
-        outputs = self.model(*args, **kwargs)
+        outputs = super().forward(*args, **kwargs)


why is this needed ?

The prepare_jit_inputs looks at the signature of the function and the wraps and super help avoid code copy

would prefer we avoid as it will fail in case outputs is not a dict

optimum-intel/optimum/intel/ipex/modeling_base.py

Line 193 in 8ee487d

return ModelOutput(**outputs) if isinstance(outputs, dict) else ModelOutput(logits=outputs[0])

also not sure to see the link with prepare_jit_inputs

In _init_warmup we call prepare_jit_inputs which examines the passed model's forward signature to see which dummy inputs exists in the signature. If we don't use wraps we get the signature of
self, *args, **kwargs
instead of
self, input_ids: torch.Tensor, attention_mask: torch.Tensor, token_type_ids: torch.Tensor = None, **kwargs,

outputs will always be a dict because this is the output of IPEXModel.forward, no?

OK I understand, was thinking that prepare_jit_inputs was only used for the torchscript export but I see that it's also used in _init_warmup, thanks for the clarification

here I'm talking about outputs https://github.com/huggingface/optimum-intel/blob/8ee487dc2ade5bd0023d1bbe0a0103d6af8821e0/optimum/intel/ipex/modeling_base.py#L192C9-L192C16

echarlaix approved these changes Jan 31, 2024

View reviewed changes

optimum/intel/ipex/modeling_base.py Outdated Show resolved Hide resolved

ofirzaf added 4 commits January 31, 2024 03:13

Handle autocast in IPEXModel.forward

4751164

Handle missing torch_dtype in config

0edb5c4

Warmup IPEX models at init

742ff39

Minor fix

1012770

echarlaix reviewed Jan 31, 2024

View reviewed changes

ofirzaf added 3 commits January 31, 2024 07:44

Merge branch 'main' into ipex-warmup

e5b425d

Fix _init_warmup use_cache condition

d797cc9

Fix output handling in IPEX question answering

abb7b00

echarlaix merged commit 788e458 into huggingface:main Jan 31, 2024
7 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add an initial warmup step to `IPEXModel`s #543

Add an initial warmup step to `IPEXModel`s #543

ofirzaf commented Jan 31, 2024

HuggingFaceDocBuilderDev commented Jan 31, 2024

echarlaix left a comment

echarlaix Jan 31, 2024

ofirzaf Jan 31, 2024

echarlaix Jan 31, 2024

echarlaix Jan 31, 2024

ofirzaf Jan 31, 2024

echarlaix Jan 31, 2024 •

edited

Loading

Add an initial warmup step to IPEXModels #543

Add an initial warmup step to IPEXModels #543

Conversation

ofirzaf commented Jan 31, 2024

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Jan 31, 2024

echarlaix left a comment

Choose a reason for hiding this comment

echarlaix Jan 31, 2024

Choose a reason for hiding this comment

ofirzaf Jan 31, 2024

Choose a reason for hiding this comment

echarlaix Jan 31, 2024

Choose a reason for hiding this comment

echarlaix Jan 31, 2024

Choose a reason for hiding this comment

ofirzaf Jan 31, 2024

Choose a reason for hiding this comment

echarlaix Jan 31, 2024 • edited Loading

Choose a reason for hiding this comment

Add an initial warmup step to `IPEXModel`s #543

Add an initial warmup step to `IPEXModel`s #543

echarlaix Jan 31, 2024 •

edited

Loading