Skip to content

Commit 6dc4524

Browse files
committed
[OV]: Fixed inferece after 4 bit weight compression
1 parent 8f7d016 commit 6dc4524

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

optimum/intel/openvino/modeling_decoder.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -592,7 +592,9 @@ def _from_pretrained(
592592
else:
593593
init_cls = cls
594594

595-
causal_model = init_cls(model=model, config=config, model_save_dir=model_cache_path.parent, **kwargs)
595+
causal_model = init_cls(
596+
model=model, config=config, model_save_dir=model_cache_path.parent, compile=not load_in_4bit, **kwargs
597+
)
596598

597599
if load_in_4bit:
598600
if not is_nncf_available():

0 commit comments

Comments
 (0)