forked from huggingface/optimum-intel
Refactor and Add Tests #2
Open
apaniukov wants to merge 76 commits into slyalin:openvino_tokenizers from apaniukov:openvino_tokenizers
Conversation
* add llama and bloom in ipex inference tests
* only add llama tests
* add position_ids in forward
* check if jit model needs position_ids
* use MODEL_TYPES_REQUIRING_POSITION_IDS
* fix has_position_ids
* fix position_ids length
* rm useless params
* check model inputs by input names
* fix format
* check input names in graph model
* fix style
* consider eager model in input_names
* add input names
* add text input names
* fix style
* Update optimum/intel/generation/modeling.py
* fix format
* Update optimum/intel/generation/modeling.py

Co-authored-by: Ella Charlaix <ella@huggingface.co>
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
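The position_ids handling above boils down to feeding `position_ids` only when the traced graph actually declares that input. A minimal sketch of that check (not the actual optimum-intel code; the helper names are illustrative):

```python
import torch

def get_input_names(jit_model: torch.jit.ScriptModule) -> list:
    # Graph inputs include "self" for module methods, so skip the first entry.
    return [inp.debugName() for inp in jit_model.graph.inputs()][1:]

def forward_with_optional_position_ids(jit_model, input_ids, attention_mask):
    inputs = {"input_ids": input_ids, "attention_mask": attention_mask}
    if "position_ids" in get_input_names(jit_model):
        # Recreate position_ids from the attention mask, as transformers does.
        position_ids = attention_mask.long().cumsum(-1) - 1
        position_ids.masked_fill_(attention_mask == 0, 1)
        inputs["position_ids"] = position_ids
    return jit_model(**inputs)
```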
* f32 precision for compare-with-transformers tests: allow these tests to pass locally on devices where inference runs in FP16 (GPU) or BF16 (recent Xeon) by default
* Add F32_CONFIG constant for modeling tests
* Replace get_version with is_openvino_version
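Forcing f32 precision in these tests comes down to passing an inference-precision hint through `ov_config`. A hedged sketch, assuming `F32_CONFIG` wraps the standard OpenVINO property key:

```python
from optimum.intel import OVModelForSequenceClassification

# Standard OpenVINO precision hint; the tests wrap it in an F32_CONFIG constant.
F32_CONFIG = {"INFERENCE_PRECISION_HINT": "f32"}

model = OVModelForSequenceClassification.from_pretrained(
    "hf-internal-testing/tiny-random-bert",  # placeholder model id
    export=True,
    ov_config=F32_CONFIG,  # keeps outputs comparable with the FP32 transformers reference
)
```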
Maybe it's better to re-create the PR to optimum-intel from apaniukov:openvino_tokenizers?
…quest-wrapper-typo Fix typo inside InferRequestWrapper
* Add try for get_property
* refine code

Co-authored-by: fishbell <wangwang.wang@intel.com>
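The "try for get_property" change guards against plugins that do not support a queried property. Roughly, as a sketch rather than the merged code:

```python
import openvino as ov

core = ov.Core()
try:
    precision = core.get_property("CPU", "INFERENCE_PRECISION_HINT")
except RuntimeError:
    # Not every device/plugin supports every property; fall back gracefully.
    precision = None
```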
…i-fix Fix error with optimum-cli export openvino --help
* Bump min torch version
* comment
* add torch min version
* Allow loading of stateful models (no patching yet)
* Stateful models support
* Fix forward for chatglm
* Passing stateful as a dedicated parameter
* Fixed possibly misaligned types in ShapeOf Concat sub-expression
* Fixed critical typo in infer_request invocation
* Apply bettertransformer when model is converted in stateful mode
* Correct default value handling for stateful flag
* Apply bettertransformer under try-except to avoid crashes when model is not supported
* Added --stateful option in optimum-cli
* Raise if too old a version of openvino is used and stateful=True
* Fix openvino version check to be compatible with openvino-nightly
* Fix for bloom family
* Fix general code style and apply renaming suggestions
* fix version checking if openvino not in site-packages
* use reset_state if available
* remove input patch in bettertransformer apply
* add tests
* add type hints and update doc strings
* added more tests
* Fixed outdated signature of InferRequest wrapper to fix one of the quantizer tests
* Switch to stateful model by default
* Stateful models support
* Fix forward for chatglm
* Passing stateful as a dedicated parameter
* Apply bettertransformer when model is converted in stateful mode
* Raise if too old a version of openvino is used and stateful=True
* Fix openvino version check to be compatible with openvino-nightly
* Fix for bloom family
* fix test and add beam_idx attribute
* apply review comments
* stateful by default fixes
* less aggressive stateful
* ensure that task supports stateful
* remove debug print
* Apply suggestions from code review
* update requirements and warning messages
* fix cli export
* Update optimum/exporters/openvino/__main__.py

Co-authored-by: Sergey Lyalin <sergey.lyalin@intel.com>
Co-authored-by: Helena Kloosterman <helena.kloosterman@intel.com>
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
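Taken together, the two stateful commits move the KV-cache inside the OpenVINO model's state. A sketch of the user-facing surface they describe (the `stateful` keyword is taken from the commit titles; treat exact defaults as assumptions):

```python
from optimum.intel import OVModelForCausalLM

# Export with the KV-cache folded into model state; per the commits this
# became the default where the task supports it.
model = OVModelForCausalLM.from_pretrained("gpt2", export=True, stateful=True)
```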
…uggingface#518)
* Use f32 precision for compare-to-diffusers tests
* Add f32 ov_config to training/stable diffusion tests
apaniukov force-pushed the openvino_tokenizers branch from f2a2877 to 57782d1 on January 18, 2024 10:43
Reuse existing preprocessors
* add IPEX model and README; update ipex modeling and add cases for text-generation and text-classification
* fix style
* IPEX modeling refactoring
* typo
* remove use_cache arg when loading model
* fix style
* move tests
* remove readme
* add test
* add warning if use_cache mismatch
* fix
* format
* update setup
* add use_cache attribute

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Feng, Jiqing <jiqing.feng@intel.com>
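For context, the commits above add IPEX counterparts to the OV model classes. A hedged usage sketch (the class name follows optimum-intel's IPEX modeling; details are assumptions):

```python
from transformers import AutoTokenizer
from optimum.intel import IPEXModelForSequenceClassification

model_id = "distilbert-base-uncased-finetuned-sst-2-english"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
# export=True converts the transformers model to an IPEX-optimized jit model
model = IPEXModelForSequenceClassification.from_pretrained(model_id, export=True)
outputs = model(**tokenizer("Great library!", return_tensors="pt"))
```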
* add IPEX model for QA task
* add fix
…huggingface#533)
* Expose InferRequestWrapper class so it can be imported from elsewhere
* Fix
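Exposing InferRequestWrapper matters because quantization calibration works by swapping it in for a model's infer request. A sketch of that pattern, assuming the import path from the quantization module:

```python
from optimum.intel.openvino.quantization import InferRequestWrapper

calibration_data = []
# Wrap the compiled model's request so every infer() call records its inputs.
model.request = InferRequestWrapper(model.request, calibration_data)
# ...run a handful of representative inferences; calibration_data fills up...
```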
…#536)
* add test
* format
* Add image classification task
* Add test
huggingface#537)
* relax the requirement to have a registered normalized config for using converted decoder models
* add property for access to normalized config
* fix torch version for ipex tests
* disable tests for torch versions incompatible with ipex
* fix
…ce#544)
* Refactor IPEX CausalLM for better model arch scale
* Fix style
* Handle autocast in IPEXModel.forward
* Handle missing torch_dtype in config
* Handle autocast in IPEXModel.forward
* Handle missing torch_dtype in config
* Warmup IPEX models at init
* Minor fix
* Fix _init_warmup use_cache condition
* Fix output handling in IPEX question answering
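The autocast and warmup items could look roughly like this (a sketch under the stated commit titles, not the merged implementation):

```python
import torch

class IPEXModelSketch:
    def __init__(self, model, config):
        self.model = model
        # torch_dtype may be absent from older configs; default to float32
        self._dtype = getattr(config, "torch_dtype", None) or torch.float32
        self._warmup()

    def _warmup(self):
        # jit-traced models typically finish optimizing after a couple of passes,
        # so run dummy inputs at init rather than on the first real request
        dummy = {
            "input_ids": torch.ones(1, 4, dtype=torch.long),
            "attention_mask": torch.ones(1, 4, dtype=torch.long),
        }
        for _ in range(2):
            self.forward(**dummy)

    def forward(self, **inputs):
        if self._dtype in (torch.bfloat16, torch.float16):
            # run under CPU autocast only for reduced-precision dtypes
            with torch.autocast("cpu", dtype=self._dtype):
                return self.model(**inputs)
        return self.model(**inputs)
```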
…ommit Fix openvino/test_training.py
* Initial code for load_in_4_bit
* Dataset does not work
* Intermediate changes
* Make it work with dataset
* Style
* Fixed small issue
* Fixed failed tests
* Style
* Commented out tests failing due to NNCF 2.8
* Commented out failing tests until new NNCF release
* Added tests for load_in_4bit
* Added awq option. Included NNCF package into openvino extra.
* Rolled back including nncf into openvino extra
* Style
* Fixed tests
* Fixed issues with models larger than 1B. Added tests.
* Style
* Fixed issues. Applied comments.
* Removed unnecessary exception
* Applied more comments
* Fixed issue
* Make quantization_config a part of OVConfig in OVQuantizer
* Fixed issue with Transformers
* Fixed test
* Changed the naming. Added additional tests
* Fixed tests
* Fixed tests
* Applied more comments
* Style
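The end state of the 4-bit work is a one-flag load path. A sketch matching the commit titles (`load_in_4bit` appears there; exact compression defaults are assumptions):

```python
from optimum.intel import OVModelForCausalLM

# Weights of linear layers get compressed to INT4 via NNCF on export.
model = OVModelForCausalLM.from_pretrained("gpt2", export=True, load_in_4bit=True)
```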
…#535)
* skip compression weights tests for nncf==2.8.0 and reworked the logic of optimizing stateful PyTorch models
* black happy
* ruff happy
* updated nncf version
* replied to comments
* replied to comments
* typo
* cherry-picked fixes for tests from PR 538
* replied to comments

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
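The nncf==2.8.0 skip is a plain version gate; something like the following (the marker shown is a sketch, not the repo's actual helper):

```python
import nncf
import pytest

skip_on_nncf_2_8 = pytest.mark.skipif(
    nncf.__version__.startswith("2.8"),
    reason="weight compression tests are broken with nncf==2.8.0",
)
```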