
Fix optimum-cli command for VLM example in README #1348

Merged

Conversation

helena-intel (Collaborator)

With the existing command, users get an error: `Channel size 4304 should be divisible by size of group 128`.
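For context: the INT4 weight compression here is group-wise, so each weight channel size must be divisible by the group size, and a 4304-wide channel is not divisible by the default group size of 128 (128 * 33 = 4224, 128 * 34 = 4352). Below is a minimal sketch of the failure and the workaround discussed later in this thread, assuming the README's original command exported MiniCPM with `--weight-format int4`; the exact model ID and output directory are illustrative.

```sh
# Hypothetical reconstruction of the failing README command
# (the default INT4 group size of 128 does not divide 4304):
optimum-cli export openvino --model openbmb/MiniCPM-V-2_6 --trust-remote-code \
    --weight-format int4 MiniCPM-V-2_6
# -> Channel size 4304 should be divisible by size of group 128

# A group size that divides 4304 avoids the error, e.g. 16 (16 * 269 = 4304):
optimum-cli export openvino --model openbmb/MiniCPM-V-2_6 --trust-remote-code \
    --weight-format int4 --group-size 16 MiniCPM-V-2_6
```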

ilya-lavrenov added this to the 2025.0 milestone Dec 9, 2024
ilya-lavrenov added the `port to LTS` label (PR needs to be ported to LTS) Dec 9, 2024
helena-intel (Collaborator, Author)

@AlexKoff88 @nikita-savelyevv on Xeon, the method with `--dataset`, both with and without `--num-samples`, gives an empty result with the README example. With `--group-size 16` I do get a good result. I also tried to test on a laptop yesterday, but there the model export did not work (unrelated to this PR).
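For reference, a sketch of the data-aware variant being tested here, under the same assumptions as the earlier sketch; the dataset name and sample count are illustrative, not necessarily the exact values used.

```sh
# Data-aware INT4 compression; per this comment, this variant gave empty results on Xeon:
optimum-cli export openvino --model openbmb/MiniCPM-V-2_6 --trust-remote-code \
    --weight-format int4 --dataset contextual --num-samples 32 \
    MiniCPM-V-2_6
```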

ilya-lavrenov (Contributor)

build_jenkins

helena-intel (Collaborator, Author) commented Dec 12, 2024

For now, the commands suggested by Alexander and Nikita don't work with GenAI (see the comments and the linked issue); there's a ticket about that. Can anyone suggest another model that works well? Smaller is better. Otherwise I'll remove the INT4 suggestion for now, until the issue is fixed in GenAI.

nikita-savelyevv (Contributor)

> For now, the commands suggested by Alexander and Nikita don't work with GenAI (see the comments and the linked issue); there's a ticket about that. Can anyone suggest another model that works well? Smaller is better. Otherwise I'll remove the INT4 suggestion for now, until the issue is fixed in GenAI.

OpenGVLab/InternVL2-1B is quite small and works with VLMPipeline. But I would recommend waiting for the fix huggingface/optimum-intel#1058 to be merged.

helena-intel (Collaborator, Author)

> OpenGVLab/InternVL2-1B is quite small and works with VLMPipeline. But I would recommend waiting for the fix huggingface/optimum-intel#1058 to be merged.

Thank you! That's a much smaller model and much more user-friendly, especially on AI PC. I tested it, and it works well out of the box with GenAI using the example from the README (after installing einops and timm). I'll update the PR with this. If someone disagrees, let me know.

The fix in optimum-intel is not enough for this on its own: with it, exporting will work, but inference with GenAI still doesn't. When that is fixed in an upcoming GenAI release, we could change back to MiniCPM, though I see no downside to having smaller models in README examples.
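A sketch of the export steps this settles on, assuming the standard optimum-cli INT4 flags; the output directory name is illustrative.

```sh
# InternVL2-1B requires these extra packages, as noted above:
pip install einops timm

# INT4 export of the smaller model:
optimum-cli export openvino --model OpenGVLab/InternVL2-1B --trust-remote-code \
    --weight-format int4 InternVL2-1B
```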

helena-intel and others added 4 commits December 13, 2024 14:43
Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>
Co-authored-by: Nikita Savelyev <nikita.savelyev@intel.com>
helena-intel force-pushed the helena/fix-vlm-optimumcli branch from 1ec6978 to 3a83895 on December 13, 2024 13:45
helena-intel changed the title from "Fix MiniCPM optimum-cli command in README" to "Fix optimum-cli command for VLM example in README" Dec 13, 2024
helena-intel (Collaborator, Author)

Changed the README to use OpenGVLab/InternVL2-1B with INT4. With FP16/FP32 model export, inference with the README example is empty; I added that to the ticket about empty output with MiniCPM. For now, this model with INT4 works and is quick to download and convert, and since it's just for an inference example in the README, INT4 should be fine.
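For anyone reproducing the FP16 comparison mentioned above, a hedged sketch; the output directory name is illustrative.

```sh
# FP16 export of the same model; per this comment, inference with the README
# example then produces empty output (tracked in the MiniCPM empty-output ticket):
optimum-cli export openvino --model OpenGVLab/InternVL2-1B --trust-remote-code \
    --weight-format fp16 InternVL2-1B-fp16
```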

Wovchena enabled auto-merge December 16, 2024 07:50
ilya-lavrenov (Contributor)

build_jenkins

Wovchena added this pull request to the merge queue Dec 16, 2024
github-merge-queue bot pushed a commit that referenced this pull request Dec 16, 2024
github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 16, 2024
Wovchena added this pull request to the merge queue Dec 17, 2024
Merged via the queue into openvinotoolkit:master with commit a651292 Dec 17, 2024
59 checks passed
ilya-lavrenov added a commit to ilya-lavrenov/openvino.genai that referenced this pull request Jan 5, 2025
ilya-lavrenov added a commit that referenced this pull request Jan 7, 2025
Ported:
- #1348
- #1410
- #1406
- #1424

---------

Co-authored-by: Helena Kloosterman <helena.kloosterman@intel.com>
Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>
Co-authored-by: Nikita Savelyev <nikita.savelyev@intel.com>
Co-authored-by: Irina Efode <irina.efode@intel.com>
ilya-lavrenov removed the `port to LTS` label (PR needs to be ported to LTS) Jan 7, 2025