-
Notifications
You must be signed in to change notification settings - Fork 203
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix optimum-cli command for VLM example in README #1348
Fix optimum-cli command for VLM example in README #1348
Conversation
@AlexKoff88 @nikita-savelyevv on Xeon the method with --dataset, both with and without num-samples, gives an empty result with the README example. With --group-size 16 I do get a good result. Also tried to test on laptop yesterday, but there the model export did not work (unrelated to this). |
build_jenkins |
For now the commands suggested by Alexander and Nikita don't work with GenAI (see comments and linked issue). There's a ticket about that. Can anyone suggest another model that works well? Smaller is better. Otherwise I'll remove the INT4 suggestion for now, until the issue is fixed in GenAI. |
|
Thank you! That's a much smaller model, much more user friendly, especially on AI PC. I tested and it works well out of the box with GenAI with the example from the README (after installing einops and timm). I'll update the PR with this. If someone disagrees, let me know. The fix in optimum-intel is not enough for this because then exporting will work, but inference with GenAI doesn't. When that is fixed in an upcoming GenAI release we could change back to MiniCPM - though I see no downside for having smaller models in README examples. |
Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>
Co-authored-by: Nikita Savelyev <nikita.savelyev@intel.com>
1ec6978
to
3a83895
Compare
Changed README to use |
build_jenkins |
With the existing command users get an error: Channel size 4304 should be divisible by size of group 128. --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> Co-authored-by: Nikita Savelyev <nikita.savelyev@intel.com> Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
With the existing command users get an error: Channel size 4304 should be divisible by size of group 128. --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> Co-authored-by: Nikita Savelyev <nikita.savelyev@intel.com> Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
With the existing command users get an error: Channel size 4304 should be divisible by size of group 128.