Add support for granite and granitemoe models #1099
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Related to huggingface/optimum-intel#1099: added the ability to test these models via llm_bench. Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
LGTM, thanks for investigating the MoE tracing problem!
LGTM!
# copied from https://github.com/huggingface/transformers/blob/v4.47.1/src/transformers/models/granitemoe/modeling_granitemoe.py#L281
def _granite_moe_parallel_experts_forward(self, inputs, expert_size):
    output_list = []
    # differences from the original:
    # 1) after the gating patch, expert_size is a tensor instead of a list of ints,
    #    so the original inputs.split(expert_size) can no longer be used
    # 2) expert input slices are obtained one by one via index_start:next_index
    #    instead of precomputing all splits once before the loop
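For context, a rough sketch of what the fully patched forward could look like (an assumption based on the upstream GraniteMoE ParallelExperts code, not the exact code from this PR; it presumes self.num_experts and a stacked self.weight tensor of shape (num_experts, output_size, input_size) as in transformers):

import torch
import torch.nn.functional as F

def _granite_moe_parallel_experts_forward(self, inputs, expert_size):
    # iterate over experts, slicing the routed tokens incrementally so that
    # expert_size can be a tensor (inputs.split would require a list of ints)
    output_list = []
    index_start = 0
    for expert_idx in range(self.num_experts):
        next_index = index_start + expert_size[expert_idx]
        # apply this expert's linear weights to its slice of the inputs
        output_list.append(F.linear(inputs[index_start:next_index], self.weight[expert_idx]))
        index_start = next_index
    return torch.cat(output_list, dim=0)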
super helpful, thanks!
Thank you eaidova and team for your work on this! Very instructive about how the IR format works. I checked out this branch in a fresh conda environment and inspected the changes locally, yet I am still getting errors that the optimum exporters extension is missing when running via the CLI tool or through export=True in from_pretrained. I still have a lot to learn about advanced package management with Python, but once I hit the unrecognized export config error I figured it might be useful to share here.
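For reference, a minimal sketch of the two paths being exercised above (the model id and output directory are placeholders, and this assumes the branch's granite/granitemoe export configs are installed in the environment; it is not the exact command from this thread):

# CLI path (placeholder model id and output directory):
#   optimum-cli export openvino --model ibm-granite/granite-3.0-2b-instruct ov_granite
# Python path: export=True converts the checkpoint to OpenVINO IR on the fly and
# fails with an "unrecognized export config" error if the installed optimum-intel
# does not register the granite/granitemoe export configs.
from optimum.intel import OVModelForCausalLM

model = OVModelForCausalLM.from_pretrained(
    "ibm-granite/granite-3.0-2b-instruct", export=True
)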
@SearchSavior in a clean env, after you check out this branch (or even on main) you should do
What does this PR do?
Fixes #1097
Before submitting