apply sdpa for mpt and internlm #676

eaidova · 2024-04-22T13:18:30Z

What does this PR do?

Optimizes the mpt and internlm models with scaled dot-product attention (SDPA).
Fixes export of the baichuan-13b model.
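
For context, here is a minimal sketch of what scaled dot-product attention computes for a single head, in plain Python. It is an illustration only, not the patch from this PR: the function name, the `causal` flag, and the list-based inputs are assumptions. The actual change presumably relies on PyTorch's fused `torch.nn.functional.scaled_dot_product_attention` rather than a hand-written loop.

```python
import math


def scaled_dot_product_attention(query, key, value, causal=False):
    """Reference attention for one head: softmax(Q K^T / sqrt(d)) V.

    query is a list of query vectors; key and value are equally long
    lists of key and value vectors (all plain Python lists of floats).
    """
    d = len(query[0])
    scale = 1.0 / math.sqrt(d)
    output = []
    for i, q in enumerate(query):
        # Scaled similarity between this query and every key.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in key]
        if causal:
            # Hide future positions (j > i) from the current query.
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        # Numerically stable softmax over the scores.
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Attention output is the weighted sum of the value vectors.
        width = len(value[0])
        output.append([sum(w * v[c] for w, v in zip(weights, value)) for c in range(width)])
    return output


if __name__ == "__main__":
    q = [[1.0, 0.0], [0.0, 1.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[1.0, 2.0], [3.0, 4.0]]
    print(scaled_dot_product_attention(q, k, v, causal=True))
```

Expressing attention as one operation, instead of separate matmul, softmax, and matmul steps, is what lets an exporter map it to a single fused kernel.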

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2024-04-22T16:27:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

eaidova · 2024-04-23T05:50:19Z

@echarlaix could you please take a look?

optimum/exporters/openvino/convert.py

AlexKoff88 · 2024-04-24T11:44:58Z

Can we have a test for each model architecture that is updated in this PR?

optimum/exporters/openvino/model_patcher.py

eaidova · 2024-04-24T15:21:02Z

Can we have a test for each model architecture that is updated in this PR?

This is an update for models that are already covered by tests. I only added a baichuan model, which is based on a different code version, to the tests; mpt and internlm remain unchanged.

tests/openvino/utils_tests.py

eaidova force-pushed the ea/mpt_sdpa branch from 555e44a to ee02349 on April 22, 2024 16:22

eaidova mentioned this pull request Apr 22, 2024

Convert failed of baichuan-inc/Baichuan2-13B-Chat openvinotoolkit/openvino.genai#357

Closed

eaidova force-pushed the ea/mpt_sdpa branch from ee02349 to c674492 on April 22, 2024 16:27

AlexKoff88 reviewed Apr 24, 2024

optimum/exporters/openvino/convert.py (outdated)

echarlaix reviewed Apr 24, 2024

optimum/exporters/openvino/model_patcher.py (outdated)

optimum/exporters/openvino/model_patcher.py (outdated)

eaidova force-pushed the ea/mpt_sdpa branch 2 times, most recently from cd9633c to ad62a15, on April 24, 2024 16:37

eaidova requested review from AlexKoff88 and echarlaix on April 24, 2024 16:37

eaidova force-pushed the ea/mpt_sdpa branch from ad62a15 to a141858 on April 24, 2024 16:44

AlexKoff88 approved these changes Apr 25, 2024

eaidova added 5 commits April 25, 2024 13:54

apply sdpa for mpt and internlm

529381e

fix bauchan-13b

7a2bdf3

fix accuracy

6713059

small refactoring

93e77a1

add test for baichuan 13b

5aa30ed

eaidova force-pushed the ea/mpt_sdpa branch from a141858 to 7eafa22 on April 25, 2024 09:55

add support output_attentions

e60872c

eaidova force-pushed the ea/mpt_sdpa branch from 7eafa22 to e60872c on April 25, 2024 09:59

eaidova added 4 commits April 25, 2024 16:28

Merge branch 'main' into ea/mpt_sdpa

9f072b2

Merge branch 'main' into ea/mpt_sdpa

0b9b253

code style

4f36b6f

Merge branch 'main' into ea/mpt_sdpa

2f960af

eaidova force-pushed the ea/mpt_sdpa branch from c121390 to 2f960af on April 25, 2024 16:02

Merge branch 'main' into ea/mpt_sdpa

f75d18d

echarlaix approved these changes Apr 30, 2024

tests/openvino/utils_tests.py (outdated)

Update tests/openvino/utils_tests.py

9bde686

echarlaix merged commit e1b6a59 into huggingface:main Apr 30, 2024
11 checks passed
