Ipex doc #828
Conversation
All IPEX docs have been added, please take a look @echarlaix cc @rbrugaro
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
@echarlaix Please trigger the tests. Thx.
Hi @echarlaix. For the IPEX doc, I have removed the export section and anything related to jit. It will not confuse users even if we change jit.trace to torch.compile, because we don't mention it in the docs and users' behavior will not change. We need an IPEX doc to show users how to use IPEXModel, without telling them the backend details, WDYT?
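For context, a minimal sketch of the user-facing API being discussed, assuming `IPEXModelForCausalLM` from `optimum.intel` as documented in this PR; the model id is illustrative, not mandated by the docs:

```python
from transformers import AutoTokenizer
from optimum.intel import IPEXModelForCausalLM

model_id = "gpt2"  # illustrative model id, not from this PR
model = IPEXModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("Hello, IPEX!", return_tensors="pt")
# Whether the backend traces with jit.trace or uses torch.compile is
# invisible at this level, which is the point of the comment above.
outputs = model.generate(**inputs, max_new_tokens=8)
print(tokenizer.decode(outputs[0]))
```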
docs/source/ipex/inference.mdx
Outdated

# Inference

Optimum Intel can be used to load models from the [Hub](https://huggingface.co/models) and create pipelines to run inference with IPEX optimizations (including patching, weight prepacking and graph mode) on a variety of Intel processors (currently only CPU is supported).
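A hedged sketch of the pipeline usage this paragraph describes, assuming an IPEX model drops into a regular `transformers` pipeline the same way a stock model does; the model id and task are illustrative:

```python
from transformers import AutoTokenizer, pipeline
from optimum.intel import IPEXModelForSequenceClassification

# Illustrative checkpoint; any Hub model for the task should work the same way.
model_id = "distilbert-base-uncased-finetuned-sst-2-english"
model = IPEXModelForSequenceClassification.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# The IPEX-optimized model is passed to the pipeline like any other model.
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
print(classifier("Optimum Intel makes inference on Intel CPUs easy."))
```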
What's the plan for the model format in the next release? From my understanding we will stop using TorchScript, so I'd prefer to wait for the next release, once we have something that won't get deprecated, before adding it to the documentation.
Do you mean the PyTorch release? If so, it depends on the models' performance under torch.compile. If all models get an acceptable speed-up under torch.compile, we will remove jit.trace and apply torch.compile; otherwise, we will keep TorchScript or convert parts of the models to torch.compile. I will discuss it with you before I take any action.

I don't think waiting for the next release is the best option, since torch.compile is not under our control. Currently, torch.compile does not work on all models for all tasks; many performance regressions need to be fixed. We will apply torch.compile to models one by one. Changing all models to compile without any issues is impossible for now, so it's a long-term project. Besides, we will not change the API of IPEXModel, so it's okay to deliver this IPEX doc to users.
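To make the trade-off concrete, a toy sketch contrasting the two paths; the module below is a stand-in for a real model, not code from this PR:

```python
import torch

# Toy stand-in for a transformer model; not code from this PR.
class Toy(torch.nn.Module):
    def forward(self, x):
        return torch.nn.functional.relu(x @ x.T)

model = Toy().eval()
example = torch.randn(4, 4)

# TorchScript path used today: tracing records one graph from the example
# inputs, so the exported artifact is frozen to that shape and control flow.
traced = torch.jit.trace(model, example)

# Candidate replacement: torch.compile keeps the model in eager mode and
# compiles kernels on first call, so there is no separate export step.
compiled = torch.compile(model)

print(torch.allclose(traced(example), compiled(example)))
```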
…will be changed to compile in the future
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
Hi @echarlaix. Do you think this PR could be merged, since all tests passed?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hi @echarlaix. I just noticed that we don't have any IPEX doc on the Hugging Face doc hub.
This is the first step, which enables the optimum-cli command to export models.
I will add the IPEX doc in the next step (in this PR); please review the optimum-cli command for IPEX. Thx!