Add install and build instructions, refactor docs structure #22
Conversation
Looking good so far!
src/README.md (outdated)

```md
## Supported models

1. chatglm
   1. https://huggingface.co/THUDM/chatglm2-6b - refer to
      [chatglm2-6b - AttributeError: can't set attribute](../../../llm_bench/python/doc/NOTES.md#chatglm2-6b---attributeerror-cant-set-attribute)
```
Not sure if we need to keep troubleshooting hint links in the "Supported Models" table.
But if we do need such links (currently only the chatglm2-6b and Qwen-7B-Chat-Int4 models have them), I would suggest placing them at the end of the SUPPORTED_MODELS.md file in a "Troubleshooting" section.
No need to have them. These are for llm_bench.
```diff
@@ -1,6 +1,17 @@
-## GenAI Pipeline Repository
+# OpenVINO™ GenAI
```
Out of curiosity, is the ™ required? The benefit of not having it is that it's potentially easier to scan the repo for non-ASCII chars. But we don't have such a check for now, so no need to change.
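If a repo-wide check for non-ASCII characters is ever wanted, here is a minimal sketch of one way to do it (assuming GNU grep with PCRE support; not something this PR adds):

```sh
# List files containing any character outside the ASCII range,
# skipping the .git directory.
grep -rlP '[^\x00-\x7F]' --exclude-dir=.git .
```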
src/docs/BUILD.md (outdated)

```sh
git clone https://github.com/openvinotoolkit/openvino.genai.git
cd openvino.genai
git submodule update --init --recursive
```
Suggested change:

```diff
-git clone https://github.com/openvinotoolkit/openvino.genai.git
-cd openvino.genai
-git submodule update --init --recursive
+git clone --recursive https://github.com/openvinotoolkit/openvino.genai.git
+cd openvino.genai
```
Fixed
```sh
cmake -DCMAKE_BUILD_TYPE=Release -S ./ -B ./build/
cmake --build ./build/ --config Release --target package -j
cmake --install ./build/ --config Release --prefix ov
./ov/samples/cpp/build_samples.sh -i ./s\ pace
```
`s pace` is used in tests to verify that our cmake vars handle it correctly. You shouldn't encourage having spaces.
I'm also not sure whether installation is part of the build process. Samples can be run from the build folder; they are installed to `ov` in tests to verify that it will work from the package.
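For illustration, a sketch of how the build snippet could read without the space-containing prefix and without the install step (assuming samples are runnable from the build tree, as noted above; the exact sample locations under ./build/ depend on the project layout):

```sh
# Configure and build in Release mode; no install step is needed
# just to try the samples from the build folder.
cmake -DCMAKE_BUILD_TYPE=Release -S ./ -B ./build/
cmake --build ./build/ --config Release -j
# Samples can then be run directly from their output location under ./build/.
```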
src/docs/INSTALL_ARCHIVE.md (outdated)

```sh
curl -L https://storage.openvinotoolkit.org/repositories/openvino_genai/packages/2024.1/linux/l_openvino_genai_toolkit_ubuntu22_2024.1.0.15008.f4afc983258_x86_64.tgz --output openvino_genai_2024.1.0.tgz
tar -xf openvino_genai_2024.1.0.tgz
sudo mv l_openvino_genai_toolkit_ubuntu24_2024.1.0.15008.f4afc983258_x86_64 /opt/intel/openvino_genai_2024.1.0
```
Please replace `2024.1.0` with `2024.2.0` everywhere. It'll be published for `2024.2.0`.
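Not part of the thread, but a sketch of one way such a bulk replacement could be done (GNU sed assumed; the src/docs path is taken from the files under review, and the resulting diff should be checked before committing):

```sh
# Replace the version string in every doc file that mentions it (in-place edit).
grep -rl '2024\.1\.0' src/docs | xargs sed -i 's/2024\.1\.0/2024\.2\.0/g'
```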
Fixed
```sh
sudo mv l_openvino_genai_toolkit_ubuntu22_2024.1.0.15008.f4afc983258_x86_64 /opt/intel/openvino_genai_2024.1.0
```

For other operating systems, please refer to the guides in documentation:
Since you cover ubuntu20 and 22 separately, you should add a TODO for ubuntu24, which is going to be supported starting with 2024.2.
Added a placeholder command for ubuntu24 and a TODO for updating links to archives.
Basically, the install commands for different Ubuntu versions are quite similar; only the archive links differ. Does it make sense to keep only one (generic) set of Ubuntu instructions and provide links to the docs for other OSes?
Yes, I think one is better.
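For illustration only, a sketch of what a single generic set of instructions might look like; the angle-bracket values are placeholders, not real links or folder names:

```sh
# Download and unpack the archive for your Ubuntu version, then move it to
# the conventional install location. <...> values must be filled in by the reader.
curl -L <archive-url-for-your-ubuntu-version> --output openvino_genai.tgz
tar -xf openvino_genai.tgz
sudo mv <extracted-archive-folder> /opt/intel/openvino_genai_<version>
```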
LLMs return logits with probabilities for each token; these probabilities can be converted to tokens/words with different techniques: greedy decoding, beam search decoding, random sampling, etc. This requires writing user-unfriendly post-processing even for the simplest scenario of greedy decoding. In order to make life easier, we combined all decoding scenarios into a single function call, where the decoding method and parameters are specified by arguments. In this PR we provide a user-friendly API for text generation inspired by the `generate` method from the HuggingFace transformers library.

- [x] enable calling tokenizers/detokenizers from LLMPipeline
- [ ] add callback for streaming mode - done partially, needs to be improved
- [x] rewritten samples with the current approach: [causal_lm/cpp/generate_pipeline/generate_sample.cpp#L73-L83](https://github.com/pavel-esir/openvino.genai/blob/generate_pipeline/text_generation/causal_lm/cpp/generate_pipeline/generate_sample.cpp#L73-L83)
- [x] Multibatch greedy decoding
- [ ] Speculative decoding
- [ ] Grouped Beam Search decoding: ready for batch 1, need to rebase multibatch support after merging openvinotoolkit#349
- [x] Random sampling

Example 1: Greedy search generation

```
LLMPipeline pipe(model_path, device);

// Will try to load config from generation_config.json,
// but if not found, default values for greedy search will be used.
GenerationConfig config = pipe.generation_config();

cout << pipe(prompt, config.max_new_tokens(20));
```

Example 2: TextStreaming mode

```
LLMPipeline pipe(model_path, device);

GenerationConfig config = pipe.generation_config();

auto text_streamer = TextStreamer{pipe};
auto text_streamer_callback = [&text_streamer](std::vector<int64_t>&& tokens, LLMPipeline& pipe){
    text_streamer.put(tokens[0]);
};

pipe(prompt, config.max_new_tokens(20).set_callback(text_streamer_callback));
text_streamer.end();
```

CVS-132907 CVS-137920

---------

Co-authored-by: Wovchena <vladimir.zlobin@intel.com>
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
Co-authored-by: Alexander Suvorov <alexander.suvorov@intel.com>
Co-authored-by: Yaroslav Tarkan <yaroslav.tarkan@intel.com>
Co-authored-by: Xiake Sun <xiake.sun@intel.com>
Co-authored-by: wenyi5608 <93560477+wenyi5608@users.noreply.github.com>
Co-authored-by: Ekaterina Aidova <ekaterina.aidova@intel.com>
Co-authored-by: guozhong wang <guozhong.wang@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
Closing as the branch is retargeted.
Previous PR for comments reference: pavel-esir#22

---------

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>
Co-authored-by: Zlobin Vladimir <vladimir.zlobin@intel.com>