Skip to content

Commit 4b30914

Browse files
committed
Add links
1 parent 7c7efc8 commit 4b30914

File tree

2 files changed

+6
-4
lines changed

2 files changed

+6
-4
lines changed

demos/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,7 @@ ovms_demo_bert
4242
ovms_demo_universal-sentence-encoder
4343
ovms_demo_benchmark_client
4444
ovms_string_output_model_demo
45+
ovms_demos_gguf
4546
4647
```
4748

@@ -61,6 +62,7 @@ OpenVINO Model Server demos have been created to showcase the usage of the model
6162
|[Long context LLMs](./continuous_batching/long_context/README.md)| Recommendations for handling very long context in LLM models|
6263
|[Visual Studio Code assistant](./code_local_assistant/README.md)|Use Continue extension to Visual Studio Code with local OVMS serving|
6364
|[Image Generation](image_generation/README.md)|Generate images|
65+
|[GGUF models support](gguf/README.md)|Serve GGUF models with OVMS|
6466

6567

6668
Check out the list below to see complete step-by-step examples of using OpenVINO Model Server with real world use cases:

demos/pull_gguf_model.md renamed to demos/gguf/README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
This demo shows how to deploy model with the OpenVINO Model Server.
44

5-
*Note*: This is experimental feature and issues in accuracy of models may be observed.
5+
> **NOTE**: This is experimental feature and issues in accuracy of models may be observed.
66
77
> **NOTE:** Model downloading feature is described in depth in separate documentation page: [Pulling HuggingFaces Models](../../docs/pull_hf_models.md).
88
@@ -17,12 +17,13 @@ Start with deploying the model:
1717
:sync: docker
1818
Start docker container:
1919
```bash
20+
mkdir models
2021
docker run -d --rm --user $(id -u):$(id -g) -p 8000:8000 -v $(pwd)/models:/models/:rw \
2122
-e http_proxy=$http_proxy -e https_proxy=$https_proxy -e no_proxy=$no_proxy \
2223
openvino/model_server:latest \
2324
--rest_port 8000 \
2425
--model_repository_path /models/ \
25-
--task image_generation \
26+
--task text_generation \
2627
--source_model "Qwen/Qwen2.5-3B-Instruct-GGUF" \
2728
--gguf_filename qwen2.5-3b-instruct-q4_k_m.gguf \
2829
--model_name LLM
@@ -33,10 +34,9 @@ docker run -d --rm --user $(id -u):$(id -g) -p 8000:8000 -v $(pwd)/models:/model
3334
:sync: bare-metal
3435
```bat
3536
mkdir models
36-
3737
ovms --rest_port 8000 ^
3838
--model_repository_path /models/ ^
39-
--task image_generation ^
39+
--task text_generation ^
4040
--source_model "Qwen/Qwen2.5-3B-Instruct-GGUF" ^
4141
--gguf_filename qwen2.5-3b-instruct-q4_k_m.gguf ^
4242
--model_name LLM

0 commit comments

Comments
 (0)