Add links

atobiszei · atobiszei · commit 4b30914e020c · 2025-09-04T14:28:03.000+02:00
diff --git a/demos/README.md b/demos/README.md
@@ -42,6 +42,7 @@ ovms_demo_bert
 ovms_demo_universal-sentence-encoder
 ovms_demo_benchmark_client
 ovms_string_output_model_demo
+ovms_demos_gguf
 
 ```
 
@@ -61,6 +62,7 @@ OpenVINO Model Server demos have been created to showcase the usage of the model
 |[Long context LLMs](./continuous_batching/long_context/README.md)| Recommendations for handling very long context in LLM models|
 |[Visual Studio Code assistant](./code_local_assistant/README.md)|Use Continue extension to Visual Studio Code with local OVMS serving|
 |[Image Generation](image_generation/README.md)|Generate images|
+|[GGUF models support](gguf/README.md)|Serve GGUF models with OVMS|
 
 
 Check out the list below to see complete step-by-step examples of using OpenVINO Model Server with real world use cases:
diff --git a/demos/gguf/README.md b/demos/gguf/README.md
@@ -2,7 +2,7 @@
 
 This demo shows how to deploy  model with the OpenVINO Model Server.
 
-*Note*: This is experimental feature and issues in accuracy of models may be observed.
+> **NOTE**: This is experimental feature and issues in accuracy of models may be observed.
 
 > **NOTE:** Model downloading feature is described in depth in separate documentation page: [Pulling HuggingFaces Models](../../docs/pull_hf_models.md).
 
@@ -17,12 +17,13 @@ Start with deploying the model:
 :sync: docker
 Start docker container:
 ```bash
+mkdir models
 docker run -d --rm --user $(id -u):$(id -g) -p 8000:8000 -v $(pwd)/models:/models/:rw \
   -e http_proxy=$http_proxy -e https_proxy=$https_proxy -e no_proxy=$no_proxy \
   openvino/model_server:latest \
     --rest_port 8000 \
     --model_repository_path /models/ \
-    --task image_generation \
+    --task text_generation \
     --source_model "Qwen/Qwen2.5-3B-Instruct-GGUF" \
     --gguf_filename qwen2.5-3b-instruct-q4_k_m.gguf \
     --model_name LLM
@@ -33,10 +34,9 @@ docker run -d --rm --user $(id -u):$(id -g) -p 8000:8000 -v $(pwd)/models:/model
 :sync: bare-metal
 ```bat
 mkdir models
-
 ovms --rest_port 8000 ^
   --model_repository_path /models/ ^
-  --task image_generation ^
+  --task text_generation ^
   --source_model "Qwen/Qwen2.5-3B-Instruct-GGUF" ^
   --gguf_filename qwen2.5-3b-instruct-q4_k_m.gguf ^
   --model_name LLM