Implemented frontend docs #3791

simveit · 2025-02-22T21:25:28Z

Motivation

Rewrote frontend docs as jupyter notebook.

shuaills

Great job

shuaills · 2025-02-22T21:47:36Z

docs/frontend/frontend.ipynb

+    "from sglang.test.test_utils import is_in_ci\n",
+    "\n",
+    "if is_in_ci():\n",
+    "    from patch import launch_server_cmd\n",


CI failed, patch.py is not in proper place.

yes i noticed. I will adjusted that and fix it tomorrow in better way.

Actually I am not sure if there is an easy way without using python project or sys to make patch.py accessible in all the subfolders (both of which IMO would be overly complex and make the notebooks less readable...)
Maybe @zhaochenyang20 has a better idea.
Also the CI still failed, it seemed the !wget https://github.com/sgl-project/sglang/blob/main/test/lang/example_image.png?raw=true -O example_image.png i use didn't work because after the image got not recognized. Locally this worked of course. Any ideas about why that is?

I agree with the current design of the patch. I mean including patch.py in sub-directories. I think for the image, you can refer to this https://docs.sglang.ai/backend/openai_api_vision.html

ok i fixed the issue with wget.

import requests def download_image(url, filename): response = requests.get(url) if response.status_code == 200: with open(filename, "wb") as f: f.write(response.content) print(f"Successfully downloaded {filename}") else: print(f"Failed to download image: Status code {response.status_code}") image_url = "https://github.com/sgl-project/sglang/blob/main/test/lang/example_image.png?raw=true" download_image(image_url, "example_image.png")

this code solved the problem

…o feature/frontend-docs

simveit · 2025-02-23T12:34:20Z

Before we merge this I will update the engine API now. I will put this also into this PR.

…d vlm in engine doc, refactored frontend.

simveit · 2025-02-23T16:13:17Z

@shuaills @zhaochenyang20
I implemented vlm for engine doc, fixed mistake in example for engine with vlm (the script wants to access some tokenizer_manager attribute of engine which doesnt exist) and moved hidden_states extraction to examples.
For some reason the CI failed for the openai completions notebook which i didnt modify.
Shuai could you take a look at this please, i am not on computer now.

zhaochenyang20 · 2025-02-23T23:38:21Z

@simveit I rerun it and let's see what will happen. Are you agree with the docs? @shuaills I can merge it

zhaochenyang20 · 2025-02-24T09:56:15Z

@simveit @shuaills I will wait approval from Shuai. But LGTM

shuaills · 2025-02-24T14:11:35Z

docs/backend/offline_engine_api.ipynb

+    "import os\n",
+    "\n",
+    "os.environ[\"CUDA_VISIBLE_DEVICES\"] = \"1\""


Remove this

sure, sorry this was for my local dev setup.

shuaills · 2025-02-24T14:13:50Z

docs/backend/offline_engine_api.ipynb

+    "image_token = conv.image_token\n",
+    "\n",
+    "# Convert image to bytes\n",
+    "image = Image.open(\"example_image.png\")\n",


Can we use url here? So we don't need to manage file system.

i will adjust that to use the load_image function from utils.py (see below)

shuaills · 2025-02-24T14:16:33Z

examples/runtime/engine/offline_batch_inference_vlm.py

+            f.write(response.content)
+        print(f"Successfully downloaded {filename}")
+    else:
+        print(f"Failed to download image: Status code {response.status_code}")


Download to file system may introduce some problems, can we use url instead?

I don't believe that will make make a large difference. Under the hood something like

def load_image(image_file: Union[str, bytes]): from PIL import Image image = image_size = None if isinstance(image_file, bytes): image = Image.open(BytesIO(image_file)) elif image_file.startswith("http://") or image_file.startswith("https://"): timeout = int(os.getenv("REQUEST_TIMEOUT", "3")) response = requests.get(image_file, timeout=timeout) image = Image.open(BytesIO(response.content)) elif image_file.lower().endswith(("png", "jpg", "jpeg", "webp", "gif")): image = Image.open(image_file) elif image_file.startswith("data:"): image_file = image_file.split(",")[1] image = Image.open(BytesIO(base64.b64decode(image_file))) elif image_file.startswith("video:"): image_file = image_file.replace("video:", "") image, image_size = decode_video_base64(image_file) elif isinstance(image_file, str): image = Image.open(BytesIO(base64.b64decode(image_file))) else: raise ValueError(f"Invalid image: {image}") return image, image_size

will be called under the hood if we provide an url.
A syntax you use in this notebook was tried to use before for the example and it didn't run. (threw error).

But maybe we can use this function from above instead of custom code. this should be more elegant. i will adjust the code to that.

this will have especially benefit we don't save anything as a file and I agree that this is little bit undesirable.

The code mostly operates in memory, except when handling local file paths. But if we do open(filename, "wb") , it will accesse the file system.

yes, i will adjust to use to utils function from sglang code base from above

I fixed this @shuaills @zhaochenyang20 . The current implemetation is much easier and cleaner.

…o feature/frontend-docs

simveit · 2025-02-24T20:47:30Z

@zhaochenyang20 @shuaills i moved also the send request to the start and renamed section to "Getting started". Let me know if that is not what you intendend

zhaochenyang20 · 2025-02-25T16:41:09Z

Sure. let shuai review this. And I will merge it after his approval.

zhaochenyang20 · 2025-02-25T16:41:14Z

@shuaills Thanks!

zhaochenyang20 · 2025-02-26T18:36:15Z

To me, i will remove VLM Inference and the get hidden state in these docs. But I will tell the users at the place I point to, "You can refer to these examples for VLM offline inference and getting hidden states."

For the getting hidden state examples, please make one, just following https://docs.sglang.ai/backend/offline_engine_api.html#Return-Hidden-States

@simveit

simveit · 2025-02-26T18:38:21Z

To me, i will remove `VLM Inference` and the get hidden state in these docs. But I will tell the users at the place I point to, "You can refer to these examples for [VLM offline inference](https://github.com/sgl-project/sglang/blob/main/examples/runtime/engine/offline_batch_inference_vlm.py) and [getting hidden states](xxx)."
For the getting hidden state examples, please make one, just following https://docs.sglang.ai/backend/offline_engine_api.html#Return-Hidden-States

@simveit

I both included hidden states example and also fixed the vlm example for engine in this PR. let me remove the example from notebook later and refer to the links

zhaochenyang20 · 2025-02-26T18:38:15Z

docs/index.rst


   start/install.md
+   start/send_request.ipynb


if you move send_reqeusts here, the patch would fail and send_request.ipynb will take large VRAM. But I think you can add one patch.py here.

i think we already have patch.py in start dir for this PR (see changed files)

zhaochenyang20 · 2025-02-26T18:53:29Z

examples/runtime/engine/hidden_states.py

@@ -0,0 +1,30 @@
+import sglang as sgl


add demonstration here. And saying that we are working on moving get_hidden_state to a sampling parameter rather than a server argument.

…o feature/frontend-docs

simveit added 2 commits February 22, 2025 21:20

Implemented frontend docs

4fb4932

Adjust structure and removed markdown

d35fd60

simveit mentioned this pull request Feb 22, 2025

[Feature] Reorganize all the docs #3596

Open

2 tasks

shuaills self-requested a review February 22, 2025 21:30

shuaills suggested changes Feb 22, 2025

View reviewed changes

Duplicate patch to frontend folder to make CI run. Fix that later.

5b908e6

simveit force-pushed the feature/frontend-docs branch from 1b63fd2 to 5b908e6 Compare February 22, 2025 21:50

zhaochenyang20 and others added 4 commits February 22, 2025 17:19

Merge branch 'main' into feature/frontend-docs

b442bc9

Try to fix problem with wget

b028232

Merge branch 'main' into feature/frontend-docs

956c35f

Merge branch 'feature/frontend-docs' of github.com:simveit/sglang int…

f08eda3

…o feature/frontend-docs

Moved hidden states to examples, fixed vlm engine example, implemente…

cb05784

…d vlm in engine doc, refactored frontend.

Merge branch 'main' into feature/frontend-docs

3da982f

shuaills suggested changes Feb 24, 2025

View reviewed changes

simveit added 3 commits February 24, 2025 21:20

Improved image handling

4b03cde

Merge branch 'feature/frontend-docs' of github.com:simveit/sglang int…

319aa8c

…o feature/frontend-docs

Moved sending request to getting started

8b08d46

Merge branch 'main' into feature/frontend-docs

b7ca041

zhaochenyang20 requested changes Feb 26, 2025

View reviewed changes

Doc for hidden states, pointing to examples instead of including them

083f0c9

simveit added 2 commits February 26, 2025 20:40

fixup

c2ed910

small fix

afe2dd0

simveit force-pushed the feature/frontend-docs branch from 5fc29aa to afe2dd0 Compare February 26, 2025 20:02

zhaochenyang20 marked this pull request as ready for review February 26, 2025 20:09

zhaochenyang20 and others added 5 commits February 26, 2025 12:09

Merge branch 'main' into feature/frontend-docs

e749e01

move back to backend

48e4f2e

Merge branch 'feature/frontend-docs' of github.com:simveit/sglang int…

e0bc971

…o feature/frontend-docs

fix doctree

3d1dc96

fix doctree

a5ef802

zhaochenyang20 approved these changes Feb 26, 2025

View reviewed changes

Merge branch 'main' into feature/frontend-docs

c5ee617

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented frontend docs #3791

Implemented frontend docs #3791

simveit commented Feb 22, 2025

shuaills left a comment

shuaills Feb 22, 2025

simveit Feb 22, 2025

simveit Feb 22, 2025 •

edited

Loading

zhaochenyang20 Feb 23, 2025

simveit Feb 23, 2025 •

edited

Loading

simveit commented Feb 23, 2025

simveit commented Feb 23, 2025

zhaochenyang20 commented Feb 23, 2025

zhaochenyang20 commented Feb 24, 2025

shuaills Feb 24, 2025

simveit Feb 24, 2025

shuaills Feb 24, 2025

simveit Feb 24, 2025

shuaills Feb 24, 2025

simveit Feb 24, 2025

simveit Feb 24, 2025 •

edited

Loading

simveit Feb 24, 2025

shuaills Feb 24, 2025

simveit Feb 24, 2025

simveit Feb 24, 2025

simveit commented Feb 24, 2025

zhaochenyang20 commented Feb 25, 2025

zhaochenyang20 commented Feb 25, 2025

zhaochenyang20 commented Feb 26, 2025

simveit commented Feb 26, 2025

zhaochenyang20 Feb 26, 2025

simveit Feb 26, 2025

zhaochenyang20 Feb 26, 2025

Implemented frontend docs #3791

Are you sure you want to change the base?

Implemented frontend docs #3791

Conversation

simveit commented Feb 22, 2025

Motivation

shuaills left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simveit Feb 22, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simveit Feb 23, 2025 • edited Loading

Choose a reason for hiding this comment

simveit commented Feb 23, 2025

simveit commented Feb 23, 2025

zhaochenyang20 commented Feb 23, 2025

zhaochenyang20 commented Feb 24, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simveit Feb 24, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simveit commented Feb 24, 2025

zhaochenyang20 commented Feb 25, 2025

zhaochenyang20 commented Feb 25, 2025

zhaochenyang20 commented Feb 26, 2025

simveit commented Feb 26, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simveit Feb 22, 2025 •

edited

Loading

simveit Feb 23, 2025 •

edited

Loading

simveit Feb 24, 2025 •

edited

Loading