Reasoning parser #3859

ShaoZhang0115 · 2025-02-25T18:27:06Z

Motivation

Rewrite #3202

Modifications

add --enable-reasoning and --reasoning-parser options for deepseek r1 series models.
return reasoning_content as in official api, ref: https://api-docs.deepseek.com/zh-cn/guides/reasoning_model, in both streaming and non-streaming chat completions.
Example:

python -m sglang.launch_server --host 0.0.0.0 \
--model-path deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B \
--tp 1 --enable-reasoning --reasoning-parser deepseek-r1

curl --location --request POST 'http: //localhost:30000/v1/chat/completions' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer YOUR_API_KEY' \
--data '{
    "model": "default",
    "messages": [
        {
            "role": "user",
            "content": "Calculate 1 + 3"
        }
    ],
    "stream": false
}'

Get response:

{
    "id": "53de20f7f1244195826e7b52011c37a4",
    "object": "chat.completion",
    "created": 1740507802,
    "model": "default",
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "\n\n**Solution:**\n\nTo calculate \\(1 + 3\\), follow these easy steps:\n\n1. **Identify the numbers to add:**  \n   You have the number **1** and the number **3**.\n\n2. **Add the numbers together:**  \n   \\[\n   1 + 3 = 4\n   \\]\n\n3. **Final Answer:**  \n   \\[\n   \\boxed{4}\n   \\]",
                "reasoning_content": "To calculate the sum of 1 and 3, I will begin by identifying the two numbers involved in the addition. The first number is 1, and the second number is 3.\n\nNext, I will add these two numbers together. Adding 1 and 3 gives me a total of 4.\n\nTherefore, the result of 1 plus 3 is 4.\n",
                "tool_calls": null
            },
            "logprobs": null,
            "finish_reason": "stop",
            "matched_stop": 151643
        }
    ],
    "usage": {
        "prompt_tokens": 11,
        "total_tokens": 179,
        "completion_tokens": 168,
        "prompt_tokens_details": null
    }
}

Docs with be updated as soon as possible.

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.

…ions

shuaills · 2025-02-25T18:59:10Z

python/sglang/srt/reasoning_parser.py

+        self.think_start_token = "<think>"
+        self.think_end_token = "</think>"


Can we extend this to all reasoning models? Not just dpsk R1. There might be different thinking tokens.

I think different reasoning models need different parers, and I add docs for it.

…asoning-parser

xihuai18 · 2025-02-26T07:12:30Z

Add Docs
Test with streaming and non-streaming cases, with truncated or non-truncated max-tokens for reasoning.

xihuai18 · 2025-02-26T07:13:27Z

However, I can not pass my tests with --enable-torch-compile, which is confusing.

…g logic

…m:xihuai18/sglang into reasoning-parser

xihuai18 added 6 commits February 25, 2025 22:44

[Feature] Add reasoning parser support to chat generation and complet…

3c4b823

…ions

fix: add choices for reasoning-parser

1727ba4

Fix reasoning_parser.py

4759bed

fix: handle possible "<think>\n" in reasoning parser

31e4dd5

fix: update help text for reasoning parser to include supported models

d19b7a0

fix: update reasoning parser to handle changes in DeepSeek output format

addaeb5

ShaoZhang0115 requested review from merrymercy, Ying1123, hnyls2002, zhyncs, ispobock and ByronHsu as code owners February 25, 2025 18:27

Merge branch 'main' into reasoning-parser

b11076b

shuaills suggested changes Feb 25, 2025

View reviewed changes

xihuai18 added 4 commits February 26, 2025 11:44

fix: refine reasoning parser output handling and clean up response logic

f362249

Merge branch 'main' into reasoning-parser

0b0b087

docs: add reasoning parser documentation for DeepSeek model support

2ed2b6d

Merge branch 'reasoning-parser' of github.com:xihuai18/sglang into re…

3c7f1a7

…asoning-parser

xihuai18 added 7 commits February 26, 2025 19:57

Merge branch 'main' into reasoning-parser

94f1db5

refactor: enhance DeepSeekR1ReasoningParser initialization and parsin…

6cedd5f

…g logic

Merge branches 'reasoning-parser' and 'reasoning-parser' of github.co…

30b938e

…m:xihuai18/sglang into reasoning-parser

refactor: remove NotImplementedError from BaseReasoningParser class

857297c

refactor: simplify BaseReasoningParser initialization and parsing logic

9b209ba

refactor: streamline buffer handling in BaseReasoningParser

7c61f89

refactor: improve text handling in v1_chat_generate_response function

d3aa86f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reasoning parser #3859

Reasoning parser #3859

ShaoZhang0115 commented Feb 25, 2025

shuaills Feb 25, 2025

xihuai18 Feb 26, 2025

xihuai18 commented Feb 26, 2025

xihuai18 commented Feb 26, 2025

		self.think_start_token = "<think>"
		self.think_end_token = "</think>"

Reasoning parser #3859

Are you sure you want to change the base?

Reasoning parser #3859

Conversation

ShaoZhang0115 commented Feb 25, 2025

Motivation

Modifications

Checklist

shuaills Feb 25, 2025

Choose a reason for hiding this comment

xihuai18 Feb 26, 2025

Choose a reason for hiding this comment

xihuai18 commented Feb 26, 2025

xihuai18 commented Feb 26, 2025