
Conversation

TomeHirata
Collaborator

The current streaming listener implementation has an issue where chunk order can be modified (by at most 10 chunks) when a native response chunk is included in string streaming, due to the buffering logic for the string output field. This PR fixes the issue by flushing the buffered chunks once when a native response chunk is received.
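For context, here is a minimal sketch of the failure mode and the flush fix. The class name, chunk shape, and buffer size below are illustrative assumptions, not the actual dspy.streaming implementation:

```python
# Illustrative sketch only: names, chunk shape, and the buffer size are
# assumptions, not the real dspy streaming listener.

END_MARKER_TOKENS = 3  # assume the field end marker can span up to 3 tokens


class SimpleBufferingListener:
    """Buffers string tokens so an end marker split across chunks is never emitted partially."""

    def __init__(self):
        self.buffer = []

    def receive(self, chunk):
        if chunk["content"] is None:
            # Native chunk (e.g. a citation): it carries no text content.
            # Without this flush, the native chunk would be yielded while older
            # string tokens still sit in the buffer, so consumers would see it
            # *before* text that actually arrived earlier (reordering by up to
            # the buffer size).
            yield from self.flush()
            yield chunk
            return
        self.buffer.append(chunk)
        # Keep only enough trailing tokens to detect a split end marker.
        while len(self.buffer) > END_MARKER_TOKENS:
            yield self.buffer.pop(0)

    def flush(self):
        while self.buffer:
            yield self.buffer.pop(0)


listener = SimpleBufferingListener()
stream = [
    {"content": "Hello"},
    {"content": " world"},
    {"content": None, "citation": {"title": "Doc 1"}},  # native response chunk
    {"content": "!"},
]
for chunk in stream:
    for out in listener.receive(chunk):
        print(out)
for out in listener.flush():
    print(out)
```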

@TomeHirata TomeHirata requested a review from Copilot October 3, 2025 00:57
Contributor

Copilot AI left a comment


Pull Request Overview

This PR fixes a chunk order issue in native field streaming where chunks could be reordered by up to 10 positions when native response chunks are included in string streaming due to buffering logic.

  • Modifies streaming listener to flush buffer when empty chunks with native fields are received
  • Reorders listeners to process active buffering listeners first, ensuring correct chunk order
  • Adds test coverage to verify the fix works correctly

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File                                   Description
tests/streaming/test_streaming.py      Adds test assertions to verify chunk order is maintained correctly
dspy/streaming/streaming_listener.py   Implements buffer flushing for empty chunks and refactors token extraction
dspy/streaming/streamify.py            Reorders listeners to prioritize active buffering listeners
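The streamify.py change can be read as: before dispatching a chunk, put listeners that are actively buffering a string field ahead of the others, so their buffered tokens are emitted before native-field output. A rough sketch of that ordering idea; `is_actively_buffering` is a placeholder attribute, not the actual dspy listener API:

```python
def order_listeners(listeners):
    # Placeholder sketch: handle listeners that are currently buffering a
    # string field first, so their queued text tokens are yielded before any
    # native-field (e.g. citation) chunks from other listeners.
    active = [l for l in listeners if l.is_actively_buffering]
    inactive = [l for l in listeners if not l.is_actively_buffering]
    return active + inactive
```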


# If we receive an empty chunk but streaming has started, flush the buffer.
# LiteLLM does not send completely empty chunks (https://github.com/BerriAI/litellm/blob/main/litellm/litellm_core_utils/model_response_utils.py#L10),
# so empty content means it has other native fields such as provider_specific_fields.
if not chunk_message:
Collaborator

Is this correct? From the log shared internally, the LM can produce citation chunks in between field chunks, so couldn't that lead to an unintended flush?

Collaborator Author
TomeHirata Oct 3, 2025

Flush is for finding the end token, right? It is true that native chunks are passed between field chunks, but when the citation chunk is passed, the end token (like [[ ##) shouldn't be in the queue. So it's a safe time to flush the tokens for the ongoing string field.

Collaborator

I am not entirely sure about the LM streaming order, i.e., how it interleaves normal text with native features/events, but if it looks like this:

  1. text: what
  2. text: the
  3. citation:
  4. text: jesus?
  5. text: [[ ##
  6. citation:
  7. text: completed ## ]]

Then at step 6, [[ ## will be yielded by the flush call.

Collaborator Author

In that example, does it mean completed ## ]] is supported by the citation #5? I'm assuming that the LM produces the native response and the string response in the right order. If it makes mistakes, then yeah, it won't work well.
