Conversation

TomeHirata (Collaborator)

Currently, StreamListener always buffers up to 10 chunks of tokens while looking for the end token. This behavior has two issues: 1) it causes unnecessary delay in token streaming, and 2) it may change the chunk order for native response chunks. This PR updates the buffer condition so that chunks are buffered only when the accumulated content could form the end boilerplate.

@TomeHirata TomeHirata requested a review from Copilot October 6, 2025 14:29
Copilot AI (Contributor) left a comment


Pull Request Overview

This PR optimizes the StreamListener buffering logic to reduce unnecessary delays in token streaming by implementing more precise buffer conditions. Instead of always buffering up to 10 chunks, the system now intelligently determines when buffering is needed based on whether the current content could potentially form an end identifier pattern.

  • Adds _could_form_end_identifier() method to detect when buffering is necessary
  • Updates buffering logic to yield tokens immediately when they cannot form end patterns
  • Introduces adapter-specific pattern configuration for precise end identifier detection
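To make the idea concrete, here is a minimal sketch of such a predicate. This is an illustration, not dspy's actual implementation: the function name, the suffix/prefix strategy, and the example end identifier are all assumptions. The core idea is that buffering is only warranted while some suffix of the accumulated text could still grow into the end marker.

```python
def could_form_end_identifier(text: str, end_identifier: str) -> bool:
    """Return True if some suffix of `text` is a prefix of `end_identifier`,
    meaning the next incoming chunks could still complete the end marker
    and the token must stay buffered. Otherwise it is safe to yield."""
    # Only the last len(end_identifier) characters can matter.
    tail = text[-len(end_identifier):]
    for i in range(1, len(tail) + 1):
        if end_identifier.startswith(tail[-i:]):
            return True
    return False

# Example with a ChatAdapter-style marker (marker text assumed for illustration):
marker = "[[ ## completed ## ]]"
could_form_end_identifier("some output [[ #", marker)  # True: keep buffering
could_form_end_identifier("plain prose so far", marker)  # False: yield immediately
```

Under this scheme, ordinary prose is streamed out with no artificial delay, and buffering kicks in only when a partial end marker appears at the tail of the stream.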

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

Files changed:

  • dspy/streaming/streaming_listener.py — Implements the smart buffering logic with new pattern-matching capabilities and updates the receive method
  • tests/streaming/test_streaming.py — Adds comprehensive test coverage for the new _could_form_end_identifier method across all adapter types


@kenctrl commented Oct 6, 2025

LGTM, as long as the method matches all potential cases. Thanks Tomu!

@chenmoneygithub (Collaborator) left a comment


Looks great!

```python
elif not self._could_form_end_identifier(concat_message, adapter_name):
    # Buffer cannot form end identifier, safe to yield the oldest token.
    # Keep at least 1 token in buffer in case the next token creates the end pattern.
    if self.field_end_queue.qsize() > 1:
```

Shall we just call flush() here? This won't affect the use case we are tackling, but technically a direct flush() fits here.
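For context on the flush() suggestion, here is a sketch of what draining the token queue directly could look like. The class, method body, and usage are assumptions for illustration; only the attribute name field_end_queue mirrors the PR's code.

```python
from queue import Queue

class StreamBufferSketch:
    """Illustrative stand-in for the listener's buffer; not dspy's code."""

    def __init__(self) -> None:
        self.field_end_queue: Queue[str] = Queue()

    def flush(self) -> str:
        # Drain every buffered token and return them joined, which is
        # what a direct flush() call at that branch would accomplish.
        tokens = []
        while not self.field_end_queue.empty():
            tokens.append(self.field_end_queue.get())
        return "".join(tokens)

buf = StreamBufferSketch()
buf.field_end_queue.put("foo")
buf.field_end_queue.put("bar")
buf.flush()  # returns "foobar" and leaves the queue empty
```

The trade-off the reviewer notes: yielding one token at a time preserves fine-grained streaming, while a direct flush() empties the buffer in one step; for this branch either is correct since the buffer provably cannot contain an end identifier.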

@TomeHirata TomeHirata merged commit 6224eb3 into stanfordnlp:main Oct 7, 2025
10 checks passed