Emit <thinking> tag boundaries during streaming by cpsievert · Pull Request #294 · posit-dev/chatlas

cpsievert · 2026-05-06T22:12:39Z

Summary

When ContentThinking chunks are emitted during streaming, the streaming loop now emits <thinking>\n before the first thinking chunk and \n</thinking>\n\n when transitioning to non-thinking content (or at end of stream)
For content="text" consumers, tags are yielded as string chunks — concatenated output is well-formed
For content="all" consumers, behavior is unchanged — typed ContentThinking objects are yielded, no tag strings
ContentThinking now has a _complete PrivateAttr (default True). Streaming chunks are constructed via _as_chunk() with _complete=False, so __str__() returns bare text for fragments instead of wrapping each one in <thinking>...</thinking> tags
Removes the synthetic "\n\n" separator from the OpenAI provider's reasoning_summary_text.done event (now redundant)

Motivation

Currently, streaming thinking content has two issues:

content="all" mode yields ContentThinking objects whose __str__() independently wraps each chunk in <thinking>...</thinking> — printing produces repeated tags around each fragment
content="text" mode yields thinking as bare strings indistinguishable from response text

After this change, concatenating a content="text" stream produces:

<thinking>
reasoning content here...
</thinking>

Response text here...

And calling str() on individual ContentThinking chunks in content="all" mode returns the raw thinking text without tag wrapping.

Test plan

10 unit tests covering sync/async, text/all modes, thinking-only streams, text-only streams, tag chunk boundaries, and str() on chunks
Full existing test suite passes (190 passed, 3 skipped — only bedrock fails due to live API requirement)
Pyright passes with 0 errors on changed files

The streaming loop now emits `<thinking>\n` before the first thinking chunk and `\n</thinking>\n\n` on transition to non-thinking content (or at end of stream), giving consumers well-formed output. For `content="text"` mode, tags are yielded as string chunks so concatenated output is properly delimited. For `content="all"` mode, behavior is unchanged — typed ContentThinking objects are yielded. Also removes the synthetic "\n\n" separator from the OpenAI provider's reasoning_summary_text.done event since the thinking→text transition now provides the visual break. Companion to tidyverse/ellmer#975.

Streaming chunks are fragments, not complete thoughts. Adding a _complete PrivateAttr (default True) lets __str__() skip tag wrapping for chunks emitted during streaming, preventing repeated <thinking>...</thinking> around each fragment in content="all" mode. Providers now use ContentThinking._as_chunk() for streaming fragments.

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated no new comments.

cpsievert force-pushed the fix/streaming-thinking-tags branch from c748d39 to d0badcb Compare May 6, 2026 22:15

cpsievert requested a review from Copilot May 6, 2026 22:20

docs: add changelog entry for streaming thinking tag boundaries

13b5b5f

Copilot started reviewing on behalf of cpsievert May 6, 2026 22:20 View session

This comment was marked as resolved.

Sign in to view

cpsievert requested a review from Copilot May 6, 2026 22:59

Copilot started reviewing on behalf of cpsievert May 6, 2026 23:00 View session

cpsievert mentioned this pull request May 6, 2026

Simplify thinking: remove server-side complexity and transport type posit-dev/shinychat#210

Closed

6 tasks

Copilot AI reviewed May 6, 2026

View reviewed changes

cpsievert merged commit dbb45f0 into main May 6, 2026
12 checks passed

cpsievert deleted the fix/streaming-thinking-tags branch May 6, 2026 23:08

This was referenced May 7, 2026

fix: yield thinking tag boundaries in content='all' mode #297

Merged

Introduce ContentThinkingDelta for streaming thinking tidyverse/ellmer#975

Open

Replace tag injection with ContentThinkingDelta class #299

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Emit <thinking> tag boundaries during streaming#294

Emit <thinking> tag boundaries during streaming#294
cpsievert merged 3 commits intomainfrom
fix/streaming-thinking-tags

cpsievert commented May 6, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cpsievert commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Test plan

Uh oh!

This comment was marked as resolved.

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cpsievert commented May 6, 2026 •

edited

Loading