Fix chat continuation failing on models that reject assistant-prefill by threepointone · Pull Request #1618 · cloudflare/agents

threepointone · 2026-05-29T23:20:19Z

Summary

When a chat turn is interrupted mid-stream (e.g. by a deploy) and recovery
continues it, continueLastTurn replays a transcript whose final message is
the partial assistant message — an "assistant prefill". Modern chat models
reject a request that ends in an assistant message, so the continuation throws
and the turn is left interrupted.

Anthropic Claude 4.6+ (Opus 4.6/4.7, Sonnet 4.6) returns a 400:
This model does not support assistant message prefill. The conversation must end with a user message. (Anthropic's own migration guidance: "move the
continuation text into a user message".)
Kimi 2.6 (Workers AI) tolerates it (responds normally) — so this is
model-specific, and bites the modern Anthropic models the customer runs.

Fix

ensureValidContinueCheckpoint(messages) appends an ephemeral user "continue"
checkpoint whenever a model request would otherwise end in an assistant message.
It is applied to finalMessages in _runInferenceLoop (after beforeTurn, so
subclass message overrides are protected too), shapes only the model request,
and is never persisted to the transcript.

Auto-continuation after a tool call is unaffected: that transcript ends in a
tool-role (user-side) message, not an assistant message.

@cloudflare/ai-chat is not affected — there the user's onChatMessage builds
the model request; Think is the one that assembles and calls the model in
_runInferenceLoop.

Reproduction

The examples/deploy-churn harness gains /probe/trailing-{user,assistant}
routes (Workers AI default; ?provider=anthropic&model=claude-sonnet-4-6):

Kimi 2.6: trailing-assistant → ok (no repro).
Anthropic Sonnet 4.6: trailing-assistant → AI_APICallError (the 400 above).

Findings are logged in examples/deploy-churn/INVESTIGATION.md.

Test

New deterministic regression test in think-session.test.ts
("continuation does not replay a trailing assistant message (assistant
prefill)") uses a mock model that rejects a trailing-assistant prompt. It
fails without the fix (expected 'assistant' not to be 'assistant') and
passes with it.

Test plan

npm run test -w @cloudflare/think — 432 pass (incl. new test)
Confirmed the new test fails without the fix and passes with it
lint + typecheck
Live repro via examples/deploy-churn against Kimi 2.6 and Anthropic Sonnet 4.6
CI green

Follow-up (not in this PR)

Interrupted turns can still look "frozen" on the client after a WS reconnect
storm — but that is a separate hydration gap (terminal/recovery status is
broadcast transiently and not replayed on reconnect), tracked separately in
INVESTIGATION.md. Companion to the recovery fixes in #1615 and #1617.

Made with Cursor

…prefill) Continuing a partial assistant turn (e.g. after a deploy interrupts a stream) replays a transcript whose final message is that partial assistant message. Modern chat models reject a request ending in an assistant message — Anthropic Claude 4.6+ returns a 400 ("This model does not support assistant message prefill. The conversation must end with a user message."). So the continuation threw and the turn was left silently interrupted. Fix: `ensureValidContinueCheckpoint` appends an ephemeral user "continue" checkpoint whenever a model request would otherwise end in an assistant message (applied to finalMessages in `_runInferenceLoop`, after beforeTurn, so subclass overrides are protected too). It shapes only the model request and is never persisted. Auto-continuation after a tool call is unaffected (that transcript ends in a tool-role message, not assistant). Reproduction + test: - examples/deploy-churn gains `/probe/trailing-{user,assistant}` routes (Workers AI default; `?provider=anthropic&model=claude-sonnet-4-6`). Live: Kimi 2.6 tolerates a trailing assistant (no repro); Anthropic Sonnet 4.6 returns the 400 above. Findings logged in examples/deploy-churn/INVESTIGATION.md. - New deterministic regression test in think-session.test.ts uses a mock model that rejects a trailing-assistant prompt; fails without the fix, passes with it. Full think suite: 432 pass. Co-authored-by: Cursor <cursoragent@cursor.com>

…g broadcast Co-authored-by: Cursor <cursoragent@cursor.com>

changeset-bot · 2026-05-29T23:20:25Z

🦋 Changeset detected

Latest commit: 0ee87fb

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@cloudflare/think	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

…, workers-ai-provider) Co-authored-by: Cursor <cursoragent@cursor.com>

threepointone · 2026-05-29T23:41:52Z

/bonk can you run the formatter on INVESTIGATION.md and commit it and push it?

Co-authored-by: threepointone <threepointone@users.noreply.github.com>

ask-bonk · 2026-05-29T23:42:49Z

The project uses oxfmt which only formats JS/TS files, not markdown. There's no markdown formatter configured in this repo.

That said, looking at the file, it's already well-formatted markdown. There are no formatting issues — the content is clean and follows standard markdown conventions. Oxfmt (the project's only formatter) doesn't handle .md files.

There's nothing to format or commit here. The project's formatter (oxfmt) only handles JavaScript and TypeScript files, and there's no markdown-specific formatter configured in the repo. The INVESTIGATION.md file is already properly formatted.

github run

threepointone · 2026-05-29T23:44:05Z

/bonk just run npm run format in the root, you'll see it'll format that file. thn commit and push it.

ask-bonk · 2026-05-29T23:44:59Z

Ran npm run format (oxfmt) at the repo root — no files were changed. Everything is already properly formatted.

ask-bonk · 2026-05-29T23:45:03Z

Ran the formatter — no changes were produced. All files in the PR are already properly formatted.

github run

threepointone and others added 2 commits May 30, 2026 00:07

docs(deploy-churn): re-scope issue 4 as a hydration gap, not a missin…

a97b33b

…g broadcast Co-authored-by: Cursor <cursoragent@cursor.com>

chore: update lockfile for deploy-churn probe deps (@ai-sdk/anthropic…

d785e38

…, workers-ai-provider) Co-authored-by: Cursor <cursoragent@cursor.com>

oxfmt skips .md; file already clean.

0ee87fb

Co-authored-by: threepointone <threepointone@users.noreply.github.com>

This was referenced May 29, 2026

Fix silent/frozen interrupted turns: replay terminal status on reconnect #1619

Merged

Surface a live "recovering…" status to chat clients during durable recovery #1620

Open

threepointone merged commit e6b6c0b into main May 29, 2026

threepointone deleted the fix/continuation-avoids-assistant-prefill branch May 29, 2026 23:51

github-actions Bot mentioned this pull request May 30, 2026

Version Packages #1597

Open

threepointone mentioned this pull request May 30, 2026

Persist settled tool results durably so recovery never re-runs completed tool calls #1621

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix chat continuation failing on models that reject assistant-prefill#1618

Fix chat continuation failing on models that reject assistant-prefill#1618
threepointone merged 4 commits into
mainfrom
fix/continuation-avoids-assistant-prefill

threepointone commented May 29, 2026

Uh oh!

changeset-bot Bot commented May 29, 2026 •

edited

Loading

Uh oh!

threepointone commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

threepointone commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

threepointone commented May 29, 2026

Summary

Fix

Reproduction

Test

Test plan

Follow-up (not in this PR)

Uh oh!

changeset-bot Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

threepointone commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

threepointone commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

ask-bonk Bot commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

changeset-bot Bot commented May 29, 2026 •

edited

Loading