fix: inject cache_control on content blocks for openai-compatible proxies to Anthropic backends (Bifrost, LiteLLM, Databricks) by KTS-o7 · Pull Request #25985 · anomalyco/opencode

KTS-o7 · 2026-05-06T07:42:14Z

Issue for this PR

Type of change

Bug fix
New feature
Refactor / code improvement
Documentation

What does this PR do?

setCacheKey: true on @ai-sdk/openai-compatible providers was causing promptCacheKey to be sent as a top-level request option. Bifrost and LiteLLM (which proxy to Bedrock/Anthropic) don't use this field — they require cache_control: { type: "ephemeral" } on individual message content blocks, which they then translate to the backend's native caching format.

The fix adds a new applyCompatCaching() function in transform.ts that:

Converts string system messages into content block arrays with cache_control on each block (message-level injection doesn't work because the SDK spreads it as a top-level field, not a block property)
Annotates the last content block of targeted user messages with cache_control
Gets called from message() when the provider is @ai-sdk/openai-compatible and either cacheStrategy: "bedrock" is set explicitly, or setCacheKey: true with a model ID containing bedrock/

I also added a guard to stop applyCaching() from running on @ai-sdk/openai-compatible providers, since the model.id.includes("claude") heuristic there would have triggered the wrong path for Bifrost models.

I understand why this works: getOpenAIMetadata() in the AI SDK reads message.providerOptions?.openaiCompatible and spreads it onto the serialized message/block objects. So putting { cache_control: { type: "ephemeral" } } under providerOptions.openaiCompatible on a content block means it lands on the wire as { type: "text", text: "...", cache_control: { type: "ephemeral" } }, which is exactly what Bifrost/LiteLLM expect.

How did you verify your code works?

Added 7 tests in packages/opencode/test/provider/transform.test.ts covering: string system → content block conversion, user block annotation, auto-trigger via bedrock/ model ID, negative cases (no opts, non-bedrock model), and multi-part user messages. All 155 tests pass.
Ran bun typecheck from packages/opencode — no errors.
Tested locally with Bifrost running at localhost:24242 routing to bedrock/global.anthropic.claude-sonnet-4-6. Inspected outgoing requests and confirmed cache_control: { type: "ephemeral" } appears on content blocks.

Screenshots / recordings

No UI changes.

Checklist

I have tested my changes locally
I have not included unrelated changes in this PR

github-actions · 2026-05-06T07:48:12Z

Thanks for updating your PR! It now meets our contributing guidelines. 👍

…rock proxies When setCacheKey: true is set on an @ai-sdk/openai-compatible provider and the model ID contains 'bedrock/', or when cacheStrategy: 'bedrock' is explicitly set, OpenCode now injects cache_control: {type:'ephemeral'} onto message content blocks instead of sending a promptCacheKey request option. promptCacheKey is an OpenAI-native mechanism that Bifrost, LiteLLM, and other proxies routing to AWS Bedrock/Anthropic ignore entirely. These proxies require cache_control on individual content blocks (Anthropic-style), which they then translate to the native backend caching format. Key changes: - applyCompatCaching(): new function that converts string system messages to content block arrays and annotates the last block of system/user messages with cache_control via providerOptions.openaiCompatible — matching what Bifrost and LiteLLM expect on the wire - Guards applyCaching() from running on @ai-sdk/openai-compatible models to prevent the 'claude' model-id heuristic from triggering the wrong caching path - Passes provider options (item.options) into ProviderTransform.message() so setCacheKey / cacheStrategy are available at message-transform time - Adds cacheStrategy: 'bedrock' option to provider config schema - Docs: new section explaining caching for openai-compatible Bedrock proxies

KTS-o7 · 2026-05-10T05:05:49Z

Since opening this PR, the underlying issue has been confirmed by two more users on different providers:

Databricks (@jairbubbles in setCacheKey sends promptCacheKey (wrong) instead of cache_control on content blocks for openai-compatible Bedrock proxies (Bifrost, LiteLLM) #25984): Databricks model serving also expects cache_control: { type: "ephemeral" } in messages.content blocks — the same mechanism this fix implements. Their docs confirm it.
Xiaomi Mimo direct API (@xenstar in setCacheKey sends promptCacheKey (wrong) instead of cache_control on content blocks for openai-compatible Bedrock proxies (Bifrost, LiteLLM) #25984): Without the fix, there is zero caching and credits burn rapidly. With OpenRouter (which handles the translation itself), the same model hits ~95% cache rate.

This makes it clear the issue affects a broad class of OpenAI-compatible proxies that route to Anthropic-capable backends — not just Bifrost/LiteLLM. The cacheStrategy: "bedrock" approach this PR introduces generalises cleanly to all of them.

The fix is minimal and isolated to transform.ts with a guard that keeps the existing applyCaching() path completely unchanged for native providers. Happy to address any review feedback.

KTS-o7 · 2026-05-10T05:42:34Z

Hey @rekram1-node and @thdxr — would love to get a review on this when you have a moment.

This fixes a caching issue for users routing Claude models through OpenAI-compatible proxies (Bifrost, LiteLLM, Databricks, Xiaomi Mimo) to Bedrock/Anthropic backends. The root cause: setCacheKey: true sends promptCacheKey as a top-level option, which these proxies don't recognise — they require cache_control: { type: "ephemeral" } injected directly onto message content blocks.

@rekram1-node — you just touched this area in #26276, so you likely have the most context right now. The fix lives entirely in transform.ts with a new applyCompatCaching() function, guarded so it only runs for @ai-sdk/openai-compatible providers and never interferes with the existing applyCaching() path.

The issue has been independently confirmed by users on Databricks and Xiaomi Mimo direct API (see #25984) — so this affects a broad class of OpenAI-compatible proxies, not just Bifrost/LiteLLM.

github-actions Bot added the needs:compliance This means the issue will auto-close after 2 hours. label May 6, 2026

KTS-o7 mentioned this pull request May 6, 2026

setCacheKey sends promptCacheKey (wrong) instead of cache_control on content blocks for openai-compatible Bedrock proxies (Bifrost, LiteLLM) #25984

Open

github-actions Bot removed the needs:compliance This means the issue will auto-close after 2 hours. label May 6, 2026

KTS-o7 force-pushed the feat/openai-compatible-bedrock-cache-control branch from ea6982e to cc3b3b9 Compare May 6, 2026 07:49

KTS-o7 added 3 commits May 6, 2026 13:49

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

8a4d6d2

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

2ecf99f

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

bfc9a46

Astro-Han mentioned this pull request May 6, 2026

[Task] Track upstream sync after opencode 17701628bd Astro-Han/pawwork#477

Closed

66 tasks

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

92919e2

KTS-o7 changed the title ~~fix: inject cache_control on content blocks for openai-compatible Bedrock proxies (Bifrost, LiteLLM)~~ fix: inject cache_control on content blocks for openai-compatible proxies to Anthropic backends (Bifrost, LiteLLM, Databricks) May 10, 2026

KTS-o7 mentioned this pull request May 10, 2026

The Claude model cannot properly enable the caching function #11083

Open

fix: replace any[] with proper UserContent types in applyCompatCaching

96371fb

KTS-o7 and others added 3 commits May 10, 2026 11:18

refactor: align applyCompatCaching with project style guide

cc502c6

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

b15108f

Merge branch 'dev' into feat/openai-compatible-bedrock-cache-control

8720152

github-actions Bot mentioned this pull request May 29, 2026

feat(opencode): add LiteLLM provider integration #29937

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: inject cache_control on content blocks for openai-compatible proxies to Anthropic backends (Bifrost, LiteLLM, Databricks)#25985

fix: inject cache_control on content blocks for openai-compatible proxies to Anthropic backends (Bifrost, LiteLLM, Databricks)#25985
KTS-o7 wants to merge 9 commits into
anomalyco:devfrom
KTS-o7:feat/openai-compatible-bedrock-cache-control

KTS-o7 commented May 6, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 6, 2026

Uh oh!

KTS-o7 commented May 10, 2026

Uh oh!

KTS-o7 commented May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

KTS-o7 commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue for this PR

Type of change

What does this PR do?

How did you verify your code works?

Screenshots / recordings

Checklist

Uh oh!

github-actions Bot commented May 6, 2026

Uh oh!

KTS-o7 commented May 10, 2026

Uh oh!

KTS-o7 commented May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

KTS-o7 commented May 6, 2026 •

edited

Loading