fix(compiler): strip cache_control for non-Anthropic providers by Aldominguez12 · Pull Request #154 · VectifyAI/OpenKB

Aldominguez12 · 2026-06-30T06:02:58Z

Problem

_cached_text tags reusable prompt context (system prompt, document, summary, known-targets) with an Anthropic ephemeral cache_control marker — added in #37 for Anthropic prompt caching. Its docstring assumes non-supporting providers simply ignore the marker:

For providers that ignore cache_control, the list-of-blocks payload remains a valid OpenAI-compatible content shape.

That assumption does not hold for Gemini. LiteLLM translates the cache_control block into a Gemini CachedContent object, which then conflicts with system_instruction/tools, so every request fails with:

400 CachedContent can not be used with system_instruction, tools or tool_config ...

Net effect: every compile with a gemini/* model fails out of the box — anything that routes through the compiler (openkb add, lint, query, skill, deck). Reproduced on gemini/gemini-2.5-pro and gemini/gemini-3-flash-preview with v0.4.2.

Fix

Strip the marker at the single request egress (_llm_call / _llm_call_async) for any provider that won't honour it, keeping it for Anthropic (direct) and Claude served via OpenRouter / Bedrock / Vertex. Producers keep tagging optimistically; only the egress decides whether the marker survives.

Anthropic prompt caching is unchanged.
Gemini's implicit caching still applies to the remaining plain-text blocks (verified the cached= token count is still reported).
supports_prompt_caching() is deliberately not used as the gate: it returns True for Gemini and GPT-4o (they have some caching), which is not the same as accepting Anthropic's cache_control block. Provider identity via get_llm_provider is the correct signal.

Tests

TestCacheControlStripping (5 tests): provider gating (Anthropic / OpenRouter-Anthropic kept; Gemini / GPT stripped), non-mutating marker removal, and the keep/strip paths through _llm_call. The existing TestCacheControl (Anthropic still gets breakpoints) and TestLLMCallExtraHeaders continue to pass.

get_llm_provider is imported locally inside the gate so provider detection stays correct even under tests that patch the module-level litellm.

The compiler tags reusable prompt context with an Anthropic ephemeral `cache_control` marker (`_cached_text`). The docstring assumed providers that don't support it would simply ignore it — but LiteLLM translates the marker into a provider-native cached-content object for Gemini, which then conflicts with `system_instruction`/`tools` and fails every request with `400 CachedContent can not be used with ...`. As a result, *all* Gemini compiles fail out of the box. Strip the marker at the single request egress (`_llm_call` / `_llm_call_async`) for any non-Anthropic provider, keeping it for Anthropic direct and Claude via OpenRouter/Bedrock/Vertex. Anthropic prompt caching is unchanged; Gemini's implicit caching still applies to the plain text blocks. Provider detection uses litellm.get_llm_provider, imported locally so it stays correct even when tests patch the module-level `litellm` reference. Adds TestCacheControlStripping covering provider gating, marker removal (non-mutating), and the sync stripping/keeping paths.

KylinMountain

LGTM, thanks for the thorough fix.

Aldominguez12 force-pushed the fix/cache-control-non-anthropic-400 branch from 51b0791 to f454572 Compare June 30, 2026 06:07

KylinMountain approved these changes Jun 30, 2026

View reviewed changes

KylinMountain merged commit e896070 into VectifyAI:main Jun 30, 2026

KylinMountain mentioned this pull request Jul 2, 2026

[bug] Gemini ingest always fails: cache_control can't be combined with system_instruction/tools #157

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(compiler): strip cache_control for non-Anthropic providers#154

fix(compiler): strip cache_control for non-Anthropic providers#154
KylinMountain merged 1 commit into
VectifyAI:mainfrom
Aldominguez12:fix/cache-control-non-anthropic-400

Aldominguez12 commented Jun 30, 2026

Uh oh!

KylinMountain left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Aldominguez12 commented Jun 30, 2026

Problem

Fix

Tests

Uh oh!

KylinMountain left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants