fix(llma): Langchain cache token double subtraction for non-Anthropic providers #369

carlos-marchal-ph · 2025-11-10T19:24:51Z

Langchain internally transforms Anthropic token counts to an OpenAI like format (input tokens include in cached tokens). This was causing errors in our logic since we already undo this transformation in the plugin server. A fix for this was merged in #346.

Unfortunately the fix was too eager and it also started applying to other, non-Anthropic providers, like OpenAI itself. This causes incorrect costs (even negative ones) for generations with cache usage.

PR #346 introduced cache token subtraction for all providers, causing double subtraction for OpenAI/OpenRouter and resulting in negative costs. The plugin-server only subtracts cache tokens for Anthropic providers (exact match on provider name or substring match on model name). This fix aligns the Python SDK with that behavior. Changes: - Only subtract cache tokens when provider="anthropic" OR model contains "anthropic" - Passes provider and model metadata to usage parsing functions - Updates tests to reflect correct behavior (no subtraction for OpenAI) - Adds test for Anthropic provider subtraction Fixes negative cost calculations for users on OpenAI/OpenRouter with cached tokens.

greptile-apps

_{4 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

carlos-marchal-ph requested a review from a team November 10, 2025 19:24

carlos-marchal-ph self-assigned this Nov 10, 2025

carlos-marchal-ph added bug Something isn't working team/llm-analytics labels Nov 10, 2025

greptile-apps bot reviewed Nov 10, 2025

View reviewed changes

andrewm4894 approved these changes Nov 11, 2025

View reviewed changes

carlos-marchal-ph merged commit 6c815df into master Nov 11, 2025
12 checks passed

carlos-marchal-ph deleted the fix(llma)/cache-token-double-subtraction branch November 11, 2025 08:57

carlos-marchal-ph mentioned this pull request Nov 18, 2025

fix: langchain double subtraction PostHog/posthog-js#2591

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(llma): Langchain cache token double subtraction for non-Anthropic providers #369

fix(llma): Langchain cache token double subtraction for non-Anthropic providers #369

Uh oh!

carlos-marchal-ph commented Nov 10, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(llma): Langchain cache token double subtraction for non-Anthropic providers #369

fix(llma): Langchain cache token double subtraction for non-Anthropic providers #369

Uh oh!

Conversation

carlos-marchal-ph commented Nov 10, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants