fix(openai): subtract cached tokens from input tokens to avoid double counting #176
Merged
andreynering merged 1 commit into main on Mar 18, 2026
Conversation
OpenAI's API reports prompt_tokens/input_tokens INCLUDING cached tokens, while also separately reporting cached_tokens in prompt_tokens_details. This caused double counting when users summed InputTokens + CacheReadTokens.

For example, if OpenAI reports:

- prompt_tokens: 1000 (includes 900 cached)
- cached_tokens: 900

Before this fix, fantasy reported:

- InputTokens: 1000
- CacheReadTokens: 900

After this fix, fantasy reports:

- InputTokens: 100 (non-cached only)
- CacheReadTokens: 900

This matches the behavior of the Vercel AI SDK and prevents billing miscalculations when input tokens and cache read tokens are priced separately.

See: https://platform.openai.com/docs/guides/prompt-caching#requirements

💘 Generated with Crush
Assisted-by: Kimi K2.5 via Crush <crush@charm.land>
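For illustration, here is a minimal Go sketch of the conversion this fix describes. The type and function names (openaiUsage, toUsage) are assumptions for the example, not necessarily fantasy's actual identifiers; only the InputTokens/CacheReadTokens fields come from the PR description.

```go
package main

import "fmt"

// Usage mirrors fantasy's token accounting fields named in this PR.
type Usage struct {
	InputTokens     int64
	CacheReadTokens int64
}

// openaiUsage mirrors the relevant parts of OpenAI's usage payload
// (hypothetical struct for this sketch).
type openaiUsage struct {
	PromptTokens int64 // total prompt tokens, INCLUDING cached ones
	CachedTokens int64 // from prompt_tokens_details.cached_tokens
}

// toUsage subtracts cached tokens from the prompt total so that
// InputTokens counts only the non-cached portion, avoiding double
// counting when callers sum InputTokens + CacheReadTokens.
func toUsage(u openaiUsage) Usage {
	input := u.PromptTokens - u.CachedTokens
	if input < 0 {
		input = 0 // guard against an inconsistent API response
	}
	return Usage{
		InputTokens:     input,
		CacheReadTokens: u.CachedTokens,
	}
}

func main() {
	// The example from the PR: prompt_tokens=1000, 900 of them cached.
	got := toUsage(openaiUsage{PromptTokens: 1000, CachedTokens: 900})
	fmt.Println(got.InputTokens, got.CacheReadTokens) // 100 900
}
```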
kylecarbs
approved these changes
Mar 18, 2026
aymanbagabas
approved these changes
Mar 18, 2026