Bugfix/usage for openrouter #11627

Conversation

@daarko10 (Contributor) commented Jun 11, 2025

Propagate the OpenRouter usage information back correctly

Add OpenRouter include_usage flag mapping and propagate cost & is_byok in responses

Relevant issues

Fixes #11626

Pre-Submission checklist

  • Added or updated tests under tests/litellm/
  • Tests pass locally (see attached log snippet)
  • All unit tests pass via make test-unit
  • Changes are scoped to only OpenRouter usage-flag handling and cost propagation

Type

🐛 Bug Fix

Changes

  • Parameter mapping

    • In litellm/llms/openrouter/chat/transformation.py, detect stream_options.include_usage and emit:
      extra_body["usage"] = {"include": True}
    • Removes the manual default; the usage flag is emitted only when include_usage: true is set (see the first sketch after this list).
  • OpenRouter chunk parsing

    • In OpenRouterChatCompletionStreamingHandler.chunk_parser, read chunk["usage"]["cost"] and chunk["usage"]["is_byok"] and assign them directly to ModelResponseStream.usage (sketched after the list).
  • Final stream builder

    • In litellm/main.py’s stream_chunk_builder, prefer the OpenRouter-provided usage (including cost/is_byok) over any manual fallback computation (see the preference sketch after the list).
  • Cost calculator wrappers

    • In litellm/cost_calculator.py, switched to using PromptTokensDetailsWrapper and CompletionTokensDetailsWrapper in combine_usage_objects to carry nested token details.
  • Response conversion

    • In litellm_core_utils/llm_response_utils/convert_dict_to_response.py, unified both streaming and non-streaming paths to use:
      usage_object = Usage(**response_object["usage"])
      setattr(model_response_object, "usage", usage_object)
  • Logging utils

    • Minor tweak in litellm_core_utils/logging_utils.py to preserve exception context when logging errors in _assemble_complete_response_from_streaming_chunks (a generic version of the pattern is sketched after the list).
  • Streaming chunk builder utilities

    • In litellm_core_utils/streaming_chunk_builder_utils.py:
      • Updated _usage_chunk_calculation_helper to include cost and is_byok in its returned dict.
      • Revised _process_usage_chunks to pull cost/is_byok from each chunk and feed them into the final usage data (see the accumulation sketch after the list).
  • Streaming handler iteration

    • In litellm_core_utils/streaming_handler.py, refactored both __next__ and __anext__ to propagate intermediate chunk usage objects and combine them into a final Usage without re-computing cost (a toy iterator version is sketched after the list).
  • Type extensions

    • In litellm/types/utils.py, extended the Usage model to include:
      cost: Optional[float]
      is_byok: Optional[bool]
      and retained private cache-token attributes for prompt caching (a simplified model is sketched after the list).
  • Tests

    • tests/test_litellm/llms/openrouter/chat/test_openrouter_chat_transformation.py: new test verifying that, when stream_options.include_usage is true, the parsed ModelResponseStream.usage includes the OpenRouter cost and is_byok (a trimmed-down version is sketched below).
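The sketches below illustrate the changes above in simplified, self-contained Python; every function and class name in them is an assumption made for illustration, not the actual litellm code. First, the parameter mapping: when the caller sets stream_options.include_usage, forward it as OpenRouter's own usage accounting flag.

# Hypothetical helper; the real logic lives in
# litellm/llms/openrouter/chat/transformation.py.
from typing import Any, Dict, Optional

def map_openrouter_usage_flag(
    stream_options: Optional[Dict[str, Any]],
    extra_body: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Emit OpenRouter's usage flag only when include_usage is truthy."""
    extra_body = dict(extra_body or {})
    if stream_options and stream_options.get("include_usage"):
        extra_body["usage"] = {"include": True}
    return extra_body

assert map_openrouter_usage_flag({"include_usage": True}) == {"usage": {"include": True}}
assert map_openrouter_usage_flag(None) == {}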
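Next, the chunk-parsing idea: OpenRouter appends cost and is_byok to the usage block of a late streamed chunk. The Usage dataclass here is a trimmed stand-in for litellm's own type.

from dataclasses import dataclass
from typing import Any, Dict, Optional

@dataclass
class Usage:  # simplified stand-in for litellm.types.utils.Usage
    prompt_tokens: int = 0
    completion_tokens: int = 0
    total_tokens: int = 0
    cost: Optional[float] = None
    is_byok: Optional[bool] = None

def parse_usage_chunk(chunk: Dict[str, Any]) -> Optional[Usage]:
    raw = chunk.get("usage")
    if raw is None:
        return None  # ordinary content chunks carry no usage block
    return Usage(
        prompt_tokens=raw.get("prompt_tokens", 0),
        completion_tokens=raw.get("completion_tokens", 0),
        total_tokens=raw.get("total_tokens", 0),
        cost=raw.get("cost"),        # OpenRouter-specific field
        is_byok=raw.get("is_byok"),  # OpenRouter-specific field
    )

usage = parse_usage_chunk({"usage": {"prompt_tokens": 12, "completion_tokens": 34,
                                     "total_tokens": 46, "cost": 0.00021, "is_byok": False}})
assert usage is not None and usage.cost == 0.00021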
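The preference rule in the final stream builder reduces to: trust a provider-reported usage object when one exists, since it already carries cost/is_byok; otherwise keep the locally computed fallback.

def choose_final_usage(provider_usage, computed_usage):
    """Prefer provider-reported usage over local recomputation."""
    return provider_usage if provider_usage is not None else computed_usage

assert choose_final_usage({"cost": 0.0001}, {"total_tokens": 10}) == {"cost": 0.0001}
assert choose_final_usage(None, {"total_tokens": 10}) == {"total_tokens": 10}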
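The logging tweak concerns exception context. A generic pattern for preserving it (assumed here, not quoted from the PR) is to log with the active traceback attached and chain the re-raise with "from":

import logging

logger = logging.getLogger(__name__)

def assemble_or_raise(assemble, chunks):
    try:
        return assemble(chunks)
    except Exception as exc:
        # logger.exception records the current traceback with the message
        logger.exception("failed to assemble complete response from chunks")
        raise RuntimeError("stream assembly failed") from exc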
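The accumulation in the chunk-builder utilities can be pictured as a single pass that keeps the last usage block seen; with OpenRouter that block arrives after the finish_reason chunk and already contains cost and is_byok.

from typing import Any, Dict, Iterable, Optional

def accumulate_usage(chunks: Iterable[Dict[str, Any]]) -> Optional[Dict[str, Any]]:
    final: Optional[Dict[str, Any]] = None
    for chunk in chunks:
        raw = chunk.get("usage")
        if raw:
            final = dict(raw)  # a later usage chunk overrides an earlier one
    return final

chunks = [
    {"choices": [{"delta": {"content": "hi"}}]},
    {"choices": [{"delta": {}, "finish_reason": "stop"}]},
    {"usage": {"prompt_tokens": 5, "completion_tokens": 7,
               "total_tokens": 12, "cost": 0.0001, "is_byok": True}},
]
assert accumulate_usage(chunks)["cost"] == 0.0001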
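The streaming-handler refactor applies the same idea inside the iterator: remember the most recent usage object while yielding chunks, then attach it to the final response instead of recomputing cost. A toy synchronous version (the PR also covers __anext__):

from typing import Any, Dict, Iterator, Optional

class UsageTrackingStream:
    """Toy iterator that tracks the late-arriving usage chunk."""

    def __init__(self, inner: Iterator[Dict[str, Any]]):
        self._inner = inner
        self.final_usage: Optional[Dict[str, Any]] = None

    def __iter__(self):
        return self

    def __next__(self) -> Dict[str, Any]:
        chunk = next(self._inner)  # StopIteration ends the stream
        if chunk.get("usage"):
            self.final_usage = chunk["usage"]  # remember, don't recompute
        return chunk

stream = UsageTrackingStream(iter([
    {"choices": [{"delta": {"content": "ok"}, "finish_reason": "stop"}]},
    {"usage": {"total_tokens": 9, "cost": 0.00005, "is_byok": False}},
]))
for _ in stream:
    pass
assert stream.final_usage["cost"] == 0.00005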
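The type extension amounts to two optional fields that default to None, so providers that never set them are unaffected. A simplified Pydantic model (litellm's real Usage type has many more fields):

from typing import Optional
from pydantic import BaseModel

class Usage(BaseModel):  # trimmed stand-in, not litellm's full model
    prompt_tokens: int = 0
    completion_tokens: int = 0
    total_tokens: int = 0
    cost: Optional[float] = None      # OpenRouter-reported spend
    is_byok: Optional[bool] = None    # whether a bring-your-own key was used

u = Usage(prompt_tokens=3, completion_tokens=4, total_tokens=7, cost=0.0002)
assert u.is_byok is None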
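Finally, a pytest-style reduction of what the new transformation test asserts, using a stand-in parser rather than litellm's real handler:

def parse_usage_chunk(chunk):
    raw = chunk.get("usage") or {}
    return {"cost": raw.get("cost"), "is_byok": raw.get("is_byok")}

def test_openrouter_usage_fields_propagate():
    chunk = {"usage": {"prompt_tokens": 1, "completion_tokens": 2,
                       "total_tokens": 3, "cost": 0.00042, "is_byok": True}}
    parsed = parse_usage_chunk(chunk)
    assert parsed["cost"] == 0.00042
    assert parsed["is_byok"] is True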
@daarko10 (Contributor, Author) commented:
@krrishdholakia, the test fails in CI but passes locally; can you please have a look and let me know what I need to fix?

@krrishdholakia (Contributor) commented:
Hey @daarko10, this PR changes a lot of files, including core components that could impact other providers. Can we reduce the scope somehow?

@daarko10 (Contributor, Author) commented:
Unfortunately, @krrishdholakia, this was necessary: the current implementation completely swallows the usage with OpenRouter, since it arrives in the chunk after the one carrying finish_reason. I added another state tracker for the usage and construct it based on whether it arrives or not; make test-unit works nicely locally, and I tested it with OpenRouter plus multiple providers (Anthropic, Gemini).

I also added fallbacks that revert to the earlier code path, preserving the previous behaviour in case the new one fails for whatever reason (a rough sketch follows).
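A rough picture of that fallback, with hypothetical function names; any failure in the new provider-usage path reverts to the legacy computation:

def resolve_usage(new_path, legacy_path):
    try:
        usage = new_path()
        if usage is not None:
            return usage
    except Exception:
        pass  # swallow and revert to the pre-existing behaviour
    return legacy_path()

assert resolve_usage(lambda: None, lambda: {"total_tokens": 1}) == {"total_tokens": 1}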

@matannahmani commented Jun 16, 2025:

Any chance of getting this approved, @krrishdholakia? We use this extensively at kodu-ai and need accurate usage reporting for OpenRouter users.

@krrishdholakia krrishdholakia changed the base branch from main to litellm_openrouter_improvement_staging June 17, 2025 02:24
@krrishdholakia krrishdholakia merged commit 6c18b9a into BerriAI:litellm_openrouter_improvement_staging Jun 17, 2025
5 of 6 checks passed
@daarko10 (Contributor, Author) commented:
Hey @krrishdholakia, I played with that a bit and hit a few inconsistency issues caused by Pydantic validation failures; this change made it stable so it no longer fails.