feat: flatten LiteLLM cache/reasoning usage sub-counts in _usage_to_dict by lucasgomide · Pull Request #6033 · crewAIInc/crewAI

lucasgomide · 2026-06-03T18:37:00Z

LiteLLM returns provider usage as-is, nesting cache-read / cache-creation / reasoning counts under provider-specific shapes (e.g. prompt_tokens_details.cached_tokens, Anthropic-style cache_read_input_tokens). Surface them as flat cached_prompt_tokens / reasoning_tokens / cache_creation_tokens keys so the span pipeline can read them; prompt / completion / total token counts are left untouched.

Note

Low Risk
Observability-only normalization on usage payloads before events; behavior change is copying dicts instead of returning the same reference, with broad test coverage.

Overview
LiteLLM usage dicts are normalized before LLM call events and spans see them. LLM._usage_to_dict no longer returns raw provider shapes unchanged; it copies dict/Pydantic/__dict__ usage into a new dict and promotes cache-read, cache-creation, and reasoning counts from nested or Anthropic-style fields into top-level cached_prompt_tokens, reasoning_tokens, and cache_creation_tokens, matching what BaseLLM._track_token_usage_internal and the span pipeline already expect. Core counts (prompt_tokens, completion_tokens, total_tokens) are not rewritten, and plain usage without those buckets is left without the derived keys.

Tests replace the old “dict pass-through” assertion with coverage that inputs are copied (not mutated), parametrized provider shapes normalize correctly, core totals stay intact, and missing buckets are omitted.

^{Reviewed by Cursor Bugbot for commit 9123b32. Bugbot is set up for automated code reviews on this repo. Configure here.}

Summary by CodeRabbit

Bug Fixes
- Enhanced token usage normalization to ensure consistent handling across different LLM providers, properly flattening nested token details into expected top-level fields for cached tokens, reasoning tokens, and cache creation tokens.
Tests
- Expanded test coverage for token usage handling, adding parametrized tests to verify normalization across various provider formats and confirming core token counts are preserved correctly.

LiteLLM returns provider usage as-is, nesting cache-read / cache-creation / reasoning counts under provider-specific shapes (e.g. prompt_tokens_details.cached_tokens, Anthropic-style cache_read_input_tokens). Surface them as flat cached_prompt_tokens / reasoning_tokens / cache_creation_tokens keys so the span pipeline can read them; prompt / completion / total token counts are left untouched.

coderabbitai · 2026-06-03T18:37:28Z

📝 Walkthrough

Walkthrough

The LLM _usage_to_dict method now normalizes provider usage objects (dict, BaseModel, or plain object instances) into plain dicts and flattens nested token fields into standardized top-level keys (cached_prompt_tokens, reasoning_tokens, cache_creation_tokens) with multi-source fallback precedence. The test suite validates normalization across flat and nested usage shapes while preserving primary token counts.

Changes

Token Usage Normalization

Layer / File(s)	Summary
`_usage_to_dict` normalization implementation `lib/crewai/src/crewai/llm.py`	`LLM._usage_to_dict` converts provider usage objects and dicts to plain dicts, extracts nested/provider-specific token fields (`_tokens_details`, `cache_read_`, `cache_creation_*`) with key precedence lookup, and flattens them into standardized top-level keys while preserving `prompt_tokens`, `completion_tokens`, and `total_tokens`.
Test coverage for usage normalization `lib/crewai/tests/events/test_llm_usage_event.py`	Tests assert flat dicts are returned unchanged without adding derived bucket keys, parametrized variants verify normalization of nested LiteLLM-style buckets into flattened token fields, primary token counts are preserved, and absent bucket inputs do not produce added derived fields.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 Token fields, once scattered and deep,
Now flattened and tidy, a promise to keep,
With nested buckets brought into the light,
Usage normalization shines crystal bright! ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 22.22% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately and specifically describes the main change: flattening LiteLLM cache/reasoning usage sub-counts in the _usage_to_dict method, which aligns with the changeset's core objective.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch luzk/surface-cache-litellm

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (1)

lib/crewai/src/crewai/llm.py (1)
1962-1967: ⚡ Quick win

Be aware: or chaining treats explicit zero as absent.

The precedence chain uses or, so if cached_tokens is explicitly 0, the chain continues and might select a non-zero value from cached_prompt_tokens or other sources. Example: {cached_tokens: 0, cached_prompt_tokens: 5} yields 5.

This mirrors the existing pattern in BaseLLM._track_token_usage_internal (as documented), and in practice providers typically populate only one field, so the risk is low. However, it's worth being aware of this behavior.
📋 Consider adding test coverage for zero-value edge case

Add a test case to document the intended behavior when a field is explicitly 0:
def test_zero_cached_tokens_with_alternative_source():
    """Document behavior when primary source is 0 and alternative exists."""
    usage = {
        "cached_tokens": 0,
        "cached_prompt_tokens": 5,
    }
    result = LLM._usage_to_dict(usage)
    # Current behavior: returns 5 (continues precedence chain)
    # Alternative: could return 0 (stop at first explicit value)
    assert result["cached_prompt_tokens"] == 5  # or 0, depending on intent
This would clarify whether 0 means "no cached tokens" (continue checking) or "explicitly zero cached tokens" (stop here).
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@lib/crewai/src/crewai/llm.py` around lines 1962 - 1967, The current
precedence chain for cached_prompt_tokens uses boolean "or" which treats 0 as
falsy and skips it; update the selection logic in the block that computes
cached_prompt_tokens (referencing variables/data keys cached_tokens,
cached_prompt_tokens, cache_read_input_tokens and helper _nested) to explicitly
check for None (e.g., use "is not None" or a sentinel) so an explicit 0 is
preserved as a valid value; also add a unit test (e.g., in tests covering
LLM._usage_to_dict or the relevant conversion path) that asserts when
{"cached_tokens": 0, "cached_prompt_tokens": 5} the function returns
cached_tokens == 0 (or documents the chosen behavior) to prevent regressions.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@lib/crewai/src/crewai/llm.py`:
- Around line 1962-1967: The current precedence chain for cached_prompt_tokens
uses boolean "or" which treats 0 as falsy and skips it; update the selection
logic in the block that computes cached_prompt_tokens (referencing
variables/data keys cached_tokens, cached_prompt_tokens, cache_read_input_tokens
and helper _nested) to explicitly check for None (e.g., use "is not None" or a
sentinel) so an explicit 0 is preserved as a valid value; also add a unit test
(e.g., in tests covering LLM._usage_to_dict or the relevant conversion path)
that asserts when {"cached_tokens": 0, "cached_prompt_tokens": 5} the function
returns cached_tokens == 0 (or documents the chosen behavior) to prevent
regressions.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 0c8cab37-1177-4c91-8bc9-2c60f0b0d113

📥 Commits

Reviewing files that changed from the base of the PR and between ea88904 and 3da3d46.

📒 Files selected for processing (2)

lib/crewai/src/crewai/llm.py
lib/crewai/tests/events/test_llm_usage_event.py

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 9123b32. Configure here.}

github-actions Bot added the size/M label Jun 3, 2026

coderabbitai Bot reviewed Jun 3, 2026

View reviewed changes

coderabbitai Bot approved these changes Jun 3, 2026

View reviewed changes

mattatcha approved these changes Jun 3, 2026

View reviewed changes

Merge branch 'main' into luzk/surface-cache-litellm

9123b32

cursor Bot reviewed Jun 3, 2026

View reviewed changes

Comment thread lib/crewai/src/crewai/llm.py

gabemilani approved these changes Jun 3, 2026

View reviewed changes

lucasgomide merged commit d09e3f4 into main Jun 3, 2026
56 checks passed

lucasgomide deleted the luzk/surface-cache-litellm branch June 3, 2026 19:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: flatten LiteLLM cache/reasoning usage sub-counts in _usage_to_dict#6033

feat: flatten LiteLLM cache/reasoning usage sub-counts in _usage_to_dict#6033
lucasgomide merged 2 commits into
mainfrom
luzk/surface-cache-litellm

lucasgomide commented Jun 3, 2026 •

edited by cursor Bot

Loading

Uh oh!

coderabbitai Bot commented Jun 3, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

lucasgomide commented Jun 3, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lucasgomide commented Jun 3, 2026 •

edited by cursor Bot

Loading

coderabbitai Bot commented Jun 3, 2026 •

edited

Loading