feat(openai_agents): pull cached tokens through into metrics #364
Merged
Abhijeet Prasad (AbhiPrasad) merged 2 commits into main on Apr 29, 2026
Conversation
Both _response_log_data (Responses API) and _usage_to_metrics (chat-completions / Generation spans) only emitted total / prompt / completion tokens. Cached / reasoning / audio token counts surfaced via the OpenAI usage `*_tokens_details` sub-objects were dropped, so the OpenAI Agents SDK integration never logged metrics like prompt_cached_tokens — even though the OpenAI wrapper already does. Walk *_tokens_details inside _usage_to_metrics (mapping the input/output prefix to prompt/completion to stay consistent with Braintrust's convention) and route _response_log_data through the same helper. Mirrors the JS fix in #1186. Tests cover the four cases from the JS PR: cached tokens present on a Response span, zero is preserved, missing details produces no metric, and Generation spans extract cached tokens too.
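The walk described above can be sketched roughly as follows. The function name matches the PR's `_usage_to_metrics`, but the body is an illustrative assumption based on this description, not the actual braintrust implementation:

```python
def usage_to_metrics(usage):
    """Sketch: flatten an OpenAI usage payload into Braintrust-style metrics,
    including *_tokens_details sub-objects (names/layout assumed)."""
    prefix_map = {"input": "prompt", "output": "completion"}

    def canonical(name):
        # Map OpenAI's input/output prefixes to Braintrust's prompt/completion.
        for src, dst in prefix_map.items():
            if name == src or name.startswith(src + "_"):
                return dst + name[len(src):]
        return name

    metrics = {}
    for key, value in usage.items():
        if key.endswith("_tokens_details") and isinstance(value, dict):
            # e.g. input_tokens_details.cached_tokens -> prompt_cached_tokens
            base = canonical(key[: -len("_tokens_details")])
            for detail_key, detail_value in value.items():
                if isinstance(detail_value, (int, float)):
                    metrics["%s_%s" % (base, detail_key)] = detail_value
        elif isinstance(value, (int, float)):
            metrics[canonical(key)] = value
    return metrics
```

Routing `_response_log_data` through the same helper means the Responses API path picks up the detail fields for free instead of re-extracting the three top-level counts.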
Contributor (Author):

I should've done this a long time ago, when I did this one: braintrustdata/braintrust-sdk-javascript@a05dc4d
Member:

gonna push up some commits re: testing for this! and then we can get it merged in!
Abhijeet Prasad (AbhiPrasad) approved these changes on Apr 29, 2026
Abhijeet Prasad (AbhiPrasad) added a commit that referenced this pull request on Apr 29, 2026
Summary
- Walk `*_tokens_details` sub-objects in `_usage_to_metrics` so the OpenAI Agents SDK integration picks up cached / reasoning / audio token counts (e.g. `input_tokens_details.cached_tokens` → `prompt_cached_tokens`). Mirrors the JS fix in braintrust-sdk-javascript#1186.
- Route `_response_log_data` through `_usage_to_metrics` instead of hardcoding the three `total`/`input`/`output` fields, so the Responses API path benefits from the same extraction.
- `_task_log_data` and `_turn_log_data` already delegated to `_usage_to_metrics`, so they inherit the fix.

Why
A customer reported that cached tokens are not showing up in the Python BraintrustTracingProcessor. The narrow 3-field extraction in `_response_log_data` (Responses API) and `_usage_to_metrics` (chat-completions / Generation spans) drops `input_tokens_details.cached_tokens` even though the OpenAI wrapper (braintrust/oai.py's `_parse_metrics_from_usage`) already handles it correctly. The JS SDK was patched in December but the Python equivalent was never written.

Test plan
- `test_response_span_extracts_cached_tokens_from_usage` — Response span sees `prompt_cached_tokens`
- `test_response_span_handles_zero_cached_tokens` — zero is preserved, not dropped
- `test_response_span_handles_missing_cached_tokens` — no `prompt_cached_tokens` key when details absent
- `test_generation_span_extracts_cached_tokens_from_usage` — Generation span path
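The zero-vs-missing distinction above is the easy one to get wrong: a truthiness check like `if cached_tokens:` silently drops a legitimate count of 0. A minimal self-contained illustration of the behavior the tests pin down (helper name and dict shape are assumptions, not braintrust code):

```python
def extract_cached(details):
    """Emit prompt_cached_tokens only when the field is actually present.
    Presence is tested with `is not None`, not truthiness, so that a
    cached count of 0 survives while absent details produce no metric."""
    metrics = {}
    if details is not None and details.get("cached_tokens") is not None:
        metrics["prompt_cached_tokens"] = details["cached_tokens"]
    return metrics

extract_cached({"cached_tokens": 80})  # metric emitted with value 80
extract_cached({"cached_tokens": 0})   # zero preserved, not dropped
extract_cached(None)                   # no prompt_cached_tokens key at all
```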