Does the Agents / Runs API support prompt caching or expose cached token counts in usage? #4555
annanasi-mon asked this question in Q&A · Unanswered · Replies: 1 comment
From my point of view, this is exactly the kind of observability gap that makes prompt caching hard to operationalize. If cached token counts are hidden behind the Agents and Runs abstraction, teams can only assume caching is helping cost and latency rather than verify it. Even if the underlying API does not expose those fields yet, the docs should say so directly, so people stop hunting for metrics that are not currently available.
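To make the gap concrete, here is a minimal sketch of the kind of defensive check teams end up writing. The helper name is hypothetical (not part of any SDK); it reports a cache hit ratio when a usage object exposes `prompt_tokens_details.cached_tokens` (as Chat Completions can), and `None` when, as with the run usage described in the question, it does not. `SimpleNamespace` stand-ins replace real API responses here.

```python
from types import SimpleNamespace


def cache_hit_ratio(usage):
    """Return the cached fraction of prompt tokens, or None when the
    usage object does not expose cached token counts at all."""
    details = getattr(usage, "prompt_tokens_details", None)
    cached = getattr(details, "cached_tokens", None) if details else None
    if cached is None or not getattr(usage, "prompt_tokens", 0):
        return None
    return cached / usage.prompt_tokens


# Chat Completions-style usage: cached counts are broken out.
chat_usage = SimpleNamespace(
    prompt_tokens=2048,
    prompt_tokens_details=SimpleNamespace(cached_tokens=1024),
)
# Agents/Runs-style usage: only the three aggregate fields.
run_usage = SimpleNamespace(
    prompt_tokens=2048, completion_tokens=200, total_tokens=2248
)

print(cache_hit_ratio(chat_usage))  # 0.5
print(cache_hit_ratio(run_usage))   # None
```

The point of the `None` branch is the observability gap itself: with run usage there is no field to compute from, so dashboards cannot distinguish "no caching" from "caching that is simply not reported".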
Original question:
I’m using the Azure AI Agent client (e.g. agent_framework_azure_ai with AzureAIAgentClient and agent.run()), which uses the Agents Runs API (e.g. runs.stream()). I’m trying to verify prompt cache usage.
Cached token counts in usage
The run completion usage I see only has prompt_tokens, completion_tokens, and total_tokens (e.g. from RunStepCompletionUsage / RunCompletionUsage in the SDK). There is no prompt_tokens_details or cached_tokens (unlike the Chat Completions / Responses API, which can return prompt_tokens_details.cached_tokens).
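Illustratively (field names taken from the question; token values invented), the two usage shapes differ like this:

```python
# Shape of usage on a completed run (RunCompletionUsage / RunStepCompletionUsage):
# only the three aggregate fields. Values here are illustrative.
run_usage = {
    "prompt_tokens": 1500,
    "completion_tokens": 300,
    "total_tokens": 1800,
}

# Shape the Chat Completions / Responses APIs can return:
# cached tokens are broken out under prompt_tokens_details.
chat_usage = {
    "prompt_tokens": 1500,
    "completion_tokens": 300,
    "total_tokens": 1800,
    "prompt_tokens_details": {"cached_tokens": 1024},
}

# The field this question is about is simply absent on the run side:
print("prompt_tokens_details" in run_usage)                   # False
print(chat_usage["prompt_tokens_details"]["cached_tokens"])   # 1024
```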
Can you confirm that the Agents/Runs API does not currently expose cached token counts in usage, and whether there are plans to add this?
Prompt caching support
Is prompt caching (and optional prompt_cache_key-style routing) supported for agent runs at all? If yes, is it documented anywhere and are there plans to expose cache-related fields in run/step usage?
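For comparison, on the Chat Completions side the cache-routing hint is passed as a top-level `prompt_cache_key` request parameter. The sketch below only builds a request payload and makes no API call; the model name and key value are illustrative, and whether agent runs accept anything equivalent is exactly the open question.

```python
# Hypothetical payload sketch: prompt_cache_key-style routing as it looks on
# Chat Completions. No API call is made; values are illustrative.
request = {
    "model": "gpt-4o",  # illustrative model name
    "messages": [{"role": "user", "content": "Summarize the shared context."}],
    # Requests sharing this key (and a common prompt prefix) can be routed to
    # the same cache, improving hit rates for e.g. per-tenant system prompts.
    "prompt_cache_key": "tenant-1234",
}
print(request["prompt_cache_key"])
```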