diff --git a/docs/user-guide/observability-evaluation/traces.md b/docs/user-guide/observability-evaluation/traces.md index a990172b..95518373 100644 --- a/docs/user-guide/observability-evaluation/traces.md +++ b/docs/user-guide/observability-evaluation/traces.md @@ -30,6 +30,8 @@ Each trace consists of multiple spans that represent different operations in you | - gen_ai.usage.completion_tokens: | | - gen_ai.usage.output_tokens: | | - gen_ai.usage.total_tokens: | +| - gen_ai.usage.cache_read_input_tokens: | +| - gen_ai.usage.cache_write_input_tokens: | | | | +-------------------------------------------------------------------------------+ | | | Cycle | | @@ -56,6 +58,8 @@ Each trace consists of multiple spans that represent different operations in you | | | - gen_ai.usage.completion_tokens: | | | | | | - gen_ai.usage.output_tokens: | | | | | | - gen_ai.usage.total_tokens: | | | +| | | - gen_ai.usage.cache_read_input_tokens: | | | +| | | - gen_ai.usage.cache_write_input_tokens: | | | | | +-----------------------------------------------------------------------+ | | | | | | | | +-----------------------------------------------------------------------+ | | @@ -179,6 +183,8 @@ Strands traces include rich attributes that provide context for each operation: | `gen_ai.usage.completion_tokens` | Total tokens used for completions | | `gen_ai.usage.output_tokens` | Total tokens used for completions (duplicate) | | `gen_ai.usage.total_tokens` | Total token usage | +| `gen_ai.usage.cache_read_input_tokens` | Number of input tokens read from cache (Note: Not all model providers support cache tokens. This defaults to 0 in that case) | +| `gen_ai.usage.cache_write_input_tokens` | Number of input tokens written to cache (Note: Not all model providers support cache tokens. This defaults to 0 in that case) | ### Cycle-Level Attributes @@ -209,6 +215,8 @@ Strands traces include rich attributes that provide context for each operation: | `gen_ai.usage.completion_tokens` | Total tokens used for completions | | `gen_ai.usage.output_tokens` | Total tokens used for completions (duplicate) | | `gen_ai.usage.total_tokens` | Total token usage | +| `gen_ai.usage.cache_read_input_tokens` | Number of input tokens read from cache (Note: Not all model providers support cache tokens. This defaults to 0 in that case) | +| `gen_ai.usage.cache_write_input_tokens` | Number of input tokens written to cache (Note: Not all model providers support cache tokens. This defaults to 0 in that case) | ### Tool-Level Attributes