Manual Caching (Anthropic)

https://github.com/posit-dev/chatlas/blob/0ac18d2b393eebd0bccf082fce07027de41069a1/chatlas/_provider_anthropic.py#L614-L616

Many-turn agentic use cases are simply not feasible cost-wise without caching.

Are there any plans to bring in support for manual caching?

Since `Turn`s support the `system` role, I think a relatively straightforward and non-invasive way of doing this would be to expose a callback that runs just before `list[Turn]` is transformed into the provider's own message format (`list[ProviderXYZ's Messages]`).

i.e.

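```py
from typing import List

def cache_last_message(turns: List[Turn]) -> List[Turn]:
    if not turns or not turns[-1].contents:
        return turns

    # Mark the final content block of the last turn as a cache breakpoint.
    turns[-1].contents[-1].cache_control = {"type": "ephemeral"}
    return turns

chat = ChatAnthropic(...)
# Proposed API: set_turn_callback does not exist in chatlas today.
chat.set_turn_callback(cache_last_message)
```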
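To make the idea concrete, here is a minimal self-contained sketch of how such a hook might sit in front of the turn-to-message conversion. `Turn`, `Content`, and `to_request` below are simplified stand-ins written for this example, not chatlas's real classes or internals; the only part taken from Anthropic's API is the `cache_control: {"type": "ephemeral"}` marker on content blocks.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

# Hypothetical stand-ins for illustration only; these are NOT chatlas's classes.
@dataclass
class Content:
    text: str
    cache_control: Optional[dict] = None

@dataclass
class Turn:
    role: str
    contents: list[Content] = field(default_factory=list)

TurnCallback = Callable[[list[Turn]], list[Turn]]

def cache_system_and_last(turns: list[Turn]) -> list[Turn]:
    """Mark the last block of the first system turn and of the final turn."""
    marked: set[int] = set()
    for i, turn in enumerate(turns):
        if turn.role == "system":
            marked.add(i)
            break
    if turns:
        marked.add(len(turns) - 1)
    for i in marked:
        if turns[i].contents:
            turns[i].contents[-1].cache_control = {"type": "ephemeral"}
    return turns

def to_request(turns: list[Turn], callback: Optional[TurnCallback] = None) -> dict:
    """Run the callback (if any), then build Anthropic-style request fields."""
    if callback is not None:
        turns = callback(turns)

    def block(c: Content) -> dict:
        b = {"type": "text", "text": c.text}
        if c.cache_control is not None:
            b["cache_control"] = c.cache_control
        return b

    system = [block(c) for t in turns if t.role == "system" for c in t.contents]
    messages = [
        {"role": t.role, "content": [block(c) for c in t.contents]}
        for t in turns
        if t.role != "system"
    ]
    return {"system": system, "messages": messages}

turns = [
    Turn("system", [Content("You are a helpful agent.")]),
    Turn("user", [Content("Step 1")]),
    Turn("assistant", [Content("Done.")]),
    Turn("user", [Content("Step 2")]),
]
request = to_request(turns, callback=cache_system_and_last)
```

Because the callback only touches `Turn` objects, the provider-specific conversion stays unchanged apart from copying `cache_control` through when it is present.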

Manual Caching (Anthropic) #158


	# N.B. Currently, Anthropic doesn't cache by default and we currently do not support
	# manual caching in chatlas. Note also that this only tracks reads, NOT writes, which
	# have their own cost. To track that properly, we would need another caching category and per-token cost.
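As a rough sketch of what tracking cache writes alongside reads could look like, the snippet below adds a separate per-token price for each category. The `Usage` field names mirror the token counts Anthropic reports in its Messages API usage object as I understand it; the `Pricing` class, `turn_cost` function, and the dollar figures are hypothetical placeholders for this example, not chatlas code or authoritative prices.

```python
from dataclasses import dataclass

@dataclass
class Usage:
    input_tokens: int                 # uncached input tokens
    cache_creation_input_tokens: int  # tokens written to the cache
    cache_read_input_tokens: int      # tokens read from the cache
    output_tokens: int

@dataclass
class Pricing:
    # USD per million tokens; values passed in below are placeholders.
    input: float
    output: float
    cache_write: float
    cache_read: float

def turn_cost(usage: Usage, price: Pricing) -> float:
    """Cost of one request, pricing cache writes and reads separately."""
    total = (
        usage.input_tokens * price.input
        + usage.cache_creation_input_tokens * price.cache_write
        + usage.cache_read_input_tokens * price.cache_read
        + usage.output_tokens * price.output
    )
    return total / 1_000_000

# Illustrative numbers only: writes priced above base input, reads well below.
example_price = Pricing(input=3.0, output=15.0, cache_write=3.75, cache_read=0.30)
example_usage = Usage(
    input_tokens=1_000,
    cache_creation_input_tokens=2_000,
    cache_read_input_tokens=5_000,
    output_tokens=500,
)
cost = turn_cost(example_usage, example_price)
```

Keeping writes as their own category matters because a request that only populates the cache costs more than an uncached one, and the saving shows up only on later reads.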
