Hitting Anthropic API Rate Limits

Are there any plans to add a context compacting feature or something like it?

I have tier 3 Anthropic API rate limits, and on small to mid-size projects I hit the max input tokens per minute pretty regularly (at a total context size of around 30-50k tokens). In practice, this means a session only runs for around 10 minutes before I need to start a new one. I've considered Claude Max, since my Claude Pro account doesn't seem to have this limitation until it hits the hard cap, but I don't use $100 in API credits per month yet, so it's a hard sell.
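To illustrate why the limit bites at this context size, here is a rough sketch of the arithmetic. It assumes every request resends the full context and ignores output tokens and any caching. The limit value is a made-up placeholder, not the actual tier 3 figure, and `max_requests_per_minute` is just a hypothetical helper name.

```python
def max_requests_per_minute(itpm_limit: int, context_tokens: int) -> int:
    """Number of full-context requests that fit under an input-tokens-per-minute limit."""
    if context_tokens <= 0:
        raise ValueError("context_tokens must be positive")
    return itpm_limit // context_tokens


# Placeholder for illustration only; NOT an actual Anthropic rate limit value.
HYPOTHETICAL_ITPM_LIMIT = 100_000

for context_tokens in (30_000, 50_000):
    allowed = max_requests_per_minute(HYPOTHETICAL_ITPM_LIMIT, context_tokens)
    print(f"{context_tokens} tokens of context: {allowed} requests per minute")
```

With a context this large, only a handful of requests fit in each minute regardless of the exact limit, which is why shrinking the context would help more than spacing out requests.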

Hitting Anthropic API Rate Limits #180
