Are there any plans to add a context compacting feature or something like it?
I have tier 3 Anthropic api rate limits, and on small to mid-size projects I'm hitting the max input tokens per minute pretty regularly (at a total context size of around 30-50k tokens). Ultimately for me this means that it'll only run for around 10 minutes before I need to start a new session. I've considered Claude Max, as it seems my Claude pro account doesn't have this same limitation until it hits the hard cap, but I don't use $100 in api credits per month yet so it's a hard sell.
Are there any plans to add a context compacting feature or something like it?
I have tier 3 Anthropic api rate limits, and on small to mid-size projects I'm hitting the max input tokens per minute pretty regularly (at a total context size of around 30-50k tokens). Ultimately for me this means that it'll only run for around 10 minutes before I need to start a new session. I've considered Claude Max, as it seems my Claude pro account doesn't have this same limitation until it hits the hard cap, but I don't use $100 in api credits per month yet so it's a hard sell.