Rate limit across batches, not within batches #235

Open
ianarawjo opened this issue Mar 5, 2024 · 1 comment
Labels: low priority (Would be nice to have, but not mission-critical)

Comments

@ianarawjo (Owner)

ChainForge currently performs rate-limiting, but only within batches of requests to model providers. If queryLLM is called many times with single requests, the rate limit is never enforced. To fix this, we should rate-limit at the level of the API request itself, i.e., inside call_chatgpt or call_anthropic, using a rate-limiting library that enforces a requests-per-minute cap over time.

Rate limits should also be updated to reflect current standards.
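
For illustration, here is a minimal sketch of what per-request limiting could look like on the TypeScript side, assuming a token-bucket style limiter such as the `bottleneck` npm package (an illustrative choice, not necessarily what the frontend branch uses). The 100-requests-per-minute figure and the `rawChatGPTRequest` helper are hypothetical placeholders:

```typescript
import Bottleneck from "bottleneck";

// One limiter per provider, shared by every caller. Because the limiter wraps
// the raw API call, the cap applies across all requests, not just within a batch.
// Hypothetical limit: 100 requests per minute (not an actual provider limit).
const openaiLimiter = new Bottleneck({
  reservoir: 100,                      // 100 requests available...
  reservoirRefreshAmount: 100,         // ...refilled back to 100...
  reservoirRefreshInterval: 60 * 1000, // ...every 60 seconds
  maxConcurrent: 10,                   // optional cap on in-flight requests
});

// Rate-limited wrapper around the actual provider call.
async function call_chatgpt(prompt: string): Promise<string> {
  return openaiLimiter.schedule(() => rawChatGPTRequest(prompt));
}

// Placeholder for the real HTTP request to the OpenAI API.
async function rawChatGPTRequest(prompt: string): Promise<string> {
  return `response to: ${prompt}`;
}
```

With this structure, queryLLM can issue requests one at a time or in batches and the per-minute cap still holds, since every request passes through the same limiter.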

@ianarawjo added the low priority label on Mar 5, 2024
@ianarawjo (Owner, Author)

This change is already complete on the TypeScript frontend branch and will be included when that update is pushed.
