Why can I send multiple requests at once without a TPM limit? #21359
Labels
🔌: openai
Primarily related to OpenAI integrations
🤖:question
A specific question about the codebase, product, project, or how to use a feature
Checked other resources
Example Code
Error Message and Stack Trace (if applicable)
No response
Description
I calculated that each time I submit to GPT, it will cost me prompt: 20214 tokens and completion: 358 tokens. The TPM limit of gpt-4-32k is 80k TPM. So why do I make 7 requests at the same time in the same minute and why are no requests blocked?
System Info
AzureChatOpenAI
langchain
The text was updated successfully, but these errors were encountered: