diff --git a/pages/organizations-and-projects/additional-content/organization-quotas.mdx b/pages/organizations-and-projects/additional-content/organization-quotas.mdx index d77818bb6b..f61a701229 100644 --- a/pages/organizations-and-projects/additional-content/organization-quotas.mdx +++ b/pages/organizations-and-projects/additional-content/organization-quotas.mdx @@ -168,6 +168,7 @@ Managed Inference Deployments are limited to a maximum number of nodes, dependin Generative APIs are rate limited based on: - Tokens per minute (total input and output tokens) - Requests per minute +- Concurrent requests (total active HTTP sessions at the same time) [Contact our support team](https://console.scaleway.com/support/create) if you want to increase your quotas above these limits. @@ -194,6 +195,9 @@ Generative APIs are rate limited based on: | qwen2.5-32b-instruct | 300 | 300 | | bge-multilingual-gemma2 | 300 | 300 | +| Concurrent requests | [Payment method validated](/billing/how-to/add-payment-method/#how-to-add-a-credit-card) | Payment method and [identity validated](/account/how-to/verify-identity/) | +|-------------|:----------------------------------------------------------------------------------------------------------:|:-------------------------------------------------------------:| +| All models | 25 | 25 | ## Apple silicon