Update rate-limits.mdx (#4516)

fpagny · web-flow · commit 55262de7bb1d · 2025-02-28T14:04:33.000+01:00
diff --git a/pages/generative-apis/reference-content/rate-limits.mdx b/pages/generative-apis/reference-content/rate-limits.mdx
@@ -21,28 +21,28 @@ Any model served through Scaleway Generative APIs gets limited by:
 These limits only apply if you created a Scaleway Account and registered a valid payment method. Otherwise, stricter limits apply to ensure usage stays within Free Tier only.
 </Message>
 
+## How can I increase the rate limits?
+
+We actively monitor usage and will improve rates based on feedback.
+If you need to increase your rate limits, [contact our support team](https://console.scaleway.com/support/create), providing details on the model used and specific use case.
+Note that for increases of up to x5 or x10 volumes, we highly recommend using dedicated deployments with [Managed Inference](https://console.scaleway.com/inference/deployments), which provides exactly the same features and API compatibility.
+
 ### Chat models
 
 | Model string | Requests per minute | Total tokens per minute |
 |-----------------|-----------------|-----------------|
-| `llama-3.1-8b-instruct` | 300 | 100K |
-| `llama-3.1-70b-instruct` | 300 | 100K |
-| `mistral-nemo-instruct-2407`| 300 | 100K |
-| `pixtral-12b-2409`| 300 | 100K |
-| `qwen2.5-32b-instruct`| 300 | 100K |
+| `llama-3.1-8b-instruct` | 300 | 200K |
+| `llama-3.1-70b-instruct` | 300 | 200K |
+| `mistral-nemo-instruct-2407`| 300 | 200K |
+| `pixtral-12b-2409`| 300 | 200K |
+| `qwen2.5-32b-instruct`| 300 | 200K |
 
 ### Embedding models 
 
 | Model string | Requests per minute | Input tokens per minute |
 |-----------------|-----------------|-----------------|
-| `sentence-t5-xxl` | 100 | 200K |
-| `bge-multilingual-gemma2` | 100 | 200K |
+| `bge-multilingual-gemma2` | 300 | 400K |
 
 ## Why do we set rate limits?
 
 These limits safeguard against abuse or misuse of Scaleway Generative APIs, helping to ensure fair access to the API with consistent performance.
-
-## How can I increase the rate limits?
-
-We actively monitor usage and will improve rates based on feedback.
-If you need to increase your rate limits, contact us via the support team, providing details on the model used and specific use case.