From ae3d1bde840e49826dddd6c269606c085c2def61 Mon Sep 17 00:00:00 2001
From: fpagny
Date: Mon, 4 Aug 2025 18:14:04 +0200
Subject: [PATCH 1/3] feat(genapi): add qwen3 235b model

---
 .../reference-content/supported-models.mdx | 8 ++------
 1 file changed, 2 insertions(+), 6 deletions(-)

diff --git a/pages/generative-apis/reference-content/supported-models.mdx b/pages/generative-apis/reference-content/supported-models.mdx
index c131e5ca25..7c203134f5 100644
--- a/pages/generative-apis/reference-content/supported-models.mdx
+++ b/pages/generative-apis/reference-content/supported-models.mdx
@@ -24,16 +24,12 @@ Our API supports the most popular models for [Chat](/generative-apis/how-to/quer
 | Meta | `llama-3.3-70b-instruct` | 100k | 4096 | [Llama 3.3 Community](https://www.llama.com/llama3_3/license/) | [HF](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) |
 | Meta | `llama-3.1-8b-instruct` | 128k | 16384 | [Llama 3.1 Community](https://llama.meta.com/llama3_1/license/) | [HF](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) |
 | Mistral | `mistral-nemo-instruct-2407` | 128k | 8192 | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) |
+| Qwen | `qwen3-235b-a22b-instruct-2507` | 40k | 4096 | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507) |
 | Qwen | `qwen2.5-coder-32b-instruct` | 32k | 8192 | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) |
-| DeepSeek (Preview) | `deepseek-r1` | 20k | 4096 | [MIT](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) | [HF](https://huggingface.co/deepseek-ai/DeepSeek-R1) |
 | DeepSeek | `deepseek-r1-distill-llama-70b` | 32k | 4096 | [MIT](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) | [HF](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) |

-
- DeepSeek-R1 is currently under maintenance and therefore not available on Generative APIs anymore.
-
-
- If you are unsure which chat model to use, we currently recommend Llama 3.1 8B Instruct (`llama-3.1-8b-instruct`) to get started.
+ If you are unsure which chat model to use, we currently recommend Mistral Small 3.1 24B Instruct (`mistral-small-3.1-24b-instruct-2503`) to get started.

 ## Vision models

From 16d0f54ffef2bd5feebdb1efaf085c3c207ab817 Mon Sep 17 00:00:00 2001
From: fpagny
Date: Mon, 4 Aug 2025 18:29:12 +0200
Subject: [PATCH 2/3] feat(inference): add qwen3 235b model

---
 .../reference-content/model-catalog.mdx | 14 ++++++++++++++
 1 file changed, 14 insertions(+)

diff --git a/pages/managed-inference/reference-content/model-catalog.mdx b/pages/managed-inference/reference-content/model-catalog.mdx
index 1411911b1c..cb81a82ff6 100644
--- a/pages/managed-inference/reference-content/model-catalog.mdx
+++ b/pages/managed-inference/reference-content/model-catalog.mdx
@@ -16,6 +16,7 @@ A quick overview of available models in Scaleway's catalog and their core attrib
 | Model name | Provider | Maximum Context length (tokens) | Modalities | Compatible Instances (Max Context in tokens\*) | License |
 |------------|----------|--------------|------------|-----------|---------|
+| [`qwen3-235b-a22b-instruct-2507`](#qwen3-235b-a22b-instruct-2507) | Qwen | 40k | Text | H100-2 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
 | [`gemma-3-27b-it`](#gemma-3-27b-it) | Google | 40k | Text, Vision | H100, H100-2 | [Gemma](https://ai.google.dev/gemma/terms) |
 | [`llama-3.3-70b-instruct`](#llama-33-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.3 Community](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) |
 | [`llama-3.1-70b-instruct`](#llama-31-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.1 Community](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE) |
@@ -44,6 +45,7 @@ A quick overview of available models in Scaleway's catalog and their core attrib
 ## Models feature summary
 | Model name | Structured output supported | Function calling | Supported languages |
 | --- | --- | --- | --- |
+| `qwen3-235b-a22b-instruct-2507` | Yes | Yes | English, French, German, Chinese, Japanese, Korean and 113 additional languages and dialects |
 | `gemma-3-27b-it` | Yes | Partial | English, Chinese, Japanese, Korean and 31 additional languages |
 | `llama-3.3-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
 | `llama-3.1-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
@@ -145,6 +147,18 @@ allenai/molmo-72b-0924:fp8
 ## Text models

+### Qwen3-235b-a22b-instruct-2507
+Released July 23, 2025, Qwen 3 235B A22B is an open-weight model, competitive in multiple benchmarks (such as [LM Arena for Text use cases](https://lmarena.ai/leaderboard)) compared to Gemini 2.5 Pro and GPT4.5.
+
+| Attribute | Value |
+|-----------|-------|
+| Supports parallel tool calling | Yes |
+
+#### Model name
+```
+qwen/qwen3-235b-a22b-instruct-2507:awq
+```
+
 ### Llama-3.3-70b-instruct
 Released December 6, 2024, Meta’s Llama 3.3 70b is a fine-tune of the [Llama 3.1 70b](/managed-inference/reference-content/model-catalog/#llama-31-70b-instruct) model. This model is still text-only (text in/text out). However, Llama 3.3 was designed to approach the performance of Llama 3.1 405B on some applications.

From 41a647d4763d5f9f26ed056318cdb812591aaed8 Mon Sep 17 00:00:00 2001
From: fpagny
Date: Tue, 5 Aug 2025 15:49:05 +0200
Subject: [PATCH 3/3] Update pages/managed-inference/reference-content/model-catalog.mdx

Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com>
---
 pages/managed-inference/reference-content/model-catalog.mdx | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/pages/managed-inference/reference-content/model-catalog.mdx b/pages/managed-inference/reference-content/model-catalog.mdx
index cb81a82ff6..c985489aac 100644
--- a/pages/managed-inference/reference-content/model-catalog.mdx
+++ b/pages/managed-inference/reference-content/model-catalog.mdx
@@ -148,7 +148,7 @@ allenai/molmo-72b-0924:fp8
 ## Text models

 ### Qwen3-235b-a22b-instruct-2507
-Released July 23, 2025, Qwen 3 235B A22B is an open-weight model, competitive in multiple benchmarks (such as [LM Arena for Text use cases](https://lmarena.ai/leaderboard)) compared to Gemini 2.5 Pro and GPT4.5.
+Released July 23, 2025, Qwen 3 235B A22B is an open-weight model, competitive in multiple benchmarks (such as [LM Arena for text use cases](https://lmarena.ai/leaderboard)) compared to Gemini 2.5 Pro and GPT4.5.

 | Attribute | Value |
 |-----------|-------|