Skip to content

Conversation

@fpagny
Copy link
Contributor

@fpagny fpagny commented Mar 4, 2025

Update supported models in Managed Inference.
Added fp8 quantization support for three models and support for Mistral Small 24B Instruct.

@fpagny fpagny requested a review from bene2k1 as a code owner March 4, 2025 12:48
…struct-2501.mdx

Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com>
@ldecarvalho-doc ldecarvalho-doc changed the title Update deepseek-r1-distill-llama-70b.mdx fix(genapi): update deepseek-r1-distill-llama-70b.mdx Mar 5, 2025
@bene2k1 bene2k1 merged commit df0a872 into main Mar 5, 2025
3 checks passed
@bene2k1 bene2k1 deleted the fpagny-patch-4 branch March 5, 2025 12:05
bene2k1 added a commit that referenced this pull request Mar 12, 2025
* Update deepseek-r1-distill-llama-70b.mdx

Update supported models in Managed Inference.

* Update deepseek-r1-distill-llama-8b.mdx

* Update llama-3.3-70b-instruct.mdx

* Create mistral-small-24b-instruct-2501.mdx

* Update pages/managed-inference/reference-content/mistral-small-24b-instruct-2501.mdx

Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com>

---------

Co-authored-by: Benedikt Rollik <brollik@scaleway.com>
Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants