model: support Ministral3 #17644
Conversation
```diff
 @ModelBase.register("Mistral3ForConditionalGeneration")
 class Mistral3Model(LlamaModel):
-    model_arch = gguf.MODEL_ARCH.LLAMA
+    model_arch = gguf.MODEL_ARCH.MISTRAL3
```
Note for maintainers: while ministral3 and the old Mistral models have almost the same cgraph, the hparams handling in llama_model::load_hparams is considerably more complicated. Therefore, it's better to separate the two archs to keep the code readable.
This also makes the code more future-proof, in case future Mistral models become significantly more complicated than the traditional llama arch.
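For reference, a rough sketch of what registering the new architecture on the Python (gguf-py) side could look like; the enum layout and name strings below are illustrative assumptions, not the exact upstream code:

```python
from enum import IntEnum, auto

# Hypothetical excerpt in the spirit of gguf-py/gguf/constants.py:
# MISTRAL3 gets its own entry even though its cgraph is close to LLAMA,
# so that llama_model::load_hparams can branch on the arch cleanly.
class MODEL_ARCH(IntEnum):
    LLAMA = auto()
    MISTRAL3 = auto()

MODEL_ARCH_NAMES: dict[MODEL_ARCH, str] = {
    MODEL_ARCH.LLAMA: "llama",
    MODEL_ARCH.MISTRAL3: "mistral3",
}
```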
```python
# for compatibility, we use LLAMA arch for older models
# TODO: remove this once everyone has migrated to newer version of llama.cpp
if self.hparams.get("model_type") != "ministral3":
    self.model_arch = gguf.MODEL_ARCH.LLAMA
    self.gguf_writer.arch = str(self.model_arch)
    self.gguf_writer.add_architecture()
    self.tensor_map = gguf.get_tensor_name_map(self.model_arch, self.block_count)
```
I think a time frame of ~1 week could be a reasonable timeline to remove this.
This covers the case where users run a newer version of the conversion script (e.g. via gguf-my-repo) to convert old models while their local llama.cpp build is not yet up to date.
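As an illustration of what the fallback means for downstream files, one way to check which architecture a converted GGUF ended up with is the gguf-py reader (a rough sketch; the filename and the string-decoding detail are assumptions and may differ across gguf-py versions):

```python
from gguf import GGUFReader  # gguf-py package shipped with llama.cpp

# Hypothetical output file from a conversion run.
reader = GGUFReader("ministral3-14b-instruct-f16.gguf")

# Older models keep "llama" here for compatibility; ministral3 checkpoints
# converted with the new script should report "mistral3".
arch_field = reader.fields["general.architecture"]
# String fields keep their payload in the last part (assumption based on how
# gguf-py's own dump script decodes them).
print(bytes(arch_field.parts[-1]).decode("utf-8"))
```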
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Ref upstream PR: huggingface/transformers#42498
Disclosure: This PR is made with collaboration from Mistral. Huge thanks to @juliendenize for coordination!
Note: The model weights are not yet released
PPL results for the 14B model (`-Instruct` variant, f16, ctx=32000, batch=8192): `Final estimate: PPL = 5.5389 +/- 0.03163`