
Conversation

@patrickvonplaten
Contributor

@patrickvonplaten patrickvonplaten commented Nov 13, 2025

What does this PR do?

This PR makes sure that:

from transformers import AutoTokenizer

# Both repos should now resolve through the AutoTokenizer auto mapping
tok = AutoTokenizer.from_pretrained("mistralai/Mistral-Small-3.2-24B-Instruct-2506")
tok = AutoTokenizer.from_pretrained("mistralai/Ministral-8B-Instruct-2410")

works correctly
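
A quick way to check what the mapping resolves to (a minimal sketch; it assumes mistral_common is installed, and the class name you see printed, e.g. MistralCommonTokenizer, depends on your installed extras rather than on anything stated in this PR):

from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("mistralai/Ministral-8B-Instruct-2410")
# Inspect which tokenizer class the auto mapping resolved to
print(type(tok).__name__)
# Quick encode to confirm the tokenizer is actually usable
print(tok.encode("Hello, world!")[:5])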

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@patrickvonplaten patrickvonplaten changed the title from WIP to Add AutoTokenizer mapping for mistral3 and ministral Nov 13, 2025
@patrickvonplaten
Contributor Author

None of the test failures look related

@ArthurZucker ArthurZucker added the "for patch" label (tag issues / labels that should be included in the next patch) Nov 14, 2025
@ArthurZucker ArthurZucker merged commit 7607d80 into huggingface:main Nov 14, 2025
18 of 21 checks passed
Contributor

@vasqu vasqu left a comment

The auto mapping is incorrect and only works with mistral_common; see #41553 (comment). This is probably an issue for multiple Mistral-related models now.

TL;DR: it defines a Llama backup tokenizer, but the files needed for that backup to work are not uploaded to the Hub.
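
For context, a rough sketch (an assumption on my part, not the merged code) of what a TOKENIZER_MAPPING_NAMES entry with a Llama fallback looks like in transformers/models/auto/tokenization_auto.py; the problem described above is that these fallback classes need tokenizer.model / tokenizer.json files on the Hub, which the mistral_common-only repos do not ship:

from collections import OrderedDict

# Illustrative only: the exact class names and model-type keys are assumptions
TOKENIZER_MAPPING_NAMES = OrderedDict(
    [
        # ...
        ("ministral", ("LlamaTokenizer", "LlamaTokenizerFast")),
        ("mistral3", ("LlamaTokenizer", "LlamaTokenizerFast")),
        # ...
    ]
)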

@ArthurZucker
Collaborator

Yep, but we need to sort that out on our side IMO!

