Move VLM conversions to the main mapping #44627
Cyrilvallez merged 20 commits into huggingface:main from
src/transformers/models/ernie4_5_vl_moe/modeling_ernie4_5_vl_moe.py
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
run-slow: llava, qwen2_vl, paligemma, gemma3
This comment contains models: ["models/gemma3", "models/llava", "models/paligemma", "models/qwen2_vl"]
Cursor Bugbot has reviewed your changes and found 1 potential issue.
src/transformers/models/sam3_tracker_video/modeling_sam3_tracker_video.py
run-slow: colmodernvbert got_ocr2 paligemma qwen3_5_moe llava_next_video fuyu qwen2_5_vl cohere2_vision lfm2_vl llava_next ovis2 mllama shieldgemma2 qwen3_vl_moe glm4v_moe glm46v aria gemma3 llava llava_onevision mistral3 sam3_tracker fast_vlm florence2 colpali glm4v emu3 aya_vision qwen3_vl vipllava paddleocr_vl lighton_ocr video_llava glm_ocr colqwen2 ernie4_5_vl_moe video_llama_3 glm_image qwen3_5 internvl sam3 gemma3n sam3_tracker_video qwen2_vl perception_lm
This comment contains models: ["models/aria", "models/aya_vision", "models/cohere2_vision", "models/colmodernvbert", "models/colpali", "models/colqwen2", "models/emu3", "models/ernie4_5_vl_moe", "models/fast_vlm", "models/florence2", "models/fuyu", "models/gemma3", "models/gemma3n", "models/glm46v", "models/glm4v", "models/glm4v_moe", "models/glm_image", "models/glm_ocr", "models/got_ocr2", "models/internvl", "models/lfm2_vl", "models/lighton_ocr", "models/llava", "models/llava_next", "models/llava_next_video", "models/llava_onevision", "models/mistral3", "models/mllama", "models/ovis2", "models/paddleocr_vl", "models/paligemma", "models/perception_lm", "models/qwen2_5_vl", "models/qwen2_vl", "models/qwen3_5", "models/qwen3_5_moe", "models/qwen3_vl", "models/qwen3_vl_moe", "models/sam3", "models/sam3_tracker", "models/sam3_tracker_video", "models/shieldgemma2", "models/video_llama_3", "models/video_llava", "models/vipllava"]
Cyrilvallez left a comment
Alright, just pushed a few changes to simplify the loading logic a bit!
LGTM otherwise, let's just wait for the slow tests as touching the mappings can silently break all the real-world weights!
    "timm_wrapper": [
        # Simply add the prefix `timm_model`
        # TODO: would probably be much cleaner with an `add_prefix` argument in WeightRenaming
        WeightRenaming(
            source_patterns=r"(.+)",
            target_patterns=r"timm_model.\1",
        )
    ],
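As an illustration of what the renaming above does (not the actual transformers loading code), the rename is effectively a regex substitution that prepends the prefix to every checkpoint key; a minimal sketch, where `rename_key` is an assumed helper and the key names are made up:

```python
import re

# Illustrative helper, not the real transformers implementation: apply a
# single WeightRenaming-style (source, target) regex pair to one key.
def rename_key(key: str, source_pattern: str, target_pattern: str) -> str:
    return re.sub(source_pattern, target_pattern, key)

# `(.+)` captures the whole key, and the target prepends `timm_model.` to it.
keys = ["patch_embed.proj.weight", "blocks.0.attn.qkv.weight"]
renamed = [rename_key(k, r"(.+)", r"timm_model.\1") for k in keys]
print(renamed)
# ['timm_model.patch_embed.proj.weight', 'timm_model.blocks.0.attn.qkv.weight']
```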
Are you really sure about this one? Does the base_model_prefix really cover it completely, even when used as a submodel, etc.?
CI Results (Commit Info)
The test failure analysis could not be completed. Please check the workflow run for details.
run-slow: colmodernvbert, got_ocr2, paligemma, qwen2_5_vl, cohere2_vision, lfm2_vl, llava, shieldgemma2, sam3_tracker, paddleocr_vl, colpali
This comment contains models: ["models/cohere2_vision", "models/colmodernvbert", "models/colpali", "models/got_ocr2", "models/lfm2_vl", "models/llava", "models/paddleocr_vl", "models/paligemma", "models/qwen2_5_vl", "models/sam3_tracker", "models/shieldgemma2"]
run-slow: aria, aya_vision, cohere2_vision, colmodernvbert, colpali, colqwen2, emu3, ernie4_5_vl_moe, fast_vlm, florence2, fuyu, gemma3, gemma3n, glm46v, glm4v, glm4v_moe |
This comment contains models: ["models/aria", "models/aya_vision", "models/cohere2_vision", "models/colmodernvbert", "models/colpali", "models/colqwen2", "models/emu3", "models/ernie4_5_vl_moe", "models/fast_vlm", "models/florence2", "models/fuyu", "models/gemma3", "models/gemma3n", "models/glm46v", "models/glm4v", "models/glm4v_moe"]
CI Results (Commit Info)
Model CI Report: ❌ 5 new failed tests from this PR 😭
Emu3 fails on main as well because of OOM, and locally the loading works without missing keys.
Cyrilvallez left a comment
Alright thanks! Let's keep an eye on tomorrow's slow CI run though to be sure!
[For maintainers] Suggested jobs to run (before merge):
run-slow: aria, aya_vision, cohere2_vision, colmodernvbert, colpali, colqwen2, emu3, ernie4_5_vl_moe, fast_vlm, florence2, fuyu, gemma3, gemma3n, glm46v, glm4v, glm4v_moe
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44627&sha=90f253
What does this PR do?
The diff in the revert mapping is needed; otherwise we get failures in a few models, see https://app.circleci.com/pipelines/github/huggingface/transformers/167425/workflows/fa96efe5-f810-408e-bafd-de03b7e881aa/jobs/2208432/tests
Matching a literal dot is a nice feature to have in general: we actually mean a literal dot in all of the patterns where `.` is used. The begin and end anchors are needed when reverse-mapping from `^model.language_model` to `model`: the state dict can have other keys such as `pre_layer_proj_model`, which would be reversed incorrectly if the pattern is not marked explicitly with `^` or `$`.
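A small sketch of the anchoring issue described above, using plain `re.sub` and made-up key names (they are illustrative, not from a real checkpoint):

```python
import re

keys = [
    "model.language_model.layers.0.mlp.up_proj.weight",
    "pre_layer_proj_model.language_model.weight",  # hypothetical clashing key
]

# Without `^`, the reverse pattern also matches inside the second key
# (and an unescaped `.` would match any character), so it gets mangled.
unanchored = [re.sub(r"model\.language_model", "model", k) for k in keys]

# With `^`, only keys that actually start with the prefix are rewritten.
anchored = [re.sub(r"^model\.language_model", "model", k) for k in keys]

print(unanchored)  # both keys rewritten; the second one incorrectly
print(anchored)    # only the first key rewritten
```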