Let's break Qwen-VL 🚨 #42420
Conversation
run-slow: qwen2_5_vl, qwen2_vl
This comment contains models: ["models/qwen2_5_vl", "models/qwen2_vl"]
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
CI Results: ✅ No failing test specific to this PR 🎉!
…Reported by TRL in the past
CI Results: Model CI Report ❌ Failed tests
```python
# Hub configs are saved as flat dicts, so we pop some of the kwargs to init `TextConfig`
text_params = inspect.signature(self.sub_configs["text_config"].__init__).parameters.keys()
text_params = list(text_params) + ["rope_scaling", "rope_theta"]
text_config = {key: kwargs.pop(key) for key in text_params if key in kwargs}
text_config["dtype"] = kwargs.get("torch_dtype", kwargs.get("dtype"))  # don't pop the dtype
```
Not the best way, I should admit. The problem is that the config will assign all kwargs as attributes, and if we don't pop them we end up with the same set of kwargs in the text config and in the general config.
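The point above can be sketched with a minimal, self-contained example (the class names here are hypothetical, not the actual `transformers` classes): a composite config pops the sub-config's parameters out of the flat kwargs dict before assigning whatever remains as top-level attributes, so each key ends up in exactly one place.

```python
import inspect

# Hypothetical minimal configs illustrating the issue described above.
class TextConfig:
    def __init__(self, vocab_size=32000, hidden_size=1024, **kwargs):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size

class CompositeConfig:
    def __init__(self, **kwargs):
        # Hub configs arrive flattened, so pull out the keys that belong
        # to TextConfig before assigning the rest to the top-level config.
        text_params = inspect.signature(TextConfig.__init__).parameters.keys()
        text_kwargs = {k: kwargs.pop(k) for k in list(text_params) if k in kwargs}
        self.text_config = TextConfig(**text_kwargs)
        # Whatever was not popped becomes a top-level attribute.
        for key, value in kwargs.items():
            setattr(self, key, value)

cfg = CompositeConfig(vocab_size=50000, my_custom_flag=True)
# vocab_size lives only on the sub-config; it is not duplicated at the top level.
assert cfg.text_config.vocab_size == 50000
assert not hasattr(cfg, "vocab_size")
assert cfg.my_custom_flag is True
```

Without the `kwargs.pop(k)` call, `vocab_size` would be set both on `cfg.text_config` and on `cfg` itself, which is exactly the duplication described above.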
Thanks for addressing this issue, @zucchini-nlp. 🤗 For completeness, here is the original issue we opened in TRL that tracked this problem:
ArthurZucker left a comment:
Thanks! Let's add a note in the migration guide, please!
[For maintainers] Suggested jobs to run (before merge): run-slow: qwen2_5_vl, qwen2_vl
What does this PR do?
As per the title, break the cycle. Qwen will no longer expose text attributes via `config.vocab_size`, which solves the duplicated-attribute issue we had in TRL. A config should not have two different values for the same key.
Tested with the TRL issue that there is no regression, and ran the slow tests locally.
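To illustrate the "two different values for the same key" failure mode this PR removes, here is a hedged sketch with hypothetical class names (not the actual library API): when an attribute is mirrored from a sub-config onto the parent config, updating one copy silently leaves the other stale.

```python
# Sketch (hypothetical classes) of why mirrored config attributes are dangerous.
class TextConfig:
    def __init__(self, vocab_size=32000):
        self.vocab_size = vocab_size

class MirroredConfig:
    def __init__(self, text_config):
        self.text_config = text_config
        # Mirrored copy of the sub-config attribute on the parent.
        self.vocab_size = text_config.vocab_size

cfg = MirroredConfig(TextConfig())
cfg.vocab_size = 151936  # e.g. updated by a trainer after resizing embeddings

# The two copies now disagree -- the duplicated-attribute bug described above.
assert cfg.vocab_size == 151936
assert cfg.text_config.vocab_size == 32000
assert cfg.vocab_size != cfg.text_config.vocab_size
```

After the change in this PR, the value has a single source of truth on the sub-config, so callers must read it from there and this divergence cannot occur.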