[tests] fix blip2 edge case #40699
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

run-slow: blip_2

This comment contains run-slow, running the specified jobs: models: ['models/blip_2']
"I will remove it now.\n\n" | ||
"See https://github.com/pypa/pip/issues/5466 for details.\n" | ||
).format(stale_egg_info) | ||
f"Warning: {stale_egg_info} exists.\n\n" |
`make fixup` corrected this 👀
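For context, a minimal sketch of what the corrected block plausibly looks like after the f-string conversion (the surrounding `setup.py` logic and the middle message lines are assumptions; only the quoted lines in the diff above are from the PR):

```python
from pathlib import Path

# Assumed surrounding logic: warn about, then remove, a stale egg-info
# directory left behind by older packaging tools.
stale_egg_info = Path(__file__).parent / "transformers.egg-info"
if stale_egg_info.exists():
    print(
        f"Warning: {stale_egg_info} exists.\n\n"  # f-string replaces the old .format() call
        "I will remove it now.\n\n"
        "See https://github.com/pypa/pip/issues/5466 for details.\n"
    )
```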
```diff
         pass

+    @parameterized.expand(TEST_EAGER_MATCHES_SDPA_INFERENCE_PARAMETERIZATION)
+    @unittest.skip("Won't fix: Blip2 + T5 backbone needs custom input preparation for this test")
```
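A note on the decorator stacking above: since `@unittest.skip` sits below `@parameterized.expand`, the skip marker is applied to the underlying test function before it is expanded, so every generated parameterization is skipped. A minimal sketch with hypothetical parameters:

```python
import unittest
from parameterized import parameterized

class ExampleTest(unittest.TestCase):
    # The inner skip marks the function itself; expand() then clones it,
    # so each generated test (test_thing_0_a, test_thing_1_b) is skipped.
    @parameterized.expand([("a",), ("b",)])
    @unittest.skip("illustrative skip reason")
    def test_thing(self, name):
        self.fail("never runs")

if __name__ == "__main__":
    unittest.main()
```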
can be fixed, but I don't think it's worth it 👀
agreed, let's assume that t5 passes the test so we are fine :)
run-slow: blip_2

This comment contains run-slow, running the specified jobs: models: ['models/blip_2']
Thanks, it is much better to have it at the model config level. Can you also verify how Florence2 works with this PR? Those are the only two VLMs with an encoder-decoder backbone, afaik.
```diff
-        if model.config.get_text_config(decoder=True).is_encoder_decoder:
+        if model.config.is_encoder_decoder:
             self.assertTrue(output_generate.sequences.shape[1] == self.max_new_tokens + 1)
```
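For readers wondering about the `+ 1`: for encoder-decoder models, `generate()` seeds the decoder with `decoder_start_token_id`, and that token is included in the returned sequences, so the output length is `max_new_tokens` plus one. An illustrative sketch with made-up token ids:

```python
# Made-up ids, just to illustrate the length bookkeeping the assertion checks.
max_new_tokens = 4
decoder_start = [0]          # e.g. decoder_start_token_id for a T5-style model
new_tokens = [71, 12, 5, 1]  # the max_new_tokens freshly generated ids
sequences = decoder_start + new_tokens
assert len(sequences) == max_new_tokens + 1
```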
Cool, thanks. Can you check if the Florence2 model works with these changes? It is the same as BLIP and uses Bart as its backbone.
```diff
-            [0, 3, 7, 152, 2515, 11389, 3523, 1],
-            "san francisco",  # TODO: check if this is ok
-        ),
+        ("cuda", None): ([0, 3, 7, 152, 2515, 11389, 3523, 1], "san francisco"),
```
I believe the CUDA expectation was incorrect from the beginning.
yes, I agree 👀
run-slow: florence2

This comment contains run-slow, running the specified jobs: models: ['models/florence2']
Forgot to ✅
@zucchini-nlp florence2 slow tests were mostly green (there were 3 FA2 failures, which are unrelated to this PR) ✅

In florence2, …
[For maintainers] Suggested jobs to run (before merge)

run-slow: blip_2
What does this PR do?

(Carved from #40553, which is becoming messy)

Our tests have the `model.config.get_text_config(decoder=True).is_encoder_decoder` pattern, which doesn't make sense -- we're pulling the decoder if it exists, and then checking whether it is encoder-decoder. The pattern exists because of `blip2`, which wasn't setting `is_encoder_decoder` correctly -- if its LLM is an encoder-decoder model, then it is also an encoder-decoder model.

✅ blip2 slow tests are passing
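A minimal sketch of the kind of fix the description implies (class and attribute names here are hypothetical; the real change lives in the Blip2 config):

```python
# Hypothetical composite VLM config that mirrors its text backbone's flag,
# so tests can check model.config.is_encoder_decoder directly.
class CompositeVLMConfig:
    def __init__(self, text_config):
        self.text_config = text_config
        # If the LLM backbone (e.g. T5) is encoder-decoder, the composite
        # model is too; propagate the flag to the top-level config.
        self.is_encoder_decoder = getattr(text_config, "is_encoder_decoder", False)
```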