[gemma4] infer from config instead of hardcoding by eustlb · Pull Request #45606 · huggingface/transformers

eustlb · 2026-04-23T14:42:54Z

What does this PR do?

As per title. Fix #45468

HuggingFaceDocBuilderDev · 2026-04-23T14:56:14Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

mathceo · 2026-04-23T14:57:43Z

@eustlb thanks for the clarification and the fix - context_size // 2 + 1 makes sense here given the blocked attention path and _rel_shift.

One thing that may still be useful is a small regression test for nondefault audio configs, so this does not silently stay tied to the default length again.

Something along these lines in tests/models/gemma4/test_modeling_gemma4.py:

def test_audio_rel_pos_encoding_uses_context_size_from_config(self):
    from transformers.models.gemma4.configuration_gemma4 import Gemma4AudioConfig
    from transformers.models.gemma4.modeling_gemma4 import Gemma4AudioRelPositionalEncoding

    # Non-default audio config, so the test cannot pass by matching a hardcoded default length.
    config = Gemma4AudioConfig(
        hidden_size=32,
        attention_chunk_size=6,
        attention_context_left=5,
        attention_context_right=1,
        use_clipped_linears=False,
    )

    module = Gemma4AudioRelPositionalEncoding(config)
    hidden_states = torch.zeros(1, 3, config.hidden_size)

    pos = module(hidden_states)

    # The number of relative positions must follow from the config: context_size // 2 + 1.
    context_size = config.attention_chunk_size + config.attention_context_left - 1 + config.attention_context_right
    expected_len = context_size // 2 + 1

    self.assertEqual(pos.shape, (1, expected_len, config.hidden_size))

    # Rebuild the expected sinusoidal table; positions count down from context_size // 2 to 0.
    position_ids = torch.arange(context_size // 2, -1, -1, device=hidden_states.device)[..., None]
    scaled_time = position_ids * module.inv_timescales.to(device=hidden_states.device)
    expected = torch.cat([torch.sin(scaled_time), torch.cos(scaled_time)], dim=-1).to(hidden_states.dtype)

    torch.testing.assert_close(pos, expected)

Happy to open a follow-up PR for the test as well if that is preferred.
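
For reference, the length arithmetic used in the test above can be checked by hand without loading the model. This is only a sketch in plain Python: the helper names `rel_pos_length` and `rel_position_ids` are hypothetical and not part of transformers, and the formulas are copied from the test snippet rather than from the modeling code.

```python
def rel_pos_length(chunk_size: int, context_left: int, context_right: int) -> tuple[int, int]:
    """Return (context_size, expected_len) using the formulas from the test snippet."""
    # Same expression as in the test: chunk + left - 1 + right.
    context_size = chunk_size + context_left - 1 + context_right
    # Number of relative positions the encoding is expected to produce.
    expected_len = context_size // 2 + 1
    return context_size, expected_len


def rel_position_ids(context_size: int) -> list[int]:
    """Positions count down from context_size // 2 to 0, mirroring the torch.arange call."""
    return list(range(context_size // 2, -1, -1))


# Values from the non-default config in the test (6, 5, 1).
context_size, expected_len = rel_pos_length(chunk_size=6, context_left=5, context_right=1)
position_ids = rel_position_ids(context_size)

print(context_size)   # 11
print(expected_len)   # 6
print(position_ids)   # [5, 4, 3, 2, 1, 0]
```

With these values the expected output shape in the test is `(1, 6, 32)`, so a regression back to a hardcoded length would only pass if that constant happened to equal 6.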

vasqu

Thanks! Like @mathceo also mentioned, let's add a small regression test just in case 🫡

mathceo · 2026-04-23T17:16:52Z

I already implemented the regression test in #45607. Since #45606 is not my branch, I can't push directly there from my side. Feel free to cherry-pick the test commit from #45607
@eustlb


eustlb · 2026-04-24T09:23:10Z

thanks @mathceo! added you as a co-author here

vasqu · 2026-04-27T12:20:47Z

run-slow: gemma4

github-actions · 2026-04-27T12:21:27Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma4

github-actions · 2026-04-27T12:22:12Z

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/gemma4"]
quantizations: []

github-actions · 2026-04-27T12:52:22Z

CI Results

Workflow Run ⚙️

Commit Info

Context	Commit	Description
RUN	32ada79d	workflow commit (merge commit)
PR	d404da97	branch commit (from PR)
main	bbb51c83	base commit (on `main`)

Model CI Report

❌ 1 new failed tests from this PR 😭

gemma4:
tests/models/gemma4/test_modeling_gemma4.py::Gemma4IntegrationTest::test_export_text_only (❌ ⟹ ❌)

vasqu · 2026-04-27T12:58:17Z

The different failure is a different PID in the test itself lol, merging

* infer from config instead of hardcoding
* Update test_modeling_gemma4.py
* Update modeling_gemma4.py
* Update modeling_gemma4.py
* Update modeling_gemma4.py
* make style
* add small docstring for reference

---------

Co-authored-by: omar zoloev <ozoloevwork@gmail.com>
Co-authored-by: vasqu <antonprogamer@gmail.com>

infer from config instead of hardcoding

5233d19

eustlb requested a review from vasqu April 23, 2026 14:42

eustlb mentioned this pull request Apr 23, 2026

[BUG] Gemma-4 Gemma4AudioRelPositionalEncoding #45468

Closed

4 tasks

Update test_modeling_gemma4.py

535353c

vasqu approved these changes Apr 23, 2026


mathceo mentioned this pull request Apr 23, 2026

Add regression test for Gemma4 audio relative positional range #45607

Closed

mathceo added 3 commits April 23, 2026 19:38

Update modeling_gemma4.py

8aa98af

Update modeling_gemma4.py

3a6ec39

Update modeling_gemma4.py

80f2d35

eustlb and others added 2 commits April 24, 2026 11:17

Merge branch 'main' into remove-hardcoded-rel-pos-gemma4

bbe5074

Merge branch 'gemma4-audio-rel-pos-test' into remove-hardcoded-rel-po…

cc87b12

…s-gemma4 Co-Authored-By: Omar Zoloev <ozoloevwork@gmail.com>

eustlb force-pushed the remove-hardcoded-rel-pos-gemma4 branch from 05e6e94 to cc87b12 Compare April 24, 2026 09:22

eustlb and others added 2 commits April 24, 2026 11:23

make style

da66c69

add small docstring for reference

d404da9

vasqu added this pull request to the merge queue Apr 27, 2026

Merged via the queue into main with commit 5d24d8c Apr 27, 2026
22 of 23 checks passed

vasqu deleted the remove-hardcoded-rel-pos-gemma4 branch April 27, 2026 13:05

evalstate mentioned this pull request Apr 28, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#41

Open

vasqu merged 9 commits into main from remove-hardcoded-rel-pos-gemma4
