[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B #25146

yma11 · 2025-09-18T07:08:18Z

Purpose

For models like OpenGVLab/InternVL3_5-38B, the query tensor is 4-dimensional, so unpacking its size into three values raises an error:

(Worker_TP2 pid=946) ERROR 09-15 13:21:14 [multiproc_executor.py:654]   File "/workspace/vllm/vllm/attention/layer.py", line 383, in forward
(Worker_TP2 pid=946) ERROR 09-15 13:21:14 [multiproc_executor.py:654]     bsz, q_len, _ = query.size()
(Worker_TP2 pid=946) ERROR 09-15 13:21:14 [multiproc_executor.py:654]     ^^^^^^^^^^^^^
(Worker_TP2 pid=946) ERROR 09-15 13:21:14 [multiproc_executor.py:654] ValueError: too many values to unpack (expected 3)

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

yma11 · 2025-09-18T07:09:30Z

@Isotr0py can you help review this?

gemini-code-assist

Code Review

This pull request addresses a ValueError that occurs in MultiHeadAttention when processing input query tensors with more than three dimensions, as seen with models like OpenGVLab/InternVL3_5-38B. The fix correctly extracts the batch size and sequence length by slicing the shape tuple, making the implementation more robust and compatible with higher-dimensional inputs. The change is correct and effectively resolves the bug. I've added one high-severity comment regarding an outdated docstring, which should be updated to reflect the new capability and ensure long-term maintainability.
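
To illustrate the failure mode and the slicing approach described in the review, here is a minimal sketch. It is not the vLLM code: it uses plain Python tuples as stand-ins for the result of `query.size()`, and the helper name `leading_dims` and the concrete shape values are assumptions made up for this example.

```python
def leading_dims(shape):
    """Return (batch_size, seq_len) from a shape with two or more dimensions.

    Hypothetical helper for illustration; it only slices the first two entries,
    so any trailing dimensions are ignored.
    """
    if len(shape) < 2:
        raise ValueError(f"expected at least 2 dimensions, got {len(shape)}")
    bsz, q_len = shape[:2]
    return bsz, q_len


# Assumed example shapes, standing in for query.size().
shape_3d = (2, 16, 1024)    # batch_size x seq_len x hidden_size
shape_4d = (2, 16, 8, 128)  # batch_size x seq_len x num_heads x head_size

# Three-name unpacking works only when there are exactly three dimensions.
bsz, q_len, _ = shape_3d

try:
    bsz, q_len, _ = shape_4d
    unpack_failed = False
except ValueError:
    # Same class of error as in the traceback in the PR description.
    unpack_failed = True

# Slicing the first two entries works for both layouts.
dims_3d = leading_dims(shape_3d)
dims_4d = leading_dims(shape_4d)
```

Because the slice never looks past the second entry, the same code path serves both input layouts without a branch on the number of dimensions.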

gemini-code-assist · 2025-09-18T07:09:38Z

vllm/attention/layer.py

While this change correctly handles tensors with more than 3 dimensions, the method's docstring on line 433 was not updated. It still states Input shape: batch_size x seq_len x hidden_size, which is now misleading. Inaccurate documentation for a core component like this can lead to incorrect usage and bugs in the future. Please update the docstring to reflect that the input can have more than three dimensions, for example, by mentioning that additional dimensions are allowed after seq_len.

Isotr0py

Thanks for fixing!

Isotr0py · 2025-09-18T18:46:16Z

vllm/attention/layer.py

Suggested change

-        """Input shape: batch_size x seq_len x hidden_size"""
+        """Input shape:
+        (batch_size x seq_len x hidden_size) or
+        (batch_size x seq_len x num_heads x head_size)
+        """

Let's also update the docstring.

Should we also remove the "TODO(Isotr0py): Use existing backend implementations and support FA3" comment, now that FA3 is supported by #24337?

Signed-off-by: Yan Ma <yan.ma@intel.com>

…ect#25146) Signed-off-by: Yan Ma <yan.ma@intel.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>

…ect#25146) Signed-off-by: Yan Ma <yan.ma@intel.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by: charlifu <charlifu@amd.com>

yma11 requested a review from LucasWilkinson as a code owner September 18, 2025 07:08

gemini-code-assist bot reviewed Sep 18, 2025

Isotr0py approved these changes Sep 18, 2025

yma11 added 2 commits September 19, 2025 02:47

[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B

984e5b9

Signed-off-by: Yan Ma <yan.ma@intel.com>

address comment

55f32c0

Signed-off-by: Yan Ma <yan.ma@intel.com>

yma11 force-pushed the mha-fix branch from 33df059 to 55f32c0 Compare September 19, 2025 03:31

wwl2755 mentioned this pull request Sep 19, 2025

[Bug]: internVL3.5 38B not working in vlllm 0.10.2 #25227

Closed

1 task

Merge branch 'main' into mha-fix

b6039fb

Isotr0py enabled auto-merge (squash) September 19, 2025 06:43

github-actions bot added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Sep 19, 2025

Isotr0py merged commit a684c01 into vllm-project:main Sep 19, 2025
52 checks passed

Isotr0py mentioned this pull request Sep 19, 2025

[Bug] vllm deploy InternVL3_5-241B-A28B error OpenGVLab/InternVL#1175

Open

3 tasks

debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025

[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B (vllm-proj…

84aa0d4

…ect#25146) Signed-off-by: Yan Ma <yan.ma@intel.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B (vllm-proj…

50e98a8

…ect#25146) Signed-off-by: Yan Ma <yan.ma@intel.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
