@JJJYmmm JJJYmmm commented Sep 28, 2025

Purpose

fix QwenLM/Qwen3-VL#1523

Test Plan

Tested with the following command:

python3 -m vllm.entrypoints.openai.api_server \
  --model /path/to/Qwen3-VL-235B-A22B-Instruct \
  --served-model-name Qwen3-VL-235B-A22B-Instruct \
  --tensor-parallel-size 4 \
  --pipeline-parallel-size 2 \
  --mm-encoder-tp-mode data \
  --limit-mm-per-prompt.video 0 \
  --mm-processor-cache-type shm \
  --enable-expert-parallel \
  --host 0.0.0.0 \
  --port 22002 \
  --dtype bfloat16 \
  --gpu-memory-utilization 0.95 \
  --distributed-executor-backend mp \
  --max-model-len 40960
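Once the server from the launch command above is up, a quick sanity request can confirm that the model loads and serves. This is a minimal standard-library sketch, not part of the original test plan; the host, port, and served model name mirror the launch command and are assumptions if your deployment differs.

```python
import json
import urllib.request

def build_chat_request(host: str, port: int, model: str, prompt: str):
    """Build an OpenAI-compatible chat completion request for the vLLM server."""
    url = f"http://{host}:{port}/v1/chat/completions"
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 32,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )

req = build_chat_request("0.0.0.0", 22002, "Qwen3-VL-235B-A22B-Instruct", "Hello")
print(req.full_url)  # http://0.0.0.0:22002/v1/chat/completions

# Sending the request requires the server launched above to be running:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```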

Test Result

before

(EngineCore_DP0 pid=9735) ERROR 09-28 20:05:29 [core.py:712] param = params_dict[name]
(EngineCore_DP0 pid=9735) ERROR 09-28 20:05:29 [core.py:712] ~~~~~~~~~~~^^^^^^
(EngineCore_DP0 pid=9735) ERROR 09-28 20:05:29 [core.py:712] KeyError: 'layers.0.mlp.experts.w2_weight'

now fixed!

Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
@JJJYmmm JJJYmmm requested a review from sighingnow as a code owner September 28, 2025 14:23
@JJJYmmm JJJYmmm changed the title [Bugfix] fix Qwen3VLMoe load when pp > 2 [Bugfix] fix Qwen3VLMoe load when pp > 1 Sep 28, 2025
@mergify mergify bot added the qwen Related to Qwen models label Sep 28, 2025
@gemini-code-assist gemini-code-assist bot left a comment
Code Review

This pull request addresses a KeyError that occurs when loading Qwen3VLMoe models with pipeline parallelism (pp > 1). The issue stems from the is_pp_missing_parameter check being placed inside the non-fused-experts branch only, so it was skipped for fused experts. The fix relocates the check to the common path before the branch between fused and non-fused experts, ensuring that weights belonging to experts on other pipeline stages are correctly skipped. The change is logical, well-targeted, and effectively fixes the reported bug.
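The relocation described above can be illustrated with a minimal, self-contained sketch. The helper and loop below are simplified stand-ins for vLLM's actual loader code (the real is_pp_missing_parameter and load_weights have different signatures); the point is only the control flow: the pipeline-parallel ownership check must run on the common path, before the fused/non-fused branch, or fused-expert weights for layers on other stages hit the KeyError from the trace.

```python
def is_pp_missing_parameter(name: str, local_layers: set[int]) -> bool:
    """Simplified stand-in: True if the named layer lives on another PP stage."""
    if name.startswith("layers."):
        layer_idx = int(name.split(".")[1])
        return layer_idx not in local_layers
    return False

def load_expert_weights(weights, params_dict, local_layers, fused_experts=True):
    loaded = []
    for name, tensor in weights:
        # The fix: skip other-stage weights here, on the common path.
        # Previously only the non-fused branch performed this check, so
        # fused-expert weights reached the params_dict lookup below and
        # raised KeyError for layers owned by other pipeline stages.
        if is_pp_missing_parameter(name, local_layers):
            continue
        if fused_experts:
            _ = params_dict[name]  # lookup that raised KeyError before the fix
        loaded.append(name)
    return loaded

# This stage owns only layer 1, so the layer-0 expert weight must be skipped.
weights = [
    ("layers.0.mlp.experts.w2_weight", "t0"),
    ("layers.1.mlp.experts.w2_weight", "t1"),
]
params_dict = {"layers.1.mlp.experts.w2_weight": "p1"}
print(load_expert_weights(weights, params_dict, {1}))
# → ['layers.1.mlp.experts.w2_weight']
```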

@Isotr0py Isotr0py left a comment
Thanks for fixing!

@Isotr0py Isotr0py enabled auto-merge (squash) September 28, 2025 16:09
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 28, 2025
@Isotr0py Isotr0py added this to the v0.11.0 Cherry Picks milestone Sep 28, 2025
@Isotr0py Isotr0py merged commit 471997a into vllm-project:main Sep 28, 2025
54 checks passed
baonudesifeizhai pushed a commit to baonudesifeizhai/vllm that referenced this pull request Sep 28, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
simon-mo pushed a commit that referenced this pull request Sep 29, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
pdasigi pushed a commit to pdasigi/vllm that referenced this pull request Oct 2, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
shyeh25 pushed a commit to shyeh25/vllm that referenced this pull request Oct 14, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>

Development

Successfully merging this pull request may close these issues.

Error in qwen3_vl_moe.py during multi-node, multi-GPU deployment of Qwen3vl

2 participants