Your current environment
🐛 Describe the bug
Running

```
vllm serve Qwen/Qwen3-30B-A3B --model-impl=transformers
```

results in the error:

```
File ".../vllm/model_executor/models/transformers.py", line 66, in vllm_flash_attention_forward
    self_attn = attention_instances[module.layer_idx]
TypeError: 'NoneType' object is not subscriptable
```
I believe this bug is specific to the MoE structure, since the dense Qwen3 models can be served normally with the Transformers backend.
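For illustration, here is a minimal sketch of where the failure occurs and how a defensive guard could surface a clearer error. Only the function name and the failing line come from the traceback above; the signature, the guard, and the error message are assumptions made for the sketch, not vLLM's actual implementation:

```python
# Hypothetical sketch, not vLLM's actual code: only the function name and
# the indexing line are taken from the traceback above.
def vllm_flash_attention_forward(module, query, key, value,
                                 attention_instances=None, **kwargs):
    # For MoE models like Qwen3-30B-A3B, attention_instances apparently
    # never gets populated, so indexing it raises:
    #   TypeError: 'NoneType' object is not subscriptable
    if attention_instances is None:
        raise RuntimeError(
            "attention_instances was never initialized; the Transformers "
            "backend may not wire up this MoE architecture yet"
        )
    self_attn = attention_instances[module.layer_idx]
    # The downstream call below is assumed for completeness of the sketch.
    return self_attn.forward(query, key, value, **kwargs)
```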
Before submitting a new issue...
- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.