[V0 Deprecation] Remove placeholder attn #25510
Conversation
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Code Review

This pull request removes the placeholder attention backend and the associated `is_attention_free` flag, which is a good cleanup as it's no longer needed for Mamba models in V1. The changes are consistent across the modified files. However, I've found a broken test case that needs to be addressed to ensure the integrity of the test suite.
Thanks for doing this!
LGTM, thanks for the work!
Nice
Purpose
Remove placeholder attention backend. It is no longer needed for Mamba models in V1, since each mamba/linear attention layer has its own "real" attention backend.
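For illustration, here is a minimal Python sketch of the before/after shape of this change; all names in it (`PlaceholderAttnBackend`, `LinearAttnBackend`, `MambaLayer`, `select_backend`) are hypothetical stand-ins, not the actual vLLM classes:

```python
# Hedged sketch of the idea behind this PR; names are hypothetical,
# not the real vLLM V1 API.

class LinearAttnBackend:
    """Stand-in for the "real" backend each mamba/linear attention layer owns."""
    name = "linear_attn"


class MambaLayer:
    def __init__(self):
        # In V1, each mamba/linear attention layer carries its own backend,
        # so a model-level placeholder backend is no longer needed.
        self.attn_backend = LinearAttnBackend()


def select_backend(layer):
    # Before: a model-level check along the lines of
    #     if model_config.is_attention_free: return PlaceholderAttnBackend()
    # After: simply ask the layer for its own backend.
    return layer.attn_backend


if __name__ == "__main__":
    print(select_backend(MambaLayer()).name)  # -> linear_attn
```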
Test Plan
Let's see if CI passes
Test Result
Essential Elements of an Effective PR Description Checklist
- Documentation update, such as supported_models.md and examples for a new model.