Conversation

@lgeiger lgeiger (Contributor) commented Nov 11, 2025

This slightly reduces the overhead of fast_pos_embed_interpolate by using an in-place multiplication. Follow-up to #26647.

In a simple benchmark this speeds up the function by ~20% for bfloat16 and ~30% for float32. Nothing major, but it's still a nice win.
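In code, the change boils down to the following pattern (a minimal sketch, not the actual vLLM source; the tensor names, shapes, and scale here are hypothetical):

```python
import torch

# Hypothetical interpolation weights of the kind fast_pos_embed_interpolate
# produces; names and shapes are illustrative only.
weights = torch.rand(4, 1024, dtype=torch.float32)
scale = torch.rand(4, 1)

# Before: out-of-place multiply allocates a fresh tensor for the product.
# weights = weights * scale

# After: in-place multiply reuses the existing buffer, saving one
# allocation and one tensor's worth of memory traffic per call.
weights.mul_(scale)
```

The in-place form is only safe because, as the review below notes, the tensor is not used anywhere else after this point.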

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
@lgeiger lgeiger requested a review from sighingnow as a code owner November 11, 2025 01:33
@mergify mergify bot added the qwen (Related to Qwen models) label Nov 11, 2025
@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces a minor but effective optimization in the fast_pos_embed_interpolate function by replacing a standard multiplication with an in-place multiplication. This change correctly reduces memory overhead by avoiding the creation of an intermediate tensor, which, as noted in the description, provides a nice performance improvement. The change is safe as the modified tensor is not used elsewhere. This is a good, clean optimization.
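A measurement along these lines can be reproduced with torch.utils.benchmark (a hypothetical sketch, not the benchmark behind the numbers in the description; the tensor shape is made up):

```python
import torch
from torch.utils.benchmark import Timer

# Compare out-of-place vs. in-place scalar multiplication for the two
# dtypes mentioned in the PR description. A multiplier of 1.0 keeps the
# values stable across repeated in-place calls without changing the timing.
for dtype in (torch.bfloat16, torch.float32):
    x = torch.rand(1024, 1024, dtype=dtype)
    out_of_place = Timer("y = x * 1.0", globals={"x": x}).timeit(1000)
    in_place = Timer("x.mul_(1.0)", globals={"x": x}).timeit(1000)
    print(f"{dtype}: in-place is {out_of_place.mean / in_place.mean:.2f}x faster")
```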

@ywang96 ywang96 added the ready (ONLY add when PR is ready to merge/full CI is needed) label Nov 11, 2025
@ywang96 ywang96 enabled auto-merge (squash) November 11, 2025 08:08
@ywang96 ywang96 merged commit 9973e6e into vllm-project:main Nov 11, 2025
54 checks passed
@lgeiger lgeiger deleted the qwen3-faster-pos-interp branch November 11, 2025 10:53
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025
…roject#28434)

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
