Conversation

@WoosukKwon WoosukKwon commented Sep 13, 2025

Currently, broadcast_pp_output complicates the logic in the common non-PP path.
This PR simplifies the logic a bit.

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
@mergify mergify bot added the v1 label Sep 13, 2025
@WoosukKwon WoosukKwon added the ready label Sep 13, 2025

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request provides a nice simplification to the execute_model method in GPUModelRunner. By moving the broadcast_pp_output flag to the __init__ method and restructuring the logic to handle the common (non-broadcast) and rare (broadcast) paths separately, the code becomes much clearer and easier to follow. This refactoring also correctly handles kv_connector_output for pooling models and appears to fix a latent bug where logits were not updated on non-last pipeline parallel ranks after being broadcast. The changes are well-contained and improve both readability and correctness. I have no further suggestions.
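
For readers following along, the shape of the change can be summarized with a minimal, runnable sketch. All names here (ModelRunnerSketch, _run_model, _broadcast_logits, _make_output) are hypothetical stand-ins rather than the actual vLLM GPUModelRunner API: the flag is computed once in __init__, the common path returns early, and the broadcast handling stays on the rare path.

```python
from typing import Any


class ModelRunnerSketch:
    """Hypothetical sketch of the refactor described above (not vLLM code)."""

    def __init__(self, broadcast_pp_output: bool) -> None:
        # Computed once at construction time; in the real runner this would be
        # derived from the parallel/pipeline configuration.
        self.broadcast_pp_output = broadcast_pp_output

    def execute_model(self, scheduler_output: Any) -> dict:
        logits = self._run_model(scheduler_output)
        if not self.broadcast_pp_output:
            # Common (non-PP) path: a single early return, no broadcast logic.
            return self._make_output(logits)
        # Rare path: broadcast logits so non-last PP ranks see the same values.
        logits = self._broadcast_logits(logits)
        return self._make_output(logits)

    def _run_model(self, scheduler_output: Any) -> list[float]:
        return [0.0, 1.0]  # stand-in for the model's logits tensor

    def _broadcast_logits(self, logits: list[float]) -> list[float]:
        return logits  # stand-in for a tensor-dict broadcast across PP ranks

    def _make_output(self, logits: list[float]) -> dict:
        return {"logits": logits}
```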

            model_output_broadcast_data = {
                "logits": logits.contiguous(),
            } if logits is not None else {}
        else:

@njhill njhill Sep 13, 2025

just a thought, could we put the else logic in a different function to be even less intrusive to the common path?
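
A rough sketch of what that suggestion could look like, reusing the hypothetical stubs from the sketch above (again, illustrative names only, not the actual vLLM code): the rare PP-broadcast branch moves into its own helper so the common path stays a single early return.

```python
class ModelRunnerSketchV2(ModelRunnerSketch):
    """Variant following the suggestion: the rare PP-broadcast branch lives in
    its own helper, keeping execute_model's common path untouched."""

    def execute_model(self, scheduler_output: Any) -> dict:
        logits = self._run_model(scheduler_output)
        if not self.broadcast_pp_output:
            return self._make_output(logits)  # common path, unchanged
        return self._execute_with_pp_broadcast(logits)

    def _execute_with_pp_broadcast(self, logits: list[float]) -> dict:
        # Rare path, isolated so it no longer clutters execute_model.
        logits = self._broadcast_logits(logits)
        return self._make_output(logits)
```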

@WoosukKwon WoosukKwon merged commit 3e903b6 into main Sep 14, 2025
58 checks passed
@WoosukKwon WoosukKwon deleted the woosuk/minor-simpl-pool branch September 14, 2025 00:41
dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
bbartels pushed a commit to bbartels/vllm that referenced this pull request Sep 15, 2025
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
Signed-off-by: bbartels <benjamin@bartels.dev>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
Labels: ready, v1