
Conversation

@whx-sjtu whx-sjtu commented Sep 15, 2025

Purpose

prefix is an init parameter of ParallelLMHead that VocabParallelEmbedding later uses when calling get_quant_method. Currently, only some models pass this parameter when initializing ParallelLMHead; omitting it can cause bugs when resolving the quantization method. This PR completes the passing of the prefix parameter for all models.
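As a rough sketch of the failure mode, the snippet below uses simplified stand-in classes (not vLLM's real implementations) named after vLLM's `QuantizationConfig`, `VocabParallelEmbedding`, and `ParallelLMHead`. The per-layer lookup logic here is illustrative only:

```python
class QuantizationConfig:
    """Stand-in: maps layer prefixes (e.g. "lm_head") to quant methods."""

    def __init__(self, per_layer_methods):
        self.per_layer_methods = per_layer_methods

    def get_quant_method(self, layer, prefix):
        # An empty prefix silently falls back to the default method --
        # the kind of bug this PR avoids for models that omitted prefix.
        return self.per_layer_methods.get(prefix, "unquantized")


class VocabParallelEmbedding:
    def __init__(self, quant_config, prefix=""):
        # The quant method is resolved from the prefix at init time.
        self.quant_method = quant_config.get_quant_method(self, prefix)


class ParallelLMHead(VocabParallelEmbedding):
    pass


quant_config = QuantizationConfig({"lm_head": "fp8"})

# Before the fix: prefix omitted, so the configured method is missed.
head_without_prefix = ParallelLMHead(quant_config)

# After the fix: prefix forwarded, so the configured method is found.
head_with_prefix = ParallelLMHead(quant_config, prefix="lm_head")

print(head_without_prefix.quant_method)  # unquantized
print(head_with_prefix.quant_method)     # fp8
```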

Test Plan

No new tests are needed; the change only forwards an existing parameter.

Test Result

All existing tests should pass.



Signed-off-by: whx-sjtu <2952154980@qq.com>
@mergify mergify bot added the deepseek, llama, qwen, gpt-oss, and speculative-decoding labels on Sep 15, 2025
@gemini-code-assist gemini-code-assist bot left a comment:
Code Review

This pull request systematically addresses a potential quantization bug by ensuring the prefix parameter is passed to ParallelLMHead across all relevant models. The changes are consistent and correctly implemented, and they make quantization-method resolution more robust. The changes look good.

Signed-off-by: whx-sjtu <2952154980@qq.com>
@whx-sjtu whx-sjtu force-pushed the add_prefix_to_llmhead branch from 3c216af to 80aea6b on September 15, 2025 at 10:03
@wangxiyuan (Contributor) commented:

The prefix is very useful in custom quantization cases.

@Isotr0py Isotr0py (Member) left a comment:

LGTM, thanks!

@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Sep 16, 2025
@Isotr0py Isotr0py added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 16, 2025
@DarkLight1337 DarkLight1337 merged commit 4a9375f into vllm-project:main Sep 17, 2025
60 checks passed
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: whx-sjtu <2952154980@qq.com>
charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: charlifu <charlifu@amd.com>
Labels
deepseek (Related to DeepSeek models), gpt-oss (Related to GPT-OSS models), llama (Related to Llama models), qwen (Related to Qwen models), ready (ONLY add when PR is ready to merge/full CI is needed), speculative-decoding
Projects
Status: Done
Development


4 participants