
Conversation

@jeejeelee (Collaborator) commented Nov 10, 2025

Purpose

Part of #23474

This PR first removes the LoRA extra vocab code from most models (temporarily keeping llama and mixtral so the existing LoRA tests still pass); the remaining LoRA extra vocab code will be removed in subsequent PRs.
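For illustration, a minimal before/after sketch of the pattern being removed, assuming a llama-style model layout (exact names and arguments vary per model, so treat this as an approximation rather than the actual diff):

```python
# Inside a model's __init__ (simplified sketch).
from vllm.model_executor.layers.vocab_parallel_embedding import (
    VocabParallelEmbedding)

# Before: the vocab is grown by lora_extra_vocab_size and the original
# size is passed separately via org_num_embeddings.
self.vocab_size = config.vocab_size
if lora_config is not None:
    self.vocab_size += lora_config.lora_extra_vocab_size
self.embed_tokens = VocabParallelEmbedding(
    self.vocab_size,
    config.hidden_size,
    org_num_embeddings=config.vocab_size,
)

# After: no extra-vocab handling; the layer keeps its default vocab
# padding (see the padding discussion later in this thread).
self.embed_tokens = VocabParallelEmbedding(
    config.vocab_size,
    config.hidden_size,
)
```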

PS: The entire work is planned to be completed this week.

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing a test command.
  • The test results, such as pasting a before/after results comparison or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify bot added the llama (Related to Llama models), qwen (Related to Qwen models), and speculative-decoding labels Nov 10, 2025
@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request is a significant step towards simplifying the codebase by removing the logic for LoRA extra vocabulary. The changes are extensive, touching many model files, and appear to be mostly correct and consistent with the goal of the refactoring. However, I've identified a recurring critical issue in several files that will lead to a TypeError at runtime due to incorrect tuple unpacking syntax. These issues need to be addressed to ensure the models function correctly.
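The review doesn't quote the offending lines; purely as a hypothetical illustration of this class of bug, a call site that still unpacks two values after a helper has been changed to return a single object fails only at runtime:

```python
# Hypothetical illustration only -- not the actual code from this PR.
class DummyEmbedding:
    def __init__(self, vocab_size: int, hidden_size: int):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size

def make_embedding(vocab_size: int, hidden_size: int) -> DummyEmbedding:
    # After a refactor this returns a single object, where it previously
    # returned a (module, extra_vocab_size) pair.
    return DummyEmbedding(vocab_size, hidden_size)

embedding = make_embedding(32000, 4096)         # correct post-refactor usage
embedding, extra = make_embedding(32000, 4096)  # stale tuple unpacking
# TypeError: cannot unpack non-iterable DummyEmbedding object
```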

@chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.


@jeejeelee added the ready (ONLY add when PR is ready to merge/full CI is needed) label Nov 10, 2025
@WoosukKwon (Collaborator) left a comment


Just to double check: We still keep the default padding for vocab sizes and only remove additional padding from LoRA, right?

@jeejeelee (Collaborator, Author)

> Just to double check: We still keep the default padding for vocab sizes and only remove additional padding from LoRA, right?

Yes

@WoosukKwon (Collaborator)

@jeejeelee Dumb question: Can you explain more? When skimming the code briefly, I thought this PR removed default padding too. IIRC, the vocab size should be padded for TP > 1 (or maybe for other reasons).

@jeejeelee (Collaborator, Author)

@WoosukKwon This is a great question, and I apologize for not describing it clearly in the PR.

This PR does not change the default vocab padding size; it simply no longer passes padding_size explicitly. In VocabParallelEmbedding, padding_size defaults to DEFAULT_VOCAB_PADDING_SIZE, as shown in the sketch below.

qwen2 and llama are the two cases to compare: qwen2 already uses the new pattern, while llama still uses the old one.

The current PR does not modify models like llama, in order to keep the CI tests passing; they will be migrated in subsequent PRs.
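A minimal sketch of where that default lives, assuming the current vocab_parallel_embedding module (the exact value and signatures may differ slightly):

```python
# Simplified from vllm/model_executor/layers/vocab_parallel_embedding.py.
DEFAULT_VOCAB_PADDING_SIZE = 64

def pad_vocab_size(vocab_size: int,
                   pad_to: int = DEFAULT_VOCAB_PADDING_SIZE) -> int:
    """Round the vocab size up to a multiple of pad_to."""
    return ((vocab_size + pad_to - 1) // pad_to) * pad_to

# VocabParallelEmbedding(num_embeddings, embedding_dim, ...,
#                        padding_size=DEFAULT_VOCAB_PADDING_SIZE)
# so a model that stops passing padding_size explicitly still gets the
# default padding, e.g. pad_vocab_size(32003) == 32064, which keeps the
# padded vocab evenly shardable across tensor-parallel ranks.
```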

@WoosukKwon (Collaborator) left a comment


LGTM. Thanks for the explanation!

@WoosukKwon merged commit 9d1c474 into vllm-project:main Nov 11, 2025
54 checks passed
@jeejeelee deleted the remove-lora-extra-vocab branch November 11, 2025 23:16
fangyuchu pushed a commit to fangyuchu/vllm that referenced this pull request Nov 12, 2025
geodavic pushed a commit to geodavic/vllm that referenced this pull request Nov 16, 2025

Labels

llama (Related to Llama models), qwen (Related to Qwen models), ready (ONLY add when PR is ready to merge/full CI is needed), speculative-decoding
