refactor(mtp): extract BaseMTPModel mixin shared by existing MTP draft models by sufubao · Pull Request #1337 · ModelTC/LightLLM

sufubao · 2026-06-09T00:16:55Z

What

Extracts a small BaseMTPModel mixin that consolidates the draft-model wiring duplicated across the existing MTP models — deepseek_mtp, mistral_mtp, glm4_moe_lite_mtp, and qwen3_moe_mtp.

Each of those draft models repeated the same setup: reuse the main model's req/mem managers and rope caches, pop the main_model / mtp_previous_draft_models kwargs before the base __init__, and mark themselves as draft models. This moves that shared logic into one place.

Why

A behaviour-preserving cleanup (+8/−99 across the four models) that removes duplication and gives a single home for the is_mtp_draft_model marker. It is a preparatory refactor split out of a larger Qwen3.5 / Qwen3.5-MoE MTP change so the follow-up feature PR stays focused on the new capability.

Changes

New lightllm/models/base_mtp_model.py — the BaseMTPModel mixin, plus a unit test for its wiring.
deepseek_mtp / mistral_mtp / glm4_moe_lite_mtp / qwen3_moe_mtp model classes now mix in BaseMTPModel (mixed in before the concrete base model so the shared overrides win via MRO).

Testing

pre-commit (black 21.12b0 + flake8 6.1.0) clean.
unit_tests/models/test_base_mtp_model_mixin.py covers the mixin's kwargs-popping, manager/rope sharing, and the is_mtp_draft_model marker.

…t models Consolidate the duplicated draft-model wiring across the deepseek / mistral / glm4_moe_lite / qwen3_moe MTP models into a single BaseMTPModel mixin: draft models reuse the main model's req/mem managers and rope caches, pop the main_model / previous-draft-models kwargs before the base __init__, and carry the is_mtp_draft_model marker. Behaviour-preserving (+8/-99 across the four models); no new dependencies. Includes a unit test for the mixin wiring.

gemini-code-assist

Code Review

This pull request introduces a new BaseMTPModel mixin class to consolidate duplicated initialization logic, manager sharing, and ROPE cache reuse across multiple MTP draft models (DeepSeek, GLM4, Mistral, and Qwen3). The review feedback correctly identifies a critical issue with the cooperative multiple inheritance (MRO) chain in BaseMTPModel._init_custom, which overrides the method without calling super()._init_custom(). Suggestions are provided to call super()._init_custom() and to update the unit tests using a dummy subclass to properly test this cooperative inheritance behavior.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

…ft_model (revert #1337) (#1339)

gemini-code-assist Bot reviewed Jun 9, 2026

View reviewed changes

ModelTC deleted a comment from gemini-code-assist Bot Jun 9, 2026

sufubao merged commit e03ef9a into ModelTC:main Jun 9, 2026
1 check passed

sufubao mentioned this pull request Jun 9, 2026

feat(qwen3_5_mtp): Qwen3.5 / Qwen3.5-MoE MTP speculative decoding #1338

Open

sufubao deleted the qw35_mtp_refactor branch June 9, 2026 01:01

sufubao mentioned this pull request Jun 9, 2026

revert(mtp): drop shared BaseMTPModel base, keep per-model is_mtp_draft_model (revert #1337) #1339

Merged

sufubao added a commit that referenced this pull request Jun 9, 2026

revert(mtp): drop shared BaseMTPModel base, keep per-model is_mtp_dra…

da9dfb8

…ft_model (revert #1337) (#1339)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(mtp): extract BaseMTPModel mixin shared by existing MTP draft models#1337

refactor(mtp): extract BaseMTPModel mixin shared by existing MTP draft models#1337
sufubao merged 1 commit into
ModelTC:mainfrom
sufubao:qw35_mtp_refactor

sufubao commented Jun 9, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sufubao commented Jun 9, 2026

What

Why

Changes

Testing

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant