Remove state init by grimoire · Pull Request #4604 · InternLM/lmdeploy

grimoire · 2026-05-20T10:53:42Z

fill conv state in model when forward
update gate to ignore init state of gdr
remove init cache, it take too much times
l2 norm before repeat interleave

Copilot

Pull request overview

This PR removes explicit state-cache initialization for the GatedDelta/SSM path by making the model handle “init state” behavior during forward, and optimizes the kv head replication path by applying Q/K L2-normalization before repeat_interleave to reduce overhead.

Changes:

Add init-state metadata (is_init, is_init_token) to GatedDeltaMeta, zero conv initial states on init, and mask GDR gate for init tokens.
Move kv_ratio replication logic into GatedDelta (and add a helper that normalizes before replication).
Remove StateCacheEngine.init_caches and its call site during model forward.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
lmdeploy/pytorch/nn/gated_delta.py	Adds init-token handling and moves kv replication + (optional) Q/K L2-norm before replication into the GatedDelta wrapper.
lmdeploy/pytorch/models/qwen3_5.py	Stops repeating Q/K in the model and passes `kv_ratio` into `GatedDelta`.
lmdeploy/pytorch/engine/model_agent/agent.py	Removes the state cache initialization call during forward.
lmdeploy/pytorch/engine/cache_engine.py	Removes `StateCacheEngine.init_caches` implementation.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

RunningLeon · 2026-05-22T05:01:04Z


+        self.is_init = None
+        self.is_init_token = None
+        if not self.is_decoding:


will it work for dp>1?

With self.is_init = (attn_metadata.kv_seqlens - attn_metadata.q_seqlens) == 0 condition, I think it should be ok?

CUHKSZzxy · 2026-05-29T06:44:46Z

        beta = b.sigmoid()
        # If the model is loaded in fp16, without the .float() here, A might be -inf
        g = self.get_A_log_exp() * F.softplus(a.float() + self.dt_bias)
-        if self.kv_ratio > 1:


Should we update in qwen3 next similarly?

https://github.com/InternLM/lmdeploy/blob/main/lmdeploy/pytorch/models/qwen3_next.py#L190

RunningLeon

LGTM

grimoire added 2 commits May 20, 2026 16:29

remove state init

c44f892

change norm order

ac7933b

Copilot AI review requested due to automatic review settings May 20, 2026 10:53

Copilot started reviewing on behalf of grimoire May 20, 2026 10:54 View session

Copilot AI reviewed May 20, 2026

View reviewed changes

Comment thread lmdeploy/pytorch/nn/gated_delta.py Outdated

Comment thread lmdeploy/pytorch/models/qwen3_5.py Outdated

lint

9fd80f8

lvhan028 requested review from CUHKSZzxy and RunningLeon May 21, 2026 07:26

lvhan028 added the improvement label May 21, 2026

RunningLeon reviewed May 22, 2026

View reviewed changes

CUHKSZzxy approved these changes May 29, 2026

View reviewed changes

update qwen3 next

8e54716

RunningLeon approved these changes May 29, 2026

View reviewed changes

lvhan028 merged commit ba0841c into InternLM:main May 29, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove state init#4604

Remove state init#4604
lvhan028 merged 4 commits into
InternLM:mainfrom
grimoire:remove-state-init

grimoire commented May 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

RunningLeon May 22, 2026

Uh oh!

grimoire May 25, 2026

Uh oh!

CUHKSZzxy May 29, 2026

Uh oh!

RunningLeon left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

grimoire commented May 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

RunningLeon May 22, 2026

Choose a reason for hiding this comment

Uh oh!

grimoire May 25, 2026

Choose a reason for hiding this comment

Uh oh!

CUHKSZzxy May 29, 2026

Choose a reason for hiding this comment

Uh oh!

RunningLeon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants