add support for MiMo-V2-Flash by n1ck-guo · Pull Request #1718 · intel/auto-round

n1ck-guo · 2026-04-22T05:53:48Z

Description

Please briefly describe your main changes, the motivation.

Type of Change

Related Issues

Fixes or relates to #

Checklist Before Submitting

My code has been tested locally.
Documentation has been updated as needed.
New or updated tests are included where applicable.

Signed-off-by: n1ck-guo <heng.guo@intel.com>

Copilot

Pull request overview

Adds compatibility patches to support loading and running MiMo-V2-Flash (and some legacy remote-code RoPE behaviors) under newer `transformers` releases, and improves FP8 block dequantization handling when scale tensors are over-provisioned.

Changes:

- Relax FP8 block scale shape assumptions by rejecting undersized scales and padding weights when scales are over-provisioned.
- Apply model-instance monkey patches immediately after `llm_load_model()` loads/evals the model.
- Add `transformers` compatibility shims for legacy RoPE default init and MiMo-V2-Flash attention helper call signatures.
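
The FP8 scale handling described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the code in `auto_round/utils/weight_handler.py`: the function names `plan_block_padding` and `dequant_blockwise`, the default 128x128 block size, the zero padding value, and the final crop back to the original shape are all hypothetical choices made for the example.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def plan_block_padding(
    weight_shape: Tuple[int, int],
    scale_shape: Tuple[int, int],
    block_size: Tuple[int, int] = (128, 128),  # assumed default, not from the PR
) -> Tuple[int, int]:
    """Return (pad_rows, pad_cols) so the weight spans every scale block.

    Raises ValueError when the scale grid is too small to cover the weight.
    """
    rows, cols = weight_shape
    block_rows, block_cols = block_size
    scale_rows, scale_cols = scale_shape
    # Minimum number of blocks needed to cover the weight in each dimension.
    need_rows = math.ceil(rows / block_rows)
    need_cols = math.ceil(cols / block_cols)
    if scale_rows < need_rows or scale_cols < need_cols:
        raise ValueError(
            f"scale grid {scale_shape} is smaller than the required "
            f"({need_rows}, {need_cols}) for weight {weight_shape}"
        )
    # An over-provisioned grid implies a larger padded weight.
    return scale_rows * block_rows - rows, scale_cols * block_cols - cols


def dequant_blockwise(
    weight: Matrix, scale: Matrix, block_size: Tuple[int, int]
) -> Matrix:
    """Multiply each weight element by the scale of the block it falls in."""
    rows, cols = len(weight), len(weight[0])
    pad_rows, pad_cols = plan_block_padding(
        (rows, cols), (len(scale), len(scale[0])), block_size
    )
    block_rows, block_cols = block_size
    # Zero-pad so the padded shape is an exact multiple of the block size.
    padded = [row + [0.0] * pad_cols for row in weight]
    padded += [[0.0] * (cols + pad_cols) for _ in range(pad_rows)]
    scaled = [
        [value * scale[i // block_rows][j // block_cols] for j, value in enumerate(row)]
        for i, row in enumerate(padded)
    ]
    # Crop the padding away again (an assumption of this sketch).
    return [row[:cols] for row in scaled[:rows]]
```

With tensors, the padding step is what would let the weight be reshaped into whole blocks and broadcast against the scale grid. The nested lists here only make the indexing explicit.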

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| `auto_round/utils/weight_handler.py` | Accept over-provisioned FP8 scale tensors by padding weights; raise early for undersized scales. |
| `auto_round/utils/model.py` | Invoke `monkey_patch_model(model)` after model load to apply instance-level compatibility patches. |
| `auto_round/utils/common.py` | Add RoPE default init compatibility, patch `_init_weights` for legacy RotaryEmbedding, and patch MiMo attention helper. |
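
As a rough illustration of the instance-level patching and call-signature shim summarized above, the following sketch patches a single object right after it is "loaded". Everything in it is hypothetical: `LegacyAttention`, `tolerant`, `monkey_patch_instance`, and `load_model` are stand-ins invented for the example and do not reflect the actual `monkey_patch_model` implementation or the MiMo-V2-Flash remote code.

```python
import functools
import inspect


class LegacyAttention:
    """Hypothetical stand-in for a remote-code attention module."""

    def forward(self, hidden_states):
        # Legacy caller passes a keyword the current helper no longer accepts.
        return self.helper(hidden_states, legacy_flag=True)

    def helper(self, hidden_states):
        return [2 * x for x in hidden_states]


def tolerant(func):
    """Wrap func so keyword arguments it does not declare are dropped."""
    params = inspect.signature(func).parameters
    accepts_var_kw = any(
        p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()
    )

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        if not accepts_var_kw:
            kwargs = {k: v for k, v in kwargs.items() if k in params}
        return func(*args, **kwargs)

    return wrapper


def monkey_patch_instance(model):
    """Patch only this instance; the class itself stays untouched."""
    if getattr(model, "_compat_patched", False):
        return model  # idempotent: do not wrap twice
    # model.helper is a bound method, so its signature excludes self.
    model.helper = tolerant(model.helper)
    model._compat_patched = True
    return model


def load_model():
    """Hypothetical loader that applies the patch right after construction."""
    model = LegacyAttention()
    return monkey_patch_instance(model)
```

Patching the instance rather than the class keeps the shim scoped to the model that needs it, so other objects of the same class keep their original behavior.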

n1ck-guo · 2026-04-23T06:37:55Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-23T06:38:06Z

Azure Pipelines successfully started running 1 pipeline(s).

yiliu30

overall lgtm

n1ck-guo · 2026-04-24T10:07:41Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-24T10:07:50Z

Azure Pipelines successfully started running 1 pipeline(s).

n1ck-guo · 2026-04-25T10:07:58Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-25T10:08:06Z

Azure Pipelines successfully started running 1 pipeline(s).

n1ck-guo · 2026-04-27T00:46:46Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-27T00:46:56Z

Azure Pipelines successfully started running 1 pipeline(s).

n1ck-guo · 2026-04-27T01:47:26Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-27T01:47:35Z

Azure Pipelines successfully started running 1 pipeline(s).

n1ck-guo · 2026-04-27T04:40:38Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-27T04:40:47Z

Azure Pipelines successfully started running 1 pipeline(s).

add support for MiMo-V2-Flash

1e725fd

Signed-off-by: n1ck-guo <heng.guo@intel.com>

Copilot AI review requested due to automatic review settings April 22, 2026 05:53

Copilot started reviewing on behalf of n1ck-guo April 22, 2026 05:54

Copilot AI reviewed Apr 22, 2026

Comment thread auto_round/utils/common.py

Comment thread auto_round/utils/weight_handler.py

n1ck-guo requested a review from xin3he April 23, 2026 06:38

n1ck-guo requested a review from yiliu30 April 23, 2026 06:38

n1ck-guo mentioned this pull request Apr 23, 2026

RuntimeError: Error(s) in loading state_dict for FP8Linear #1268

Closed

update

d268610

Signed-off-by: n1ck-guo <heng.guo@intel.com>

yiliu30 approved these changes Apr 24, 2026

Comment thread auto_round/utils/common.py

Comment thread auto_round/utils/common.py

Merge branch 'main' into hengguo/support_mimo

e4884b6

Merge branch 'main' into hengguo/support_mimo

fbfd6bf

Merge branch 'main' into hengguo/support_mimo

f6840fd

n1ck-guo merged commit e62d29d into main Apr 27, 2026
42 checks passed

n1ck-guo deleted the hengguo/support_mimo branch April 27, 2026 05:18

lvliang-intel pushed a commit that referenced this pull request May 12, 2026

add support for MiMo-V2-Flash (#1718)

a4f9bf9

Signed-off-by: n1ck-guo <heng.guo@intel.com>


3 participants