Fix bnb 4bit/8bit quantization drop chunked tensors bug by kaixuanliu · Pull Request #46210 · huggingface/transformers

kaixuanliu · 2026-05-26T06:21:02Z

What does this PR do?

Bnb4bitQuantize.convert / Bnb8bitQuantize.convert quantized only the first entry of input_dict and returned {full_layer_name: value}, silently dropping any extra tensors produced by an upstream one-to-many WeightConverter (e.g. a Chunk op splitting a fused weight). Those targets were never loaded and kept their random init, which triggers the assertion error:
assert module.weight.shape[1] == 1

Fix:

Iterate over every (param_name, value) pair in input_dict and quantize each one against its own module.
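
A minimal sketch of the difference between the two behaviors, using plain Python lists in place of tensors. The names fake_quantize, convert_first_only and convert_all are hypothetical and are not the actual transformers code; they only illustrate the "first entry only" pattern described above versus iterating over the whole input_dict.

```python
from typing import Dict, List

Tensor = List[float]  # stand-in for a real tensor (assumption for illustration)


def fake_quantize(value: Tensor) -> Tensor:
    """Hypothetical placeholder for a per-module quantization step."""
    return [round(x) for x in value]


def convert_first_only(input_dict: Dict[str, Tensor]) -> Dict[str, Tensor]:
    """Old behavior as described: only the first entry is quantized and returned."""
    name, value = next(iter(input_dict.items()))
    return {name: fake_quantize(value)}


def convert_all(input_dict: Dict[str, Tensor]) -> Dict[str, Tensor]:
    """Fixed behavior as described: every (param_name, value) pair is quantized."""
    return {name: fake_quantize(value) for name, value in input_dict.items()}


# Three tensors, as an upstream one-to-many converter might produce them.
chunked = {
    "attn.q_proj.weight": [0.2, 1.7],
    "attn.k_proj.weight": [2.4, 3.1],
    "attn.v_proj.weight": [4.6, 5.2],
}

dropped = set(chunked) - set(convert_first_only(chunked))
kept = set(convert_all(chunked))
print("dropped by first-only convert:", sorted(dropped))
print("kept by iterating convert:", sorted(kept))
```

In this toy version the first-only variant loses every key after the first one, which mirrors the silent drop described in the PR.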

Repro

RUN_SLOW=1 pytest tests/models/hrm_text/test_modeling_hrm_text.py::HrmTextModelTest::test_flash_attn_2_fp32_ln

hrm_text is affected because, on load, it chunks attn.gqkv_proj → gate_proj/q_proj/k_proj/v_proj and mlp.gate_up_proj → gate_proj/up_proj. Without the fix, q/k/v_proj and mlp.up_proj show up as MISSING in the load report and the 4bit forward pass hits the assertion. The test passes after this fix.
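
To see why the chunked targets end up MISSING, here is a rough sketch of a one-to-many chunk followed by a "first entry only" load. chunk_rows is a hypothetical helper, and splitting into equal-sized chunks is an assumption made for illustration; the real split sizes in hrm_text may differ.

```python
from typing import Dict, List

Matrix = List[List[float]]  # stand-in for a 2D weight (assumption for illustration)


def chunk_rows(rows: Matrix, names: List[str]) -> Dict[str, Matrix]:
    """Hypothetical helper: split a fused weight into equal row chunks, one per target."""
    size, remainder = divmod(len(rows), len(names))
    if remainder:
        raise ValueError("fused weight does not divide evenly across targets")
    return {name: rows[i * size:(i + 1) * size] for i, name in enumerate(names)}


# Toy fused weight with 8 rows, split into the four targets named in the PR.
fused = [[float(r)] * 2 for r in range(8)]
targets = chunk_rows(fused, ["gate_proj", "q_proj", "k_proj", "v_proj"])

# Simulate a loader that keeps only the first produced tensor.
loaded = dict(list(targets.items())[:1])
missing = sorted(set(targets) - set(loaded))
print("MISSING:", missing)
```

With the iterating fix, loaded would contain all four targets and the missing list would be empty.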

Who can review?

@SunMarc pls help review, thx!

SunMarc

Nice, thanks a lot for this !

HuggingFaceDocBuilderDev · 2026-05-27T10:54:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…46210) Fix bnb 4bit/8bit quantization dropping chunked tensors during weight load Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

Fix bnb 4bit/8bit quantization dropping chunked tensors during weight…

28d91cf

… load Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

kaixuanliu changed the title ~~Fix bnb 4bit/8bit quantization dropping chunked tensors during weight…~~ Fix bnb 4bit/8bit quantization drop chunked tensors bug May 26, 2026

SunMarc approved these changes May 27, 2026

SunMarc enabled auto-merge May 27, 2026 10:41

SunMarc added this pull request to the merge queue May 27, 2026

Merged via the queue into huggingface:main with commit 212863b May 27, 2026
31 checks passed

kaixuanliu deleted the bnb-quantize branch May 28, 2026 03:27

SunMarc merged 1 commit into huggingface:main from kaixuanliu:bnb-quantize

3 participants
