
fix packing nvfp/mxfp max_workers & extend xpu ut #1555

Merged
chensuyue merged 11 commits into main from kaihui/xpu_ut on Mar 21, 2026

Conversation

Contributor

@Kaihui-intel Kaihui-intel commented Mar 17, 2026

Description

fix packing nvfp/mxfp max_workers & extend xpu ut (covering formats, schemes, VLM, and lm_head tests)

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify): xpu ut

Related Issues

#1490

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
Copilot AI review requested due to automatic review settings March 17, 2026 08:19
pre-commit-ci bot and others added 2 commits March 17, 2026 08:19
Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

Copilot AI left a comment


Pull request overview

Fixes NVFP/MXFP packing thread worker selection and expands XPU unit tests to cover more quantization schemes and model types.

Changes:

  • Fix max_workers selection logic during NVFP/MXFP packing to avoid problematic concurrency on CUDA.
  • Update XPU tests to load quantized models with device_map="xpu" instead of "auto".
  • Add new XPU tests for multiple schemes, VLM quantization/inference, and lm_head quantization.
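The worker-selection fix described above can be sketched as follows. This is a minimal, hypothetical illustration of the idea (the helper name and signature are invented for clarity and do not appear in export_to_nvfp_mxfp.py):

```python
def choose_max_workers(requested: int, cuda_only: bool) -> int:
    """Pick the thread count used to pack quantized layers.

    Hypothetical sketch of the fix, not the actual code in
    export_to_nvfp_mxfp.py: multi-threaded packing can interleave badly
    with CUDA kernel launches, so on a CUDA-only setup we fall back to
    a single worker; otherwise the requested parallelism is used.
    """
    if cuda_only:
        return 1
    return max(1, requested)
```

With this shape of guard, packing on CUDA-only hosts stays single-threaded while CPU/XPU hosts keep the requested parallelism.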

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File: test/test_xpu/test_autoround.py
  Extends XPU tests (schemes/VLM/lm_head) and adjusts device_map usage when reloading quantized models.
File: auto_round/export/export_to_autoround/export_to_nvfp_mxfp.py
  Fixes the max_workers condition for packing to avoid unintended multi-thread packing on CUDA-only setups.
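The test-side device_map change amounts to pinning the reloaded quantized model to the XPU instead of letting automatic placement decide. A hypothetical one-line diff, assuming the tests reload via transformers' `from_pretrained` (the variable names are illustrative):

```diff
-model = AutoModelForCausalLM.from_pretrained(quantized_model_path, device_map="auto")
+model = AutoModelForCausalLM.from_pretrained(quantized_model_path, device_map="xpu")
```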
Comments suppressed due to low confidence (1)

test/test_xpu/test_autoround.py:11

  • save_tiny_model is imported here but never used in this test module. Please remove it or use it (e.g., to build a tiny model for the new large-model XPU tests) to avoid dead imports and keep intent clear.
from ..helpers import get_model_path

chensuyue and others added 2 commits March 19, 2026 09:12
Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
chensuyue and others added 4 commits March 20, 2026 21:06
Signed-off-by: chensuyue <suyue.chen@intel.com>
Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
@chensuyue chensuyue merged commit 79fa1a9 into main Mar 21, 2026
30 checks passed
@chensuyue chensuyue deleted the kaihui/xpu_ut branch March 21, 2026 13:18
