refine transform config api. #1607

Merged: chensuyue merged 28 commits into intel:main from lkk12014402:refine_transform_api on Mar 26, 2026

Conversation

@lkk12014402 (Contributor) commented Mar 24, 2026

#1577

Description

Refine the transform config API so that it supports str | dict | TransformConfig | None.

Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
Copilot AI review requested due to automatic review settings March 24, 2026 15:16
@lkk12014402 (Contributor, Author) commented:
TODO: add documentation

Copilot AI (Contributor) left a comment


Pull request overview

Refines the transform config API so callers can pass a str | dict | TransformConfig | None, and normalizes inputs before applying transforms.

Changes:

  • Added a shared normalization helper to validate/standardize transform config inputs.
  • Updated apply_transform to accept multiple config input types plus an optional quantization scheme.
  • Updated compressor initialization to accept and normalize transform_config directly.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.

Reviewed files:

  • auto_round/experimental/transform/helper.py: Adds _normalize_transform_config for validating and converting transform_config inputs into a normalized dict.
  • auto_round/experimental/transform/apply.py: Expands the apply_transform signature and normalizes user inputs before building TransformConfig.
  • auto_round/compressors/base.py: Adds a transform_config parameter and normalizes it during compressor initialization.

lkk12014402 and others added 9 commits March 25, 2026 10:24
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
@chensuyue chensuyue added this to the 0.12.0 milestone Mar 25, 2026
@wenhuach21 (Contributor) commented Mar 25, 2026

As also mentioned before, these two args in transform_config should not be exposed to users.

  location: str = Field(default="weight", exclude=True)

  # apply transform inside modules for nvfp4, autoround tuning etc.
  need_calibration: bool = Field(default=False, exclude=True)

@wenhuach21 (Contributor) commented Mar 25, 2026

and this one

 # required, currently only supports mxfp4
    quant_scheme: str = Field(..., description="Quantization scheme. Currently supports 'MXFP4/MXFP8'.")

Why do users need to pass the scheme again?

@wenhuach21 wenhuach21 requested a review from n1ck-guo March 25, 2026 08:32
@lkk12014402 (Contributor, Author) commented Mar 25, 2026

and this one

 # required, currently only supports mxfp4
    quant_scheme: str = Field(..., description="Quantization scheme. Currently supports 'MXFP4/MXFP8'.")

Why do users need to pass the scheme again?

We need to check which quantization schemes are supported with the transform.
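The check described here could be sketched as follows. The function name and the supported set are illustrative assumptions based on the field description above ("Currently supports 'MXFP4/MXFP8'"), not the PR's actual code:

```python
# Hypothetical validation of the quantization scheme against the transforms
# that support it; names and the supported set are assumptions.
SUPPORTED_TRANSFORM_SCHEMES = {"MXFP4", "MXFP8"}

def check_transform_scheme(quant_scheme: str) -> str:
    """Raise if the scheme is not supported with transforms; return it normalized."""
    scheme = quant_scheme.upper()
    if scheme not in SUPPORTED_TRANSFORM_SCHEMES:
        raise ValueError(
            f"Transform currently supports {sorted(SUPPORTED_TRANSFORM_SCHEMES)}, "
            f"got {quant_scheme!r}"
        )
    return scheme
```

The reviewer's point below is that a check like this belongs in the pipeline that already knows the scheme, rather than forcing users to repeat it in transform_config.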

@lkk12014402 (Contributor, Author) commented:

As also mentioned before, these two args in transform_config should not be exposed to users.

  location: str = Field(default="weight", exclude=True)

  # apply transform inside modules for nvfp4, autoround tuning etc.
  need_calibration: bool = Field(default=False, exclude=True)

The location arg determines where the Hadamard transform is applied (e.g. weight or activation); we need to set this parameter when the activation needs to be transformed.

The need_calibration arg is for calibration (e.g. nvfp4) and for tuning (iters > 200).
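For context, a minimal pydantic sketch of what the excluded fields above behave like. The field names and defaults are taken from the snippets quoted in this thread; the surrounding model shape is an assumption, and this sketch requires pydantic v2. With exclude=True the fields remain settable internally but are dropped from serialized output, which is one way to keep them out of the user-facing config:

```python
# Illustrative model only; the real TransformConfig in the PR may differ.
from pydantic import BaseModel, Field

class TransformConfig(BaseModel):
    quant_scheme: str = Field(..., description="Quantization scheme. Currently supports 'MXFP4/MXFP8'.")
    # internal knobs, hidden from serialized output via exclude=True
    location: str = Field(default="weight", exclude=True)
    need_calibration: bool = Field(default=False, exclude=True)

cfg = TransformConfig(quant_scheme="MXFP4", location="activation")
# excluded fields are omitted when dumping: {'quant_scheme': 'MXFP4'}
print(cfg.model_dump())
```

Note that exclude=True only affects serialization; the constructor still accepts the fields, which is the crux of the disagreement about whether they are "exposed to users".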

@wenhuach21 (Contributor) commented:

For API design, please take the position of users, not developers.

and this one

 # required, currently only supports mxfp4
    quant_scheme: str = Field(..., description="Quantization scheme. Currently supports 'MXFP4/MXFP8'.")

Why do users need to pass the scheme again?

We need to check which quantization schemes are supported with the transform.

I know, but this is not the right logic; you should check it elsewhere.

lkk12014402 and others added 4 commits March 25, 2026 11:58
Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
@wenhuach21 (Contributor) commented:

@chensuyue @XuehaoSun To support Hadamard, a safetensor file has been added to the main branch. I assume we will not include this file in the release package by default. Please help include it if Kaokao thinks it is a better option than generating it at runtime.

@wenhuach21 wenhuach21 self-requested a review March 26, 2026 02:07
lkk12014402 and others added 7 commits March 26, 2026 02:54
Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
Signed-off-by: lkk12014402 <kaokao.lv@intel.com>
@xin3he (Contributor) commented Mar 26, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@chensuyue (Contributor) commented:
/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@chensuyue (Contributor) commented:
@chensuyue @XuehaoSun To support Hadamard, a safetensor file has been added to the main branch. I assume we will not include this file in the release package by default. Please help include it if Kaokao thinks it is a better option than generating it at runtime.

Added to the release package.

@chensuyue chensuyue merged commit f07048b into intel:main Mar 26, 2026
37 checks passed
