
Fix FP8 quantizer for Transformers v4 #1504

Merged
yiliu30 merged 11 commits into main from hpu-v4 on Mar 9, 2026
Conversation

@yiliu30 (Contributor) commented on Mar 6, 2026

Description

Please briefly describe your main changes and the motivation behind them.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

Signed-off-by: yiliu30 <yi4.liu@intel.com>
Copilot AI review requested due to automatic review settings on March 6, 2026 at 03:12
Copilot AI (Contributor) left a comment


Pull request overview

This PR restores FP8 fine-grained integration compatibility with Transformers v4 by switching the HPU finegrained_fp8 monkeypatch to a version-specific patch module and by introducing a dedicated v4 patch implementation.

Changes:

  • Update HPU patching to select a Transformers v4 vs v5+ compatible finegrained_fp8 replacement module at runtime (see the sketch after this list).
  • Add a new finegrained_fp8_patch_v4.py module providing FP8Linear replacement logic for Transformers v4.
  • Add an example script for running FP8 static quantization and saving output.
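
The diff itself isn't quoted in this overview, so here is a minimal sketch of what the version-gated selection plus monkeypatch could look like. Only finegrained_fp8_patch_v4 and hpu_patch.py are named by the PR; the v5+ module name and the exact patch step are assumptions.

```python
# Hypothetical sketch of the version-gated patching in
# auto_round/modeling/hpu_patch.py; names other than
# finegrained_fp8_patch_v4 are assumptions, not the PR's actual code.
import transformers
import transformers.integrations.finegrained_fp8 as hf_fp8
from packaging import version

if version.parse(transformers.__version__) < version.parse("5.0"):
    # Transformers v4 gets the dedicated compatibility module added by this PR.
    from auto_round.modeling import finegrained_fp8_patch_v4 as fp8_patch
else:
    # Assumed module name for v5+; not confirmed by the PR text.
    from auto_round.modeling import finegrained_fp8_patch as fp8_patch

# Swap the upstream FP8Linear for the HPU-compatible replacement.
hf_fp8.FP8Linear = fp8_patch.FP8Linear
```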

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 10 comments.

  • examples/quant_model.py: Adds an example CLI for quantizing and saving a model with the AutoRound FP8 static scheme (sketched below).
  • auto_round/modeling/hpu_patch.py: Selects the correct finegrained_fp8 patch module depending on the Transformers major version.
  • auto_round/modeling/finegrained_fp8_patch_v4.py: Introduces a Transformers v4-specific FP8Linear replacement implementation.
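
As a companion, a hedged sketch of what the new example CLI might look like; the flags, defaults, and the "FP8_STATIC" scheme name are assumptions based on the file description above, and the real examples/quant_model.py may differ.

```python
# Assumed shape of examples/quant_model.py: quantize a model with an
# FP8 static scheme via AutoRound and save the result.
import argparse

from auto_round import AutoRound


def main():
    parser = argparse.ArgumentParser(description="FP8 static quantization example.")
    parser.add_argument("--model", required=True, help="HF model id or local path")
    parser.add_argument("--output_dir", default="./qmodel", help="where to save the quantized model")
    args = parser.parse_args()

    # "FP8_STATIC" is assumed to be the predefined AutoRound scheme used here.
    autoround = AutoRound(args.model, scheme="FP8_STATIC")
    autoround.quantize_and_save(output_dir=args.output_dir)


if __name__ == "__main__":
    main()
```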

@chensuyue chensuyue added this to the 0.10.3 milestone Mar 7, 2026
yiliu30 and others added 5 commits on March 9, 2026 at 08:51
Signed-off-by: yiliu30 <yi4.liu@intel.com>

@a32543254 left a comment


LGTM

yiliu30 merged commit 6ab2db2 into main on Mar 9, 2026
29 checks passed
yiliu30 deleted the hpu-v4 branch on March 9, 2026 at 02:36