Fix packing format in quantization config and update variable assignment in tests #1857

Merged: chensuyue merged 8 commits into main from xinhe/5-26 on May 28, 2026

Conversation

@xin3he (Contributor) commented May 26, 2026

Description

Please briefly describe your main changes, the motivation.

Type of Change

Bug fix

Related Issues

Fixes or relates to #1778

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.
  • The CUDA CI has passed. You can trigger it by commenting /azp run Unit-Test-CUDA-AutoRound.

…ent in tests

Signed-off-by: Xin He <xin3.he@intel.com>
Copilot AI review requested due to automatic review settings May 26, 2026 09:38
@xin3he (Contributor Author) commented May 26, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

xin3he requested a review from XuehaoSun May 26, 2026 09:39
Copilot AI (Contributor) left a comment

Pull request overview

Fixes metadata and test usage around exported quantized model formats, aligning saved config fields and test expectations with the actual export behavior.

Changes:

  • Update model-free quantization_config to emit the intended packing_format value.
  • Update CUDA FP8 export test to use the returned export directory from quantize_and_save().
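The test-side change in the list above can be illustrated with a small sketch. It does not call AutoRound: `fake_quantize_and_save` is a hypothetical stand-in, and both its `(model, export_dir)` return shape and the format-specific subfolder are assumptions made only to show why a test should load from the returned directory rather than from the one it requested.

```python
import os
import tempfile


def fake_quantize_and_save(output_dir, format):
    """Hypothetical stand-in for an exporter (not the AutoRound API).

    Assumption: the exporter may write into a format-specific subfolder
    and returns the folder it actually used.
    """
    export_dir = os.path.join(output_dir, format)
    os.makedirs(export_dir, exist_ok=True)
    with open(os.path.join(export_dir, "config.json"), "w", encoding="utf-8") as f:
        f.write("{}")
    return object(), export_dir  # (model placeholder, actual export folder)


with tempfile.TemporaryDirectory() as requested_dir:
    # Capture the returned folder instead of assuming it equals requested_dir.
    _, export_dir = fake_quantize_and_save(requested_dir, format="example_format")
    found_in_returned = os.path.exists(os.path.join(export_dir, "config.json"))
    found_in_requested = os.path.exists(os.path.join(requested_dir, "config.json"))
```

In this sketch a test that reloaded the model from `requested_dir` would not find `config.json`, while one that used the returned `export_dir` would.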

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| test/test_cuda/export/test_auto_round_format.py | Capture the returned export directory for FP8-block FP8 format test usage. |
| auto_round/compressors/model_free.py | Adjust the emitted quantization_config["packing_format"] value. |
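One way to check the emitted field after an export is to read it back from the saved config. The sketch below assumes the exported folder contains a `config.json` with a `quantization_config` section, which is a common Hugging Face layout but is not confirmed by this pull request. The helper name `read_packing_format` and the value `"placeholder_format"` are hypothetical; the actual value the PR emits is not stated here.

```python
import json
import os
import tempfile


def read_packing_format(export_dir):
    """Return quantization_config["packing_format"] from config.json, or None."""
    with open(os.path.join(export_dir, "config.json"), encoding="utf-8") as f:
        config = json.load(f)
    return config.get("quantization_config", {}).get("packing_format")


with tempfile.TemporaryDirectory() as export_dir:
    # Write a sample config; "placeholder_format" is an illustrative value only.
    sample = {"quantization_config": {"packing_format": "placeholder_format"}}
    with open(os.path.join(export_dir, "config.json"), "w", encoding="utf-8") as f:
        json.dump(sample, f)
    value = read_packing_format(export_dir)
```

A real test would compare the returned value against whatever string the target runtime expects for that export format.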

Comment thread auto_round/compressors/model_free.py Outdated
xin3he added 2 commits May 26, 2026 20:11
Signed-off-by: Xin He <xin3.he@intel.com>
@xin3he (Contributor Author) commented May 26, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@xin3he (Contributor Author) commented May 26, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Signed-off-by: Xin He <xin3.he@intel.com>
@xin3he (Contributor Author) commented May 26, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

chensuyue added this to the 0.13.0 milestone May 26, 2026
xin3he added 2 commits May 27, 2026 11:41
Signed-off-by: Xin He <xin3.he@intel.com>
Signed-off-by: Xin He <xin3.he@intel.com>
@xin3he (Contributor Author) commented May 27, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@chensuyue (Contributor) commented

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Signed-off-by: Xin He <xin3.he@intel.com>
@xin3he (Contributor Author) commented May 27, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Signed-off-by: Xin He <xin3.he@intel.com>
@chensuyue (Contributor) commented

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

chensuyue merged commit 64c0a5b into main May 28, 2026
47 checks passed
chensuyue deleted the xinhe/5-26 branch May 28, 2026 03:42
4 participants