
Fix compatibility with latest compressed_tensors compressor refactor#1576

Merged
xin3he merged 9 commits into main from fix/pr-1570
Mar 20, 2026

Conversation

@yiliu30 (Contributor) commented Mar 19, 2026

Fix #1566

Summary

  • Fix ModuleNotFoundError: No module named 'compressed_tensors.config.format', caused by compressed_tensors commit 927f6d5, which removed the config.format module
  • Add a backward-compatible _compress_and_set_format() helper that supports both the old and new compressed_tensors APIs:
      • New API (>=0.9.0): uses compress_module() from compressed_tensors.compressors
      • Old API (<0.9.0): falls back to NaiveQuantizationCompressor + set_per_module_format
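The old/new dispatch in the helper can be sketched as a generic try-import fallback. The snippet below is illustrative only: it demonstrates the pattern with stdlib modules rather than the real compressed_tensors entry points, whose exact signatures are not shown in this PR, and `import_first_available` is a hypothetical name.

```python
from importlib import import_module


def import_first_available(candidates):
    """Return the first importable (attribute, module_name) pair.

    A minimal sketch of the fallback pattern the PR's helper uses to
    support both compressed_tensors APIs: try the new location first,
    then fall back to the legacy one. The candidate list here is a
    stand-in, not the real compressed_tensors module paths.
    """
    for mod_name, attr in candidates:
        try:
            mod = import_module(mod_name)
            return getattr(mod, attr), mod_name
        except (ImportError, AttributeError):
            continue  # try the next candidate location
    raise ImportError(f"none of {candidates} could be imported")


# Illustrative use: prefer a module that does not exist, fall back to math.sqrt.
fn, origin = import_first_available(
    [("compressed_tensors_new_api_placeholder", "compress_module"),
     ("math", "sqrt")]
)
```

In the actual helper, the resolved callable would then drive either the `compress_module()` path or the `NaiveQuantizationCompressor` + `set_per_module_format` path.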

Fixes the 3 CI failures in PR #1570:

  • test_llmc_dynamic_wint8aint8_export
  • test_llmc_dynamic_wint8aint8_export_with_tuning
  • test_fp8_block_llm_compressor_format

Test plan

  • All 3 previously failing tests pass locally (3 passed in 81s)
  • Full CI run on this PR

Copilot AI review requested due to automatic review settings March 19, 2026 11:48
Copilot AI left a comment


Pull request overview

This PR updates the LLM-Compressor export path to remain compatible with recent compressed_tensors refactors (notably removal of compressed_tensors.config.format) and adjusts test dependency sourcing to use upstream llmcompressor main.

Changes:

  • Add _compress_and_set_format() to support both legacy and newer compressed_tensors compression/format APIs.
  • Update pack_layer() to use the new compatibility helper.
  • Switch CPU/CUDA test requirements to install llmcompressor from the upstream @main branch (removing the temporary compressed-tensors pin).

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File | Description
auto_round/export/export_to_llmcompressor/export.py | Introduces the compatibility helper and routes packing through it to handle new/old compressed_tensors APIs.
test/test_cuda/requirements_llmc.txt | Updates the LLMC test dependency source to llmcompressor@main.
test/test_cpu/requirements.txt | Updates the CPU test dependency source to llmcompressor@main.

The compressed_tensors commit 927f6d5 removed `compressed_tensors.config.format`
and its `set_per_module_format` function. This broke the llmcompressor export path
in `pack_layer()`.

Add `_compress_and_set_format()` helper with backward compatibility:
- New API (>=0.9.0): uses `compress_module()` from `compressed_tensors.compressors`
- Old API (<0.9.0): falls back to `NaiveQuantizationCompressor` + `set_per_module_format`

Signed-off-by: Yi Liu <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
yiliu30 and others added 4 commits March 19, 2026 12:00
Fetches Azure DevOps CI logs for a given GitHub PR number and parses
pytest FAILURES sections to produce a grouped summary with tracebacks.

Usage:
  python scripts/fetch_ci_failures.py <PR_NUMBER> [--save <output.md>]

Signed-off-by: Yi Liu <yi4.liu@intel.com>
Signed-off-by: yiliu30 <yi4.liu@intel.com>
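The parsing half of the script described above could look roughly like this. It is a sketch only: `parse_pytest_failures` and the banner regexes are hypothetical stand-ins for whatever the local script actually does, and the Azure DevOps log fetching is omitted.

```python
import re

# pytest marks the failures section with an "=== FAILURES ===" banner and
# each failing test with an underscore banner containing the test name.
FAILURE_HEADER = re.compile(r"^=+ FAILURES =+$", re.MULTILINE)
TEST_BANNER = re.compile(r"^_+ (\S+) _+$", re.MULTILINE)


def parse_pytest_failures(log_text: str) -> dict:
    """Group tracebacks by failing test name from raw pytest output."""
    m = FAILURE_HEADER.search(log_text)
    if not m:
        return {}
    section = log_text[m.end():]
    # Stop at the next "=== ... ===" banner (e.g. the short test summary).
    end = re.search(r"^=+ .* =+$", section, re.MULTILINE)
    if end:
        section = section[: end.start()]
    # re.split with a capture group yields [preamble, name1, body1, name2, ...]
    parts = TEST_BANNER.split(section)
    return {
        name: body.strip()
        for name, body in zip(parts[1::2], parts[2::2])
    }
```

Feeding this a CI log would yield a dict mapping each failing test name to its traceback text, which the script could then render as a grouped markdown summary.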
CompressedLinear was removed in compressed_tensors PR #610.
These 3 tests need weight_handler.py detect_layer updates
to work with the new quantization_scheme-based loading path.

Signed-off-by: yiliu30 <yi4.liu@intel.com>
Keep as local-only utility script.

Signed-off-by: yiliu30 <yi4.liu@intel.com>
@chensuyue chensuyue added this to the 0.12.0 milestone Mar 20, 2026
@xin3he xin3he merged commit 49d2dbb into main Mar 20, 2026
29 checks passed
@xin3he xin3he deleted the fix/pr-1570 branch March 20, 2026 06:24


Development

Successfully merging this pull request may close these issues.

[LLMC]: Adapt to the CT&LLMC main branch code refactor
