Skip to content

Conversation

@XueSongTap
Copy link

Changes

This PR adds ONNX backend support for the data-free Weight Compression Algorithm.
The implementation:

  • Creates a new WeightCompressionAlgoBackend for ONNX
  • Supports int8 and uint8 weight compression

Reason for changes

This implementation expands NNCF's weight compression support to the ONNX backend,
complementing existing support for OpenVINO, Torch, and TorchFX backends.
This allows users to directly compress ONNX models without conversion to other formats.

Related tickets

Fixes #3273

Tests

@XueSongTap XueSongTap requested a review from a team as a code owner March 16, 2025 08:53
@github-actions github-actions bot added NNCF ONNX Pull requests that updates NNCF ONNX NNCF PTQ Pull requests that updates NNCF PTQ labels Mar 16, 2025
@alexsu52 alexsu52 requested a review from kshpv March 17, 2025 05:28
@alexsu52 alexsu52 self-requested a review March 17, 2025 05:29
@p-wysocki
Copy link
Contributor

Hello, is this PR still being worked on?

bump @alexsu52

@alexsu52
Copy link

The PR is not active.

@alexsu52 alexsu52 closed this Apr 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

NNCF ONNX Pull requests that updates NNCF ONNX NNCF PTQ Pull requests that updates NNCF PTQ

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Good First Issue][NNCF]: Add ONNX support of data-free Weight Compression Algorithm

4 participants