Quantization: Add 2bit to precision constants, Fix -1 group_size handling in ModelBuilder (#2338)

Merged
jambayk merged 2 commits into main from 2bit-mb
Feb 18, 2026
Conversation

@jambayk
Contributor

@jambayk jambayk commented Feb 17, 2026

Describe your changes

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link


Copilot AI left a comment


Pull request overview

This PR adds support for 2-bit quantization and fixes handling of per-channel quantization (group_size == -1) in the ModelBuilder pass. The changes enable proper loading and processing of quantized models that use per-channel quantization or 2-bit precision.

Changes:

  • Added BITS2 = 2 to the PrecisionBits enum to support 2-bit quantization
  • Fixed ModelBuilder logic to handle group_size == -1 (per-channel quantization) correctly by avoiding division by -1
  • Enabled and enhanced test coverage for ModelBuilder with Olive-quantized models, including group_size == -1 test cases
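The group_size == -1 fix can be illustrated with a minimal sketch (this is illustrative only, not the actual ModelBuilder code; the helper names are hypothetical). The idea is that -1 denotes per-channel quantization, i.e. a single group spanning the full input dimension, so it must be normalized before any division by group_size:

```python
def effective_group_size(group_size: int, in_features: int) -> int:
    """Treat group_size == -1 (per-channel) as one group spanning the input dim."""
    return in_features if group_size == -1 else group_size

def num_groups(group_size: int, in_features: int) -> int:
    """Number of quantization groups along the input dimension."""
    gs = effective_group_size(group_size, in_features)
    assert in_features % gs == 0, "in_features must be divisible by the group size"
    return in_features // gs

# Per-channel quantization: one scale per output channel, a single group.
print(num_groups(-1, 4096))   # 1
# Grouped quantization: one scale per 16-element block.
print(num_groups(16, 4096))   # 256
```

Without the normalization step, `in_features // -1` would silently yield a negative group count instead of the intended single per-channel group.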

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File descriptions:

  • olive/constants.py: Added BITS2 constant to the PrecisionBits enum for 2-bit quantization support
  • olive/passes/onnx/model_builder.py: Refactored qweight tensor processing to properly handle group_size == -1 and simplified the logic by removing separate scales handling
  • test/passes/onnx/test_model_builder.py: Removed the skip decorator and added group_size parametrization to test both grouped (16) and per-channel (-1) quantization
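The shape of the two Python-side changes can be sketched as follows (a minimal sketch with assumed enum members; the exact contents of Olive's PrecisionBits enum and test structure may differ):

```python
from enum import IntEnum

class PrecisionBits(IntEnum):
    """Quantization bit widths; BITS2 is the newly added member."""
    BITS2 = 2   # new: enables 2-bit quantization
    BITS4 = 4
    BITS8 = 8

# Mirrors the test parametrization: group_size 16 (grouped) and -1 (per-channel).
for group_size in (16, -1):
    label = "per-channel" if group_size == -1 else f"group={group_size}"
    print(PrecisionBits.BITS2.value, "bits,", label)
```

In the actual test this parametrization would drive quantizing a model at each group_size and running the ModelBuilder pass over the result.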

@jambayk jambayk merged commit 86f9469 into main Feb 18, 2026
17 checks passed
@jambayk jambayk deleted the 2bit-mb branch February 18, 2026 20:54
