
Align _choose_qparams_affine with _choose_scale_float8 behavior#3447

Merged
jerryzh168 merged 3 commits into pytorch:main from AryanBagade:fix-issue-3324-align-choose-qparams
Jan 10, 2026

Conversation

@AryanBagade
Contributor

@AryanBagade AryanBagade commented Dec 5, 2025

Changes keepdim default from False to True in _choose_qparams_affine to match _choose_scale_float8 behavior. This ensures scale/zero_point maintain the same rank as input tensor, making downstream handling more consistent.

Part 1 of fixing #3324

Changes

Core Changes (torchao/quantization/quant_primitives.py)

  • Changed `keepdim: bool = False` to `keepdim: bool = True` in both `choose_qparams_affine` (line 1220) and `_choose_qparams_affine` (line 1526)
  • Added reshape logic (lines 1600-1608) to match _choose_scale_float8 behavior
  • Saved original_input_size before reshaping to compute correct output shape
  • Added documentation explaining the alignment with _choose_scale_float8
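To illustrate the effect of the `keepdim` change, here is a minimal sketch, not the actual torchao implementation; `choose_scale_sketch` and the int8 symmetric scale formula are hypothetical stand-ins for the min/max reduction inside `_choose_qparams_affine`:

```python
import torch

def choose_scale_sketch(x, reduce_dims, keepdim=True):
    # Per-group absolute max, standing in for the min/max reduction
    # that _choose_qparams_affine performs over the group dimensions.
    amax = torch.amax(x.abs(), dim=reduce_dims, keepdim=keepdim)
    return amax / 127.0  # int8 symmetric scale, for illustration only

x = torch.randn(4, 8)
scale_keep = choose_scale_sketch(x, reduce_dims=[1], keepdim=True)
scale_drop = choose_scale_sketch(x, reduce_dims=[1], keepdim=False)
print(scale_keep.shape)  # torch.Size([4, 1]) -- same rank as the input
print(scale_drop.shape)  # torch.Size([4])    -- reduced dim dropped
```

With `keepdim=True`, downstream code can rely on scale/zero_point having the same rank as the input, which is the consistency this PR targets.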

Workflow Simplification (torchao/quantization/quantize_/workflows/intx/intx_unpacked_to_int8_tensor.py)

  • Removed manual reshape logic (lines 247-254) that is no longer needed

Test Updates (test/quantization/test_quant_primitives.py)

  • Updated 3 test cases to squeeze scale/zero_point before comparison with reference values
  • All 7 test_choose_qparams tests now pass
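The squeeze-before-comparison pattern described above can be sketched as follows (made-up values, not the actual test code):

```python
import torch

# scale as produced with keepdim=True: same rank as the (2, 4) input
scale = torch.tensor([[0.1], [0.2]])
# rank-1 reference values, as used by the pre-existing test expectations
ref = torch.tensor([0.1, 0.2])

# squeeze the kept dimension before comparing against the reference
assert torch.equal(scale.squeeze(), ref)
print(scale.squeeze().shape)  # torch.Size([2])
```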

@pytorch-bot

pytorch-bot bot commented Dec 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3447

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8a7b789 with merge base aa25287:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 5, 2025
@jerryzh168
Contributor

Thanks, I think it's a good start; we can remove the `keepdim` arg in a follow-up PR after this one is merged

@AryanBagade
Contributor Author

I see 25 integration tests failed due to backward compatibility issues with the `keepdim=True` default change

@jerryzh168
Contributor

it's expected. For now, maybe don't change the default; instead, turn `keepdim` to True in these tests one by one to make sure they're fixed. Making sure all the callsites are fixed before making the switch would be better

@AryanBagade
Contributor Author

Really sorry, had a super busy schedule

@AryanBagade
Contributor Author

@jerryzh168 could you please run the CI? I've made the required changes!

@meta-codesync

meta-codesync bot commented Dec 19, 2025

@jerryzh168 has imported this pull request. If you are a Meta employee, you can view this in D89579365.

@AryanBagade
Contributor Author

@jerryzh168 could you please let me know what further steps to take!

@jerryzh168
Contributor

jerryzh168 commented Jan 5, 2026

@AryanBagade https://github.com/pytorch/ao/actions/runs/20121850704/job/57745005031?pr=3447 is failing, but it's likely already fixed in main; can you rebase on the main branch? I have imported it and confirmed that this change doesn't break any of the internal tests, so we should be able to merge after you rebase

and `zero_point_domain`

Note:
keepdim defaults to True to align with _choose_scale_float8 behavior. This ensures
Contributor


right now it defaults to False, so the comment here should be changed

Changes keepdim default from False to True in _choose_qparams_affine to match
_choose_scale_float8 behavior. This ensures scale/zero_point maintain the same rank as input tensor, making downstream handling more consistent.

Fixes pytorch#3324
Signed-off-by: Aryan Bagade <aryan@aryanbagade.com>
@AryanBagade AryanBagade force-pushed the fix-issue-3324-align-choose-qparams branch from a56f742 to 8a7b789 on January 5, 2026 19:59
@AryanBagade
Contributor Author

@jerryzh168 Rebased on main and fixed the docstring comment as requested. Ready for re-import and merge!

@AryanBagade AryanBagade requested a review from jerryzh168 January 5, 2026 20:01
@jerryzh168 jerryzh168 added the module: not user facing Use this tag if you don't want this PR to show up in release notes label Jan 5, 2026
and `zero_point_domain`

Note:
Set keepdim=True to align with _choose_scale_float8 behavior. This ensures
Contributor


please change this to False, thanks

Contributor


oh sorry, seems OK; it's not talking about the default

@jerryzh168 jerryzh168 merged commit fbb3aef into pytorch:main Jan 10, 2026
23 of 25 checks passed