## Fix GPTQ producing invalid QArray when subchannel quantization not specified by copybara-service[bot] · Pull Request #198 · google/qwix

copybara-service · 2026-01-22T04:28:51Z

Fix GPTQ producing invalid QArray when subchannel quantization not specified

### Summary

Fixes a bug where `gptq_core.quantize_weight` inadvertently applies subchannel quantization, which can produce invalid QArrays even when the user did not request subchannel quantization.

### Problem

When subchannel quantization is not specified (`tiled_axes` is empty), `quantize_weight` defaults the groupsize to `rows`:

`groupsize = how.tiled_axes.get(1, rows)`

This causes scales to be computed every `rows` columns, creating `ceil(columns/rows)` scale groups. When `columns` is not evenly divisible by `rows`, this produces an invalid QArray because the resulting scale shape violates the QArray contract (`all(qvalue_dim % scale_dim == 0 for each axis)`).

When subchannel quantization is not specified, the scale should have shape `(rows, 1)` (per-channel quantization), not `(rows, num_groups)` (sub-channel).
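The shape arithmetic behind this failure can be sketched in plain Python. This is a toy model, not qwix code: `scale_shape` and `satisfies_contract` are hypothetical helpers, and the 4 x 10 weight is an arbitrary example chosen so that `columns` is not divisible by `rows`.

```python
import math

def scale_shape(rows, columns, groupsize):
    # Hypothetical helper: one scale per row and per group of `groupsize` columns.
    return (rows, math.ceil(columns / groupsize))

def satisfies_contract(qvalue_shape, scale_shape_):
    # The divisibility check quoted in the PR description, applied per axis.
    return all(q % s == 0 for q, s in zip(qvalue_shape, scale_shape_))

rows, columns = 4, 10

# Old default: groupsize falls back to `rows` when no tiling is requested.
old_scale = scale_shape(rows, columns, groupsize=rows)
print(old_scale)                                       # (4, 3)
print(satisfies_contract((rows, columns), old_scale))  # False, since 10 % 3 != 0
```

With these numbers the old default yields three scale groups along axis 1, and 10 is not a multiple of 3, so the divisibility check fails.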

### Solution

This change modifies the default groupsize from `rows` to `columns`:

`groupsize = how.tiled_axes.get(1, columns)`

This ensures:

- If subchannel is specified: use the user's groupsize
- If subchannel is not specified: use `columns` as groupsize, producing per-channel quantization with `scale.shape = (rows, 1)`
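The two cases above can be illustrated with a small sketch. It assumes `tiled_axes` behaves like a dict mapping an axis index to a tile size, which is suggested by the `.get(1, ...)` call but not confirmed here; `resolve_groupsize` and `scale_shape` are hypothetical names used only for illustration.

```python
import math

def resolve_groupsize(tiled_axes, columns):
    # Mirrors the fixed line: fall back to `columns` when axis 1 is not tiled.
    return tiled_axes.get(1, columns)

def scale_shape(rows, columns, groupsize):
    # Hypothetical helper: one scale per row and per group of `groupsize` columns.
    return (rows, math.ceil(columns / groupsize))

rows, columns = 4, 10

# No subchannel request: a single group spans every column (per-channel).
per_channel = scale_shape(rows, columns, resolve_groupsize({}, columns))
print(per_channel)  # (4, 1)

# Explicit subchannel request: the user's groupsize is used unchanged.
subchannel = scale_shape(rows, columns, resolve_groupsize({1: 5}, columns))
print(subchannel)   # (4, 2)
```

Because the fallback groupsize equals `columns`, the group count along axis 1 is always 1 in the default case, regardless of how `rows` and `columns` relate.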

PiperOrigin-RevId: 859455381

copybara-service[bot] force-pushed the test_859382529 branch from 9b946ec to 04b2f75 on January 22, 2026 05:51

copybara-service[bot] changed the title from ~~Fixes QArray identity reshape failure with subchannel quantization~~ to "Fix GPTQ producing invalid QArray when subchannel quantization not specified" on Jan 22, 2026

copybara-service[bot] force-pushed the test_859382529 branch from 04b2f75 to 2ddf53f on January 22, 2026 07:52

copybara-service[bot] force-pushed the test_859382529 branch from 2ddf53f to 0684523 on January 22, 2026 07:53

copybara-service[bot] merged commit 0684523 into main on Jan 22, 2026

copybara-service[bot] deleted the test_859382529 branch on January 22, 2026 07:53
