Fix indexing overflow issue for blockwise quantization on AMD #1796

sstamenk · 2025-11-03T21:16:04Z

This PR ports the changes from #1784 to kernels.hip and ops.hip so that the newly added test_dynamic_blockwise_quantization_large test passes on AMD GPUs. Without this change, the test aborts.

Tested on both W7900 (gfx1100) and R9700 (gfx1201); all 3 unit tests passed.
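For readers unfamiliar with this class of bug, here is a rough sketch of how a 32-bit element offset can go wrong once a tensor holds 2^31 or more elements. It is not the code from kernels.hip or ops.hip. The helper names, the block size of 4096, and the offset formula are assumptions made for illustration. Signed overflow is undefined behavior in C++, so the two's-complement wraparound simulated here is only the commonly observed outcome.

```python
# Illustrative only: simulates a signed 32-bit index computation in pure Python.
# All names below are hypothetical and do not appear in bitsandbytes.

INT32_MAX = 2**31 - 1


def wrap_int32(value: int) -> int:
    """Reduce an integer to the signed 32-bit two's-complement range."""
    return (value + 2**31) % 2**32 - 2**31


def block_offset_int32(block_idx: int, block_size: int) -> int:
    """Element offset of a block as a 32-bit multiply would typically produce it."""
    return wrap_int32(block_idx * block_size)


def block_offset_int64(block_idx: int, block_size: int) -> int:
    """Element offset of a block computed with a wide enough integer type."""
    return block_idx * block_size


if __name__ == "__main__":
    block_size = 4096      # assumed block size
    last_safe = 524287     # 524287 * 4096 still fits in int32
    first_bad = 524288     # 524288 * 4096 == 2**31, one past INT32_MAX

    print(block_offset_int32(last_safe, block_size))  # still a valid offset
    print(block_offset_int32(first_bad, block_size))  # wraps to a negative offset
    print(block_offset_int64(first_bad, block_size))  # correct offset
```

A negative or wrapped offset used as a pointer index would read or write out of bounds, which is consistent with the test being aborted. Doing the index arithmetic in a 64-bit type avoids the problem.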

github-actions · 2025-11-03T21:32:31Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

matthewdouglas

Thanks, LGTM!

Fix int32 overflow for blocksize quantization

e587020

matthewdouglas added the ROCm label Nov 3, 2025

matthewdouglas added this to the v0.49.0 milestone Nov 3, 2025

matthewdouglas approved these changes Nov 3, 2025

matthewdouglas merged commit 1920972 into bitsandbytes-foundation:main Nov 3, 2025
53 checks passed

sstamenk deleted the blockwise-quant-index-overflow-amd branch November 3, 2025 22:00
