Remove reduce_range as it is not relevant for HTP #14559

navsud · 2025-09-24T22:04:34Z

Summary:
reduce_range=True reduces the available bit width by 1, in cases where quant_min, quant_max are not provided. It was originally intended for intel fbgemm kernels but I don't think this quantization setting is relevant for HTP.
Also, PTQ quantization config doesn't use it, so removing it in all the QAT configs. This helped improve the QAT model quality.

Differential Revision: D82867843

Summary: To save GPU memory `bfloat16` dtype is commonly used for training of LLMs. Currently, the quantizer ignores quantizing the nodes if they are not float32. This change enables quantization of bf16 nodes as well. Differential Revision: D82866443

Summary: `reduce_range=True` reduces the available bit width by 1, in cases where quant_min, quant_max are not provided. It was originally intended for intel `fbgemm` kernels but I don't think this quantization setting is relevant for HTP. Also, PTQ quantization config doesn't use it, so removing it in all the QAT configs. This helped improve the QAT model quality. Differential Revision: D82867843

pytorch-bot · 2025-09-24T22:04:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14559

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 5 New Failures, 1 Pending, 5 Unrelated Failures

As of commit 07e6a73 with merge base b3f3111 ():

NEW FAILURES - The following jobs have failed:

pull / test-qnn-wheel-packages-linux (3.10) / linux-job (gh)
RuntimeError: Command docker exec -t 025fd5b421cbf596dfc24d7725c0e35c9417b2308404c4c240d1103f542e50f6 /exec failed with exit code 1
pull / test-qnn-wheel-packages-linux (3.11) / linux-job (gh)
RuntimeError: Command docker exec -t 45baed3bfea59a5b1ae9156b42cb3a09a414f26f25da9a63e2e541763940a39e /exec failed with exit code 1
pull / test-qnn-wheel-packages-linux (3.12) / linux-job (gh)
RuntimeError: Command docker exec -t c6dee65bbd07b63c90aaa91a17ac99f1dbc7ecf1da8e192c290c5774e90c671b /exec failed with exit code 1
pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 0638c47d347eaa0ab46080e7d2fc0206c69bd370efbcf88cb35fc2a18fea5587 /exec failed with exit code 1
Test CUDA Builds / check-all-cuda-builds (gh)
Process completed with exit code 1.

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-binary-size-linux-gcc / linux-job (gh) (similar failure)
##[error]The operation was canceled.
Test CUDA Builds / test-executorch-cuda-build-12.8 / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
Test CUDA Builds / test-executorch-cuda-build-13.0 / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-setup-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest / macos / macos-job (gh) (trunk failure)
exir/tests/test_quant_fusion_pass.py::TestQuantFusionPass::test_embedding_torchao

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-09-24T22:04:45Z

@navsud has exported this pull request. If you are a Meta employee, you can view the originating diff in D82867843.

billmguo · 2025-09-24T22:14:01Z

@haowhsu-quic can u help review this PR?

haowhsu-quic

Thank you.

cccclai

stamp on behalf of qcom's team review

navsud added 2 commits September 24, 2025 15:04

navsud requested a review from cccclai as a code owner September 24, 2025 22:04

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 24, 2025

facebook-github-bot added fb-exported meta-exported labels Sep 24, 2025

navsud added the release notes: none Do not include this in the release notes label Sep 24, 2025

navsud requested a review from shewu-quic September 24, 2025 22:08

billmguo requested review from haowhsu-quic and winskuo-quic September 24, 2025 22:13

haowhsu-quic approved these changes Sep 25, 2025

View reviewed changes

cccclai approved these changes Sep 25, 2025

View reviewed changes

facebook-github-bot merged commit 4622edb into pytorch:main Sep 26, 2025
125 of 151 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove reduce_range as it is not relevant for HTP #14559

Remove reduce_range as it is not relevant for HTP #14559

Uh oh!

navsud commented Sep 24, 2025

Uh oh!

pytorch-bot bot commented Sep 24, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 24, 2025

Uh oh!

billmguo commented Sep 24, 2025

Uh oh!

haowhsu-quic left a comment

Uh oh!

cccclai left a comment

Uh oh!

Uh oh!

Uh oh!

Remove reduce_range as it is not relevant for HTP #14559

Remove reduce_range as it is not relevant for HTP #14559

Uh oh!

Conversation

navsud commented Sep 24, 2025

Uh oh!

pytorch-bot bot commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14559

❌ 5 New Failures, 1 Pending, 5 Unrelated Failures

Uh oh!

facebook-github-bot commented Sep 24, 2025

Uh oh!

billmguo commented Sep 24, 2025

Uh oh!

haowhsu-quic left a comment

Choose a reason for hiding this comment

Uh oh!

cccclai left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 24, 2025 •

edited

Loading