Add 16A8W quantization configuration utility for ARM backend #13728

pytorchbot · 2025-08-27T17:20:14Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #13641 by @Ninja91
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/Ninja91/1/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/Ninja91/1/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/main
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/Ninja91/1/orig
@diff-train-skip-merge

pytorch-bot · 2025-08-27T17:20:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13728

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Multiple CI trunk failures after landing https://github.com/pytorch/pytorch/pull/161002

❌ 3 New Failures

As of commit 8f4c714 with merge base 8278e7b ():

NEW FAILURES - The following jobs have failed:

Build documentation / build (buck2) / Build doc (gh)
At least one of the pre-conditions you specified did not hold
Build Presets / windows (pybind) / build (gh)
Process completed with exit code 1.
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
RuntimeError: Command docker exec -t 4b1bbe063c5a41867e003dd1ad254aba24e432a89e162bd44077add8e12e2b46 /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-08-27T17:20:51Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Pull Request resolved: #13641 This diff implements a 16A8W (16-bit activations, 8-bit weights) quantization configuration utility for the ExecutorTorch ARM backend, following the feedback from D79746479. ## Key Changes **1. New Quantization Configuration Function** - Add `get_16a8w_quantization_config()` in `fbcode/executorch/backends/arm/quantizer/arm_quantizer.py` - Provides 16-bit activations with HistogramObserver (better precision than 8A8W) - Maintains 8-bit weights with MinMaxObserver/PerChannelMinMaxObserver (memory efficient) - **Technically supported by TOSA through [EXT-INT16 extension/profile](https://www.mlplatform.org/tosa/tosa_spec.html#_conv2d)** ## Benefits - **Better Precision**: 16-bit activations provide higher precision than 8-bit. Useful for carrying precision for recurring neural nets. ghstack-source-id: 305991462 @exported-using-ghexport Differential Revision: [D79763381](https://our.internmc.facebook.com/intern/diff/D79763381/)

pytorchbot requested a review from digantdesai as a code owner August 27, 2025 17:20

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 27, 2025

Ninja91 force-pushed the gh/Ninja91/1/orig branch from bb0f06d to 773fc0b Compare August 27, 2025 18:22

Ninja91 force-pushed the gh/Ninja91/1/orig branch from 773fc0b to 8f4c714 Compare August 27, 2025 18:49

lucylq approved these changes Sep 2, 2025

View reviewed changes

lucylq merged commit 4d1da11 into main Sep 2, 2025
109 of 112 checks passed

lucylq deleted the gh/Ninja91/1/orig branch September 2, 2025 21:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add 16A8W quantization configuration utility for ARM backend #13728

Add 16A8W quantization configuration utility for ARM backend #13728

Uh oh!

pytorchbot commented Aug 27, 2025

Uh oh!

pytorch-bot bot commented Aug 27, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Aug 27, 2025

Uh oh!

Uh oh!

Uh oh!

Add 16A8W quantization configuration utility for ARM backend #13728

Add 16A8W quantization configuration utility for ARM backend #13728

Uh oh!

Conversation

pytorchbot commented Aug 27, 2025

Uh oh!

pytorch-bot bot commented Aug 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13728

❗ 1 Active SEVs

❌ 3 New Failures

Uh oh!

github-actions bot commented Aug 27, 2025

This PR needs a release notes: label

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 27, 2025 •

edited

Loading

This PR needs a `release notes:` label