[ET-VK] Statically quantized add #14649

SS-JIA · 2025-09-28T18:26:51Z

Stack from ghstack (oldest at bottom):

Changes

Title says it all! This diff adds an implementation of binary operators where all tensors are quantized to 8-bit with per-tensor scale and zero point. This is required for many convolutional neural networks.

Differential Revision: D83437828
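For readers unfamiliar with the term, the usual reference semantics of a statically quantized add can be sketched as follows. This is only an illustration under stated assumptions, not the Vulkan implementation from this diff: the function names are hypothetical, a signed int8 range of [-128, 127] is assumed, and the dequantize, add, requantize formulation is the common textbook definition rather than anything confirmed by this PR.

```python
from typing import List

# Assumed signed 8-bit range; the PR only says "8-bit".
QMIN, QMAX = -128, 127


def dequantize(q: int, scale: float, zero_point: int) -> float:
    """Map a quantized integer back to a real value."""
    return (q - zero_point) * scale


def quantize(x: float, scale: float, zero_point: int) -> int:
    """Map a real value to the assumed int8 range, saturating at the limits.

    Python's round() rounds ties to even; other implementations may differ.
    """
    q = round(x / scale) + zero_point
    return max(QMIN, min(QMAX, q))


def quantized_add(
    qa: List[int], a_scale: float, a_zp: int,
    qb: List[int], b_scale: float, b_zp: int,
    out_scale: float, out_zp: int,
) -> List[int]:
    """Elementwise add where both inputs and the output each carry their own
    per-tensor scale and zero point (hypothetical reference helper)."""
    out = []
    for x, y in zip(qa, qb):
        real_sum = dequantize(x, a_scale, a_zp) + dequantize(y, b_scale, b_zp)
        out.append(quantize(real_sum, out_scale, out_zp))
    return out


if __name__ == "__main__":
    # Inputs represent [1.0, 2.0, -3.0] and [1.0, 0.0, 2.0].
    print(quantized_add([2, 4, -6], 0.5, 0, [14, 10, 18], 0.25, 10, 0.5, -1))
```

Because all scales and zero points are fixed ahead of time ("static"), nothing has to be computed from the data at runtime. A real kernel would typically fold the three scales into precomputed multipliers instead of going through floating-point values per element.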


pytorch-bot · 2025-09-28T18:26:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14649

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 2 Unrelated Failures

As of commit 4a9196e with merge base 049c9fc:

NEW FAILURES - The following jobs have failed:

pull / test-llama-lora-linux / linux-job (gh)
RuntimeError: Command docker exec -t 28eaf976fff904cf09d3fd61185f590a2b8c86e70c5563b0d7cb7b44d682c1df /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
backends/xnnpack/test/recipes/test_xnnpack_recipes.py::TestXnnpackRecipes::test_all_models_with_recipes

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-binary-size-linux-gcc / linux-job (gh) (similar failure)
##[error]The operation was canceled.

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-setup-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

## Changes

Title says it all! This diff adds an implementation of binary operators where all tensors are quantized to 8-bit with per-tensor scale and zero point. This is required for many convolutional neural networks.

Differential Revision: [D83437828](https://our.internmc.facebook.com/intern/diff/D83437828/)

ghstack-source-id: 312658920
Pull Request resolved: #14649

facebook-github-bot · 2025-09-28T18:27:25Z

@SS-JIA has exported this pull request. If you are a Meta employee, you can view the originating diff in D83437828.

github-actions · 2025-09-28T18:28:01Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


Pull Request resolved: #14649

## Changes

Title says it all! This diff adds an implementation of binary operators where all tensors are quantized to 8-bit with per-tensor scale and zero point. This is required for many convolutional neural networks.

Differential Revision: [D83437828](https://our.internmc.facebook.com/intern/diff/D83437828/)

ghstack-source-id: 312663831

facebook-github-bot · 2025-09-28T20:14:24Z

@SS-JIA has exported this pull request. If you are a Meta employee, you can view the originating diff in D83437828.


Pull Request resolved: #14649

## Changes

Title says it all! This diff adds an implementation of binary operators where all tensors are quantized to 8-bit with per-tensor scale and zero point. This is required for many convolutional neural networks.

ghstack-source-id: 312809809

Differential Revision: [D83437828](https://our.internmc.facebook.com/intern/diff/D83437828/)

facebook-github-bot · 2025-09-29T17:30:43Z

@SS-JIA has exported this pull request. If you are a Meta employee, you can view the originating diff in D83437828.


@SS-JIA

This PR was created by the merge bot to help merge the original PR into the main branch.

ghstack PR number: #14649 by @SS-JIA
^ Please use this as the source of truth for the PR details, comments, and reviews

ghstack PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/334/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/334/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/333/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/334/orig

Differential Revision: [D83437828](https://our.internmc.facebook.com/intern/diff/D83437828/)

@diff-train-skip-merge

Co-authored-by: ssjia <ssjia@devvm26340.ftw0.facebook.com>

This was referenced Sep 28, 2025

[ET-VK] Statically quantized convolutions #14647

Merged

[ET-VK] AOT logic for quantized conv2d #14648

Merged

meta-cla bot added the CLA Signed label Sep 28, 2025

facebook-github-bot added fb-exported meta-exported labels Sep 28, 2025

SS-JIA requested review from larryliu0820 and kirklandsign as code owners September 28, 2025 20:13

manuelcandales approved these changes Sep 29, 2025


facebook-github-bot merged commit 48e2d0a into gh/SS-JIA/334/base Sep 29, 2025
125 of 132 checks passed

facebook-github-bot deleted the gh/SS-JIA/334/head branch September 29, 2025 22:12

facebook-github-bot temporarily deployed to cherry-pick-bot September 29, 2025 22:12 — with GitHub Actions Inactive

pytorchbot mentioned this pull request Sep 29, 2025

[ET-VK] Statically quantized add #14670

Merged
