[ez][ET-VK] Small fix for choose_qparams_affine_impl #16186

SS-JIA · 2025-12-10T21:36:56Z

Stack from ghstack (oldest at bottom):

It seems that choose_qparams_affine has recently appended some arguments to the schema. This causes newly exported models to break because at runtime, the output arg can no longer be found.

Fix by locating the output argument as the last entry in the args vector, rather than continuously incrementing the args index.

Update quantize/dequantize ops as well since it seems quantized_decomposed namespace ops are subject to change in the future.

Note that it would be good to do this for all operators in the Vulkan backend as a later refactor.

Differential Revision: D88887463

It seems that `choose_qparams_affine` has recently appended some arguments to the schema. This causes newly exported models to break because at runtime, the output arg can no longer be found. Fix by locating the output argument as the last entry in the args vector, rather than continuously incrementing the args index. Update quantize/dequantize ops as well since it seems quantized_decomposed namespace ops are subject to change in the future. Note that it would be good to do this for all operators in the Vulkan backend as a later refactor. Differential Revision: [D88887463](https://our.internmc.facebook.com/intern/diff/D88887463/) [ghstack-poisoned]

pytorch-bot · 2025-12-10T21:37:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16186

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit c9abd59 with merge base a0a6278 ():

NEW FAILURE - The following job has failed:

pull / test-llama-lora-linux / linux-job (gh)
RuntimeError: Command docker exec -t d22040eb13e10cc026fb51ae88d0ce22dedde9af80056a44ba531ea777dcbe7d /exec failed with exit code 1

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / android / run-emulator (gh) (#16137)
Timeout waiting for emulator to boot.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-12-10T21:38:01Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

It seems that `choose_qparams_affine` has recently appended some arguments to the schema. This causes newly exported models to break because at runtime, the output arg can no longer be found. Fix by locating the output argument as the last entry in the args vector, rather than continuously incrementing the args index. Update quantize/dequantize ops as well since it seems quantized_decomposed namespace ops are subject to change in the future. Note that it would be good to do this for all operators in the Vulkan backend as a later refactor. Differential Revision: [D88887463](https://our.internmc.facebook.com/intern/diff/D88887463/) [ghstack-poisoned]

JacobSzwejbka · 2025-12-11T02:17:53Z

backends/vulkan/runtime/graph/ops/impl/QuantizeDequantize.cpp

    const std::vector<ValueRef>& args) {
-  int32_t arg_idx = 0;
+  size_t arg_idx = 0;
+  size_t last_arg_idx = args.size() - 1;


For the ones in between do you just rely on default behavior? What if they are serialized with values != default? Shouldnt you error out?

yeah, that's a good point. I have a planned updated to improve arg checking for quantized_decomposed ops since there are currently a lot of unsupported input cases which are not accounted for - I will include this in that update.

The primary purpose of this PR as-is is to recover a currently broken CI signal, so I would prefer to keep it as simple as possible. In practice, not validating the args should be ok (for now) since the quantized_decomposed ops are inserted by a quantization workflow and Vulkan doesn't really work with non-supported quant workflows anyways 😛

@manuelcandales

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #16187 * #16186 TSIA! Differential Revision: [D88802100](https://our.internmc.facebook.com/intern/diff/D88802100/) cc @manuelcandales @digantdesai @cbilgin --------- Co-authored-by: ssjia <ssjia@devvm1479.ncg0.facebook.com>

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 10, 2025

SS-JIA mentioned this pull request Dec 10, 2025

[ez][ET-VK] Update Vulkan runtime application name #16187

Merged

meta-codesync bot added fb-exported meta-exported labels Dec 10, 2025

JacobSzwejbka reviewed Dec 11, 2025

View reviewed changes

trivedivivek approved these changes Dec 11, 2025

View reviewed changes

SS-JIA changed the base branch from gh/SS-JIA/380/base to main December 11, 2025 19:48

SS-JIA merged commit f168dbf into main Dec 11, 2025
163 of 166 checks passed

SS-JIA deleted the gh/SS-JIA/380/head branch December 11, 2025 19:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ez][ET-VK] Small fix for choose_qparams_affine_impl #16186

[ez][ET-VK] Small fix for choose_qparams_affine_impl #16186

SS-JIA commented Dec 10, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Dec 10, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

JacobSzwejbka Dec 11, 2025

Uh oh!

SS-JIA Dec 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[ez][ET-VK] Small fix for choose_qparams_affine_impl #16186

[ez][ET-VK] Small fix for choose_qparams_affine_impl #16186

Conversation

SS-JIA commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16186

❌ 1 New Failure, 1 Unrelated Failure

Uh oh!

github-actions bot commented Dec 10, 2025

This PR needs a release notes: label

Uh oh!

JacobSzwejbka Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

SS-JIA Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

SS-JIA commented Dec 10, 2025 •

edited

Loading

pytorch-bot bot commented Dec 10, 2025 •

edited

Loading

This PR needs a `release notes:` label

SS-JIA Dec 11, 2025 •

edited

Loading