Fix InsertIOQDQ KeyError for dequantize encodings (#18622)
abhinaykukkadapu wants to merge 1 commit into pytorch:main from
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18622
Note: Links to docs will display an error until the doc builds have been completed.
✅ You can merge normally! (3 unrelated failures) As of commit 7efb4e5 with merge base 8b30cfe:
- FLAKY - The following jobs failed but were likely due to flakiness present on trunk.
- BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from 5cc2b41 to b7f5914
@abhinaykukkadapu has imported this pull request. If you are a Meta employee, you can view this in D98977887.
Moving the discussion from the other PR.
Thanks for taking a look. Right, this task is not a 1-to-1 mapping for the exact issue, but there is a comment on the task that refers to the unavailable dq in the map: #17732 (comment). Also, #17194 got reverted in #17385.
Summary:
q_dq_map only contained quantize ops as keys, so when a node with a dequantize encoding (e.g. a pre-quantized LLM parameter) feeds the output node, the lookup crashes with a KeyError.
Add dequantize ops as keys in q_dq_map, mapping them to the correct dequantize target for output boundary insertion (dq.default -> dq.tensor, matching the existing quantize convention).
Since dequantize targets are now keys, _create_node transfers QCOM_QUANT_ATTRS to inserted dequant nodes. To prevent the live iterator from revisiting these nodes, iterate over a snapshot via list(graph_module.graph.nodes).
Fixes pytorch#17732
Differential Revision: D98977887
Pulled By: abhinaykukkadapu
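The mapping change described in the summary can be sketched as follows. The op names here are simplified strings standing in for the real `exir_ops.edge.quantized_decomposed.*` handles (an assumption for illustration; the actual pass keys `q_dq_map` on op objects):

```python
# Hedged sketch of the q_dq_map extension described in the summary.
# String op names are hypothetical stand-ins for the real
# exir_ops.edge.quantized_decomposed.* handles.

# Before: only quantize ops were keys, mapped to their dequantize twin.
q_dq_map = {
    "quantize_per_tensor.default": "dequantize_per_tensor.tensor",
    "quantize_per_channel.default": "dequantize_per_channel.default",
}

# After: dequantize ops are also keys (dq.default -> dq.tensor, matching
# the existing quantize convention), so a node carrying a dequantize
# encoding that feeds the output node no longer raises KeyError.
q_dq_map.update({
    "dequantize_per_tensor.default": "dequantize_per_tensor.tensor",
    "dequantize_per_channel.default": "dequantize_per_channel.default",
})

def output_boundary_target(node_target: str) -> str:
    # This lookup used to crash with KeyError for dequantize encodings.
    return q_dq_map[node_target]
```

With the extra keys, both a quantize-encoded activation and a dequantize-encoded parameter feeding the output resolve to the same dq.tensor insertion target.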
Force-pushed from b7f5914 to ecae50d
@abhinaykukkadapu has exported this pull request. If you are a Meta employee, you can view the originating Diff in D98977887.
Force-pushed from ecae50d to 14d0dbb
```python
# insert dq before output or fold mix_quantization q if applicable
users = list(n.users.keys())
if n.meta.get(QCOM_QUANT_ATTRS) and any(
```
Feels like we could just add a check like `n.target != exir_ops.edge.quantized_decomposed.dequantize_per_tensor.tensor` for any dequantize nodes that have already been added.
Then we wouldn't need the list approach (which would probably have a smaller memory footprint). But I still wonder if this is really an issue? Looks like the root cause of #17782 is that they tweaked the codebase.
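For context on what the snapshot guards against, here is a toy illustration (a plain Python list standing in for `graph_module.graph.nodes`; the real pass mutates an fx graph, not a list):

```python
# Toy illustration of why the pass iterates over a snapshot:
# inserting nodes while walking a live container can make the
# iterator visit the freshly inserted nodes too.
nodes = ["a", "b", "c"]
visited = []
for n in list(nodes):          # snapshot via list(...)
    visited.append(n)
    nodes.append(n + "_dq")    # simulate inserting a dq node

# Only the original nodes were visited; the inserted "_dq" nodes
# were appended but never revisited by the loop.
assert visited == ["a", "b", "c"]
```

The target-check alternative suggested above would instead skip the inserted nodes when the live iterator reaches them.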
The issue this diff is handling is not just the stale snapshot of the nodes, which is solved by the list. Here is what we want:
- For input, we want to pop the quant attrs and insert a q node.
- For output, we insert a dq node but we don't want to pop the attrs.
For example, a conv target node has q attrs, while a weight node that feeds the output will have dq attrs; we want to insert a dq node for both without popping. I will put up a patch where we can maybe reuse q_ops.
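The input/output asymmetry described above can be sketched like this (the function names and the dict-based node meta are hypothetical simplifications; the real pass works on fx node metadata):

```python
# Hedged sketch of the asymmetry described above. QCOM_QUANT_ATTRS and
# the dict-based meta are simplified stand-ins for the real fx node meta;
# handle_input/handle_output are hypothetical helper names.

QCOM_QUANT_ATTRS = "quant_attrs"

def handle_input(node_meta: dict) -> dict:
    # Inputs: pop the quant attrs and use them for the inserted q node,
    # so the source node no longer carries an encoding afterwards.
    return node_meta.pop(QCOM_QUANT_ATTRS)

def handle_output(node_meta: dict) -> dict:
    # Outputs: read the attrs for the inserted dq node, but leave them
    # in place, so e.g. a conv target (q attrs) or a weight feeding the
    # output (dq attrs) keeps its encoding.
    return node_meta[QCOM_QUANT_ATTRS]
```

Popping on the output path would strip the encoding from nodes such as pre-quantized weights that still need it downstream.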
@haowhsu-quic I'm referring to the comments on this task, which are irrelevant to the task they were made on, but this is the root cause: #17732 (comment)
Force-pushed from 14d0dbb to 7efb4e5
```diff
 )
 meta_val = node.meta["val"]
-if target in self.q_dq_map:
+if target in q_ops:
```
This is to avoid popping the quant attrs on the target node when we are inserting a dq node.