Qualcomm AI Engine Direct - gpu support part1 #12165

haowhsu-quic · 2025-07-02T14:45:46Z

Summary

rename folders in backends/qualcomm/runtime/backends
add gpu infra

Test plan

python backends/qualcomm/tests/test_qnn_delegate.py TestQNNFloatingPointOperator.test_qnn_backend_conv2d -b build-android/ -m SM8750 -s 5f396958 --online_prepare --backend gpu

pytorch-bot · 2025-07-02T14:45:49Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12165

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 5a7b62b with merge base 1fe59c8 ():

NEW FAILURE - The following job has failed:

pull / test-static-llama-qnn-linux (stories_260k_bc) / linux-job (gh)
RuntimeError: Command docker exec -t d084affc2af0048404b315e23131ec9d06dd9a48c32744f3a9a8883327b2c1e6 /exec failed with exit code 1

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / android / run-emulator (gh) (#16137)
Timeout waiting for emulator to boot.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

haowhsu-quic · 2025-07-02T14:46:05Z

@pytorchbot label "release notes: qualcomm"

cccclai · 2025-07-04T00:32:00Z

backends/qualcomm/runtime/backends/QnnBackendFactory.h

-#include <executorch/backends/qualcomm/runtime/backends/htpbackend/HtpContext.h>
-#include <executorch/backends/qualcomm/runtime/backends/htpbackend/HtpDevice.h>
-#include <executorch/backends/qualcomm/runtime/backends/htpbackend/HtpGraph.h>
+#include <executorch/backends/qualcomm/runtime/backends/gpu/GpuBackend.h>


I'm slightly worried about the runtime size increase, that usually is a requirement for production. Do we know how much size increase with this PR? If I have a model runs on HTP only, can the runtime include HTP only?

The libqnn_executorch_backend.so grows from 630984 to 652672 bytes. We'll deprecate few files in next PR, hopefully it could further reduce the number.

What files will be deprecated in next PR?

I think it will be aot/ir and runtime/backend/CustomProtocol*. We now switch to QNN IR backend (DLC) for online-prepare path, the qcir and the legacy code for multi-method compilation can be fully deprecated.
But it would break backward compatibility since we used to wrap preprocess result with custom protocol. Probably will let you to decide when will be the right time to apply the change.

Hi, I was thinking wrong about the impact of deprecating files. We still need to keep the custom protocol implementation to make multi-graph path work.
The change is in #12583 now and will guarantee BC.

cccclai · 2025-07-11T20:55:52Z

Sorry I need to spend a bit more time on this, because we don't have CI to test the pllm model and I'm worried it will cause breakage

haowhsu-quic · 2025-07-12T01:09:13Z

Sorry I need to spend a bit more time on this, because we don't have CI to test the pllm model and I'm worried it will cause breakage

No worries, I think GA decoder models is way more important than this. This PR is mainly a proof of concept that we can extend the capability of QNN backend.

cccclai · 2025-07-18T00:17:01Z

Can we prioritize the stories.pte as part of CI to prevent BC breakage? Otherwise it's hard to catch failure

github-actions · 2025-09-16T00:52:16Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

cccclai · 2025-09-16T04:28:47Z

We should be good to continue this PR, what do you think?

github-actions · 2025-11-16T00:51:21Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

cccclai · 2025-11-16T01:47:12Z

Should we rebase this PR are land it?

haowhsu-quic · 2025-11-16T15:45:03Z

Should we rebase this PR are land it?

We're doing some final checking and will submit PR next week. We will also verify all the enabled models and try to support them with GPU after this PR.

meta-codesync · 2025-11-26T18:07:42Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this in D87936306.

cccclai · 2025-12-04T02:13:16Z

I'm getting the error message

executorch/backends/qualcomm/tests/utils.py", line 222, in get_backend_type
return getattr(QnnExecuTorchBackendType, f"k{self.backend.title()}Backend")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: type object 'QnnExecuTorchBackendType' has no attribute 'kBackend'. Did you mean: 'kDspBackend'?

Can you help fixing it?

chenweng-quic · 2025-12-04T07:31:40Z

I'm getting the error message

executorch/backends/qualcomm/tests/utils.py", line 222, in get_backend_type
return getattr(QnnExecuTorchBackendType, f"k{self.backend.title()}Backend")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: type object 'QnnExecuTorchBackendType' has no attribute 'kBackend'. Did you mean: 'kDspBackend'?

Can you help fixing it?

Hi @cccclai,
Could you provide the command to reproduce the issue?
The self.backend supposed to not become an empty string and cause the error per my understanding.

cccclai · 2025-12-07T20:27:14Z

Sorry was out for 3 days last week, seems like you push new commits, let me check again

- rename folders in backends/qualcomm/runtime/backends - add gpu infra

cccclai · 2025-12-09T21:14:35Z

It seems like an internal test failing, I will forward fix

haowhsu-quic requested review from cccclai, kirklandsign and larryliu0820 as code owners July 2, 2025 14:45

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 2, 2025

pytorch-bot bot added the release notes: qualcomm Changes to the Qualcomm backend delegate label Jul 2, 2025

haowhsu-quic mentioned this pull request Jul 2, 2025

QNN GPU or DSP backend issue #5914

Open

cccclai reviewed Jul 4, 2025

View reviewed changes

github-actions bot added the stale PRs inactive for over 60 days label Sep 16, 2025

chenweng-quic force-pushed the dev_gpu_infra branch from f21b2b8 to a87ddf2 Compare November 26, 2025 05:18

chenweng-quic removed the stale PRs inactive for over 60 days label Nov 28, 2025

chenweng-quic force-pushed the dev_gpu_infra branch 2 times, most recently from 8f2b82e to 5e88803 Compare December 2, 2025 06:06

chenweng-quic requested a review from cccclai December 3, 2025 05:47

chenweng-quic force-pushed the dev_gpu_infra branch from 79baee9 to 7ce2b7a Compare December 7, 2025 16:54

haowhsu-quic and others added 2 commits December 9, 2025 10:36

Qualcomm AI Engine Direct - gpu support part1

24983dd

- rename folders in backends/qualcomm/runtime/backends - add gpu infra

Update utils.py

bca2372

chenweng-quic added 2 commits December 9, 2025 10:36

fix lint

73a2df1

fix CI

5a7b62b

chenweng-quic force-pushed the dev_gpu_infra branch from 7ce2b7a to 5a7b62b Compare December 9, 2025 02:36

cccclai approved these changes Dec 9, 2025

View reviewed changes

cccclai merged commit d39d64b into pytorch:main Dec 9, 2025
139 of 143 checks passed

Qualcomm AI Engine Direct - gpu support part1 #12165

Qualcomm AI Engine Direct - gpu support part1 #12165

Uh oh!

Conversation

haowhsu-quic commented Jul 2, 2025

Summary

Test plan

Uh oh!

pytorch-bot bot commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12165

❌ 1 New Failure, 1 Unrelated Failure

Uh oh!

haowhsu-quic commented Jul 2, 2025

Uh oh!

cccclai Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

haowhsu-quic Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

cccclai Jul 6, 2025

Choose a reason for hiding this comment

Uh oh!

haowhsu-quic Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

haowhsu-quic Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

cccclai commented Jul 11, 2025

Uh oh!

haowhsu-quic commented Jul 12, 2025

Uh oh!

cccclai commented Jul 18, 2025

Uh oh!

github-actions bot commented Sep 16, 2025

Uh oh!

cccclai commented Sep 16, 2025

Uh oh!

github-actions bot commented Nov 16, 2025

Uh oh!

cccclai commented Nov 16, 2025

Uh oh!

haowhsu-quic commented Nov 16, 2025

Uh oh!

meta-codesync bot commented Nov 26, 2025

Uh oh!

cccclai commented Dec 4, 2025

Uh oh!

chenweng-quic commented Dec 4, 2025

Uh oh!

cccclai commented Dec 7, 2025

Uh oh!

cccclai commented Dec 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Jul 2, 2025 •

edited

Loading