🚀 The feature, motivation and pitch
qnn sdk can address the GPU as well as NPU(per the docs)
Currently If i want to use both the GPU and NPU on a snapdragon chip, I would need to build my own arr with both the vulcan back-end and the QNN back-end, same with model quantitation.
Ideally the QNN must enable usage with both the GPU and the NPU
Alternatives
No response
Additional context
No response
RFC (Optional)
No response
cc @cccclai @cbilgin @abhinaykukkadapu
🚀 The feature, motivation and pitch
qnn sdk can address the GPU as well as NPU(per the docs)
Currently If i want to use both the GPU and NPU on a snapdragon chip, I would need to build my own arr with both the vulcan back-end and the QNN back-end, same with model quantitation.
Ideally the QNN must enable usage with both the GPU and the NPU
Alternatives
No response
Additional context
No response
RFC (Optional)
No response
cc @cccclai @cbilgin @abhinaykukkadapu