ONNX Runtime QNN Execution Provider v2.2.0
·
1 commit
to rel/ort-qnn-ep/2.2.0
since this release
This release delivers operator coverage improvements, multi-NPU device selection, and build fixes.
ONNX Runtime Compatibility: >= 1.24.1 (compiled with v1.24.4)
QAIRT SDK Compatibility: 2.46.0
pip install onnxruntime==1.24.4
pip install onnxruntime-qnn==2.2.0
Bug Fixes
- QNN EP: Fixed
GlobalMaxPool/GlobalAveragePoolfalsely claiming rank-3 support; unified the 3D→4D reshape path with windowed pool ops. (#201) - QNN EP: Restored Genie builds against QAIRT SDKs older than 2.45.0 by keying conditional compilation off the Genie API version (
GenieDlc.hbreaking change). (#225) - QNN EP: Fixed GCC 13 build failures: corrected
memory_order_acq_relonstd::atomic::store()tomemory_order_release, and suppressed a false-positive-Wmaybe-uninitializedinTestInputDef. (#228) - QNN EP: Fixed HNRD Model Compatibility checks incorrectly running on x86 platforms where they don't apply. (#319)
Improvements
- QNN EP: Relaxed QDQ BatchNormalization selector to accept BN nodes with 2 dequantized inputs (instead of requiring 3), matching the common pattern where
bias/mean/variancestay as float initializers. Reduces CPU fallback and graph fragmentation. (#209) - QNN EP: NPU device selection now supports HTP cores with non-zero device IDs. (#215)
Known Issues
- WoS AMD64 — Python 3.11 installer issue causes inference failure — On Windows on Snapdragon,
ep.get_library_path()returns theamd64folder path instead ofarm64ec, causing inference to fail in the AMD64 Python 3.11 environment, due to a known issue with the installer. As a workaround, manually construct the path to thearm64eclibrary. This issue affects Python 3.11 only.
Platform Support
| Package | Windows ARM64 | Windows x64 | Linux ARM64 |
|---|---|---|---|
| Python Wheel | Inference | AOT compilation + Inference | Inference |
| NuGet | Inference | — | — |
| ZIP | Inference | — | — |
| tgz | — | — | Inference |
Full Changelog: rel-2.1.0...rel/ort-qnn-ep/2.2.0
Contributors
This release includes contributions from:
Arnav Deshpande, Ashwath Shankarnarayan, Badri Narayanan, Calvin Nguyen, Cheng-Hsin Weng, Chun-Chih Teng, Hua-Yu Chou, Hung-Jui Wang, Jeff Kilpatrick, Kuan-Yu Lin, Kyle Romero, Matthew Sinclair, Mike Hsu, Min Fong Hong, Shubham Patel, Tirupathi Reddy T, Yathindra Kota, Yuduo Wu, Yu-Hung Chuang