-
Notifications
You must be signed in to change notification settings - Fork 747
NXP Backend: Add pass to remove unnecessary Quantize/Dequantize nodes. #15148
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NXP Backend: Add pass to remove unnecessary Quantize/Dequantize nodes. #15148
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15148
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (1 Unrelated Failure)As of commit b2831c3 with merge base 3b1aeda ( FLAKY - The following job failed but was likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
aa651f1 to
972ad89
Compare
|
@pytorchbot label "module: nxp" "release notes: nxp" |
972ad89 to
66e43e8
Compare
66e43e8 to
b093129
Compare
backends/nxp/tests/test_turning_batch_first_gru_to_time_major.py
Outdated
Show resolved
Hide resolved
d5ba591 to
e529f83
Compare
e529f83 to
b276902
Compare
|
Update the Summary, the pass has different intention: This PR adds an edge dialect pre-processing pass to remove some Q/DQ nodes. This enables some non-delegated nodes (which run on the CPU) to run in directly in int8 and avoid the QDQ compute overhead. This improves the inference speed (by eliminating the need to artificially quantize and de-quantize input and output values. |
b276902 to
b2831c3
Compare
Summary
This PR adds an edge dialect pre-processing pass to remove some Q/DQ nodes. This enables some non-delegated nodes (which run on the CPU) to run in directly in int8 and avoid the QDQ compute overhead. This improves the inference speed (by eliminating the need to artificially quantize and de-quantize input and output values.
Test plan
Unit tests provided.
cc @robert-kalmar