Arm backend: Propagate node info from quantizer to backend (re-uploaded) #15778
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15778
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit c557356 with merge base 8c48a76.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from 3fa6e64 to e308913.
@oscarandersson8218 still failing due to Mind adding the buck file change from #15759?
Use the Node meta 'custom' field to propagate information from the quantizer to the partitioner using a new ArmAnnotationInfo data class. This lets us reliably track quantized nodes, which is needed to decide which nodes should 'fold' their quantization parameters and which should be kept in fp when integer and float are mixed in a sub-graph.

Co-authored-by: Per Åstrand <per.astrand@arm.com>
Signed-off-by: Oscar Andersson <oscar.andersson@arm.com>
Change-Id: I398bf52e14d58fce56aa46ace74e45f45050c81b
Signed-off-by: Oscar Andersson <oscar.andersson@arm.com>
Change-Id: I84aa43f880e0ea7bbefb30bf7f31fc0e3b362e5a
Force-pushed from e308913 to c557356.
Hi @SS-JIA, as this PR and the internal Meta PR now differ, this needs a re-import to get the OK to merge. For example, the "Meta Internal-Only Changes Check" now reports: "The Diff and Pull Request are not in sync!" Thanks for your help with this PR (and others).
Tests look good now - feel free to merge!
Use the Node meta 'custom' field to propagate information from the quantizer to the partitioner using a new ArmAnnotationInfo data class. This lets us reliably track quantized nodes, which is needed to decide which nodes should 'fold' their quantization parameters and which should be kept in fp when integer and float are mixed in a sub-graph.
cc @freddan80 @per @zingo @digantdesai
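
For readers unfamiliar with the pattern in the description, here is a rough sketch of how a quantizer could tag a node via `meta["custom"]` and how a later stage could read the tag back. It is not the code from this PR. `FakeNode` is a stand-in for `torch.fx.Node` that only mimics its `meta` dict, and the `quantized` field, the `ARM_INFO_KEY` key name, and both helper functions are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict

# Hypothetical key inside node.meta["custom"]; the key used by the PR is not shown here.
ARM_INFO_KEY = "arm_annotation_info"


@dataclass(frozen=True)
class ArmAnnotationInfo:
    """Illustrative stand-in; the real class's fields are not shown in this thread."""

    quantized: bool = True


@dataclass
class FakeNode:
    """Minimal stand-in for torch.fx.Node: just a name and a meta dict."""

    name: str
    meta: Dict[str, Any] = field(default_factory=dict)


def mark_quantized(node: FakeNode) -> None:
    """Quantizer side (hypothetical helper): record that this node was annotated."""
    custom = node.meta.setdefault("custom", {})
    custom[ARM_INFO_KEY] = ArmAnnotationInfo(quantized=True)


def should_fold_qparams(node: FakeNode) -> bool:
    """Partitioner side (hypothetical helper): fold only nodes the quantizer tagged."""
    info = node.meta.get("custom", {}).get(ARM_INFO_KEY)
    return isinstance(info, ArmAnnotationInfo) and info.quantized


# A mixed sub-graph: conv was annotated by the quantizer, softmax stays in fp.
conv = FakeNode("conv")
softmax = FakeNode("softmax")
mark_quantized(conv)
```

With this setup, `should_fold_qparams(conv)` is true and `should_fold_qparams(softmax)` is false, so untagged nodes are left in floating point by default.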