what exactly are htp_quantsim_config_v#.json?
#4000
Unanswered
JerryLee (jl749)
asked this question in
Q&A
Replies: 1 comment
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment

Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello AIMET team
When I am working with
QuantsimModelI see that we are passingconfig_fileIf I understand correctly config jsons under
aimet_commonspecifies QDQ placements/schemes for each Hexagon Arch.When I export v75 compatible QNN encoding + ONNX file following llama3.2-3b deployment example under
qaihub-modelIt produces an encoding file containing
per-channel-quantizedMatMuloutput-quantizedGatherWhich contradicts
htp_quantsim_config_v75.jsonbecause
https://github.com/quic/aimet/blob/22d47e49556168dbebdd29f3a4325ddbc512701e/TrainingExtensions/common/src/python/aimet_common/quantsim_config/htp_quantsim_config_v75.json#L129-L132
https://github.com/quic/aimet/blob/22d47e49556168dbebdd29f3a4325ddbc512701e/TrainingExtensions/common/src/python/aimet_common/quantsim_config/htp_quantsim_config_v75.json#L81-L85
qaihub-model's pre-computed encodings were generated by AIMET.Is my understanding of config.json wrong?
Any guidance you can provide me would be appreciated.
Beta Was this translation helpful? Give feedback.
All reactions