-
Notifications
You must be signed in to change notification settings - Fork 283
Update onnx example for new API #190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
6e2024c to
e5903fc
Compare
|
@chensuyue pls add extension test |
|
@mengniwang95 quant_format not well supported? |
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
f8400e5 to
b1142fc
Compare
Please fix the perf gap, tuning issue, val interface. |
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3925/ densenet acc drop is caused by removing 'ENABLE_BASIC' |
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
…190) * enable oneDNN's binaryadd op's broadcast optimization
… of W4A16 scheme (#190) * Revert "Revert "fp8 aware gptq (hybrid gptq) and fix performance drop in gptq test (SW-223441)" This reverts commit ba9475d. * addressing reviewer comments * Temporarily disable rel_err test until fixed * fixed pytest error --------- Co-authored-by: Asaf Karnieli <akarnieli@habana.ai> Co-authored-by: Mariusz Okroj <mariusz.okroj@intel.com> Co-authored-by: Linoy Buchnik <linoybu@gmail.com>
… of W4A16 scheme (#190) * Revert "Revert "fp8 aware gptq (hybrid gptq) and fix performance drop in gptq test (SW-223441)" This reverts commit ba9475d. * addressing reviewer comments * Temporarily disable rel_err test until fixed * fixed pytest error --------- Co-authored-by: Asaf Karnieli <akarnieli@habana.ai> Co-authored-by: Mariusz Okroj <mariusz.okroj@intel.com> Co-authored-by: Linoy Buchnik <linoybu@gmail.com> Signed-off-by: Xin He <xinhe3@habana.ai>
… of W4A16 scheme (#190) * Revert "Revert "fp8 aware gptq (hybrid gptq) and fix performance drop in gptq test (SW-223441)" This reverts commit ba9475d. * addressing reviewer comments * Temporarily disable rel_err test until fixed * fixed pytest error --------- Co-authored-by: Asaf Karnieli <akarnieli@habana.ai> Co-authored-by: Mariusz Okroj <mariusz.okroj@intel.com> Co-authored-by: Linoy Buchnik <linoybu@gmail.com> Signed-off-by: Xin He <xinhe3@habana.ai>
Signed-off-by: mengniwa mengni.wang@intel.com
Type of Change
example update
How has this PR been tested?
extension test