@mengniwang95
Contributor

Signed-off-by: mengniwa <mengni.wang@intel.com>

Type of Change

example update

How has this PR been tested?

extension test

@mengniwang95 marked this pull request as ready for review December 1, 2022 10:35
@mengniwang95 changed the title from "Update example for new API" to "Update onnx example for new API" Dec 1, 2022
@mengniwang95
Contributor Author

@chensuyue pls add extension test

@chensuyue
Contributor

extension test

@chensuyue
Contributor

chensuyue commented Dec 16, 2022

extension test

@mengniwang95 is quant_format not well supported?

Signed-off-by: Mengni Wang <mengni.wang@intel.com>
@chensuyue
Contributor

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3836/

Please fix the perf gap, the tuning issue, and the val interface.

Signed-off-by: Mengni Wang <mengni.wang@intel.com>
@chensuyue merged commit 20559d2 into master Dec 29, 2022
@chensuyue deleted the mengni/example branch December 29, 2022 09:01
VincyZhang pushed a commit that referenced this pull request Feb 12, 2023
…190)

* enable oneDNN's binaryadd op's broadcast optimization
yiliu30 pushed a commit that referenced this pull request Apr 30, 2025
… of W4A16 scheme (#190)

* Revert "Revert "fp8 aware gptq  (hybrid gptq) and fix performance drop in gptq test (SW-223441)"

This reverts commit ba9475d.

* addressing reviewer comments

* Temporarily disable rel_err test until fixed

* fixed pytest error

---------

Co-authored-by: Asaf Karnieli <akarnieli@habana.ai>
Co-authored-by: Mariusz Okroj <mariusz.okroj@intel.com>
Co-authored-by: Linoy Buchnik <linoybu@gmail.com>
xin3he pushed a commit that referenced this pull request Jul 15, 2025
… of W4A16 scheme (#190)

Signed-off-by: Xin He <xinhe3@habana.ai>
XuehaoSun pushed a commit that referenced this pull request Jul 19, 2025
… of W4A16 scheme (#190)
