-
Notifications
You must be signed in to change notification settings - Fork 13.5k
opencl: initial q8_0 mv support
#15732
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
ggml/src/ggml-opencl/ggml-opencl.cpp
Outdated
| extra = new ggml_tensor_extra_cl_q8_0(); | ||
| } else { | ||
| extra = temp_tensor_extras_q8_0.back(); | ||
| temp_tensor_extras_q4_0.pop_back(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should be temp_tensor_extras_q8_0.pop_back().
9e07767 to
066b21d
Compare
|
@max-krasnyansky @rmatif I am going to merge this to move things forward. We can iterate on this if needed. |
Sorry for the delay. I’ve been pretty busy. I’ll look into this (and the others) today |
|
|
|
@lhez I'll let you merge the PR according to the new guidelines |
|
@rmatif Thank you for taking a look! |
This PR adds initial
q8_0support.