ci: [SYCL] Use main GPU and enable sysman #12547
If you add the keyword |
Ah. It is broken!
It could be related to the shapes of the specific model. Here is how to get and convert the model: Lines 411 to 421 in 48d7021
Lines 435 to 437 in 48d7021
@Alcpz @ggerganov Confirmed: Q4_0 was broken by 08d5986. To quickly test, run llama-cli with this specific model with |
Thanks for pinpointing this. We may want to disable the GGML_SYCL_DISABLE_OPT path by default until it is fixed.
Probably in a separate PR. This PR should be merged IMO (it enables the necessary flags). I will enable SYCL CI on the whisper.cpp and ggml repos tomorrow.
Set environment variables so that the SYCL CI uses only the main GPU for now, and enable SYSMAN for correct GPU memory reporting.
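For reference, a minimal sketch of what such a CI setup could look like. The description does not name the variables, so the two below are assumptions: `ONEAPI_DEVICE_SELECTOR` is the oneAPI runtime's device filter and `ZES_ENABLE_SYSMAN` is the Level Zero switch for the Sysman API. The value `level_zero:0` assumes the main GPU is the first Level Zero device on the CI machine.

```shell
# Assumption: restrict the SYCL runtime to the first Level Zero device,
# treated here as the "main" GPU.
export ONEAPI_DEVICE_SELECTOR="level_zero:0"

# Assumption: enable Level Zero Sysman so the backend can query
# free GPU memory instead of reporting only the total.
export ZES_ENABLE_SYSMAN=1
```

Both variables must be exported before the SYCL binaries start, since the runtime reads them at initialization.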