
ci: [SYCL] Use main GPU and enable sysman #12547

Merged
merged 1 commit into master on Mar 24, 2025
Conversation

qnixsynapse
Collaborator

Set environment variables to use the main GPU for now and enable SYSMAN for correct GPU memory reporting.
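For context, a minimal sketch of what such a CI script change might look like (the variable names and values here are assumptions for illustration, not the merged diff; ZES_ENABLE_SYSMAN is the Level Zero sysman switch, and ONEAPI_DEVICE_SELECTOR restricts the run to a single device):

# Hypothetical excerpt; names/values are assumptions, not the actual change
export ONEAPI_DEVICE_SELECTOR="level_zero:0"   # run only on the first (main) GPU
export ZES_ENABLE_SYSMAN=1                     # enable Level Zero Sysman for correct GPU memory reporting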

@qnixsynapse qnixsynapse requested a review from ggerganov as a code owner March 24, 2025 13:34
@github-actions github-actions bot added the devops label (improvements to build systems and github actions) on Mar 24, 2025
@qnixsynapse qnixsynapse changed the title ci: Use main GPU and enable sysman ci: [SYCL] Use main GPU and enable sysman Mar 24, 2025
@ggerganov
Member

If you add the keyword ggml-ci somewhere in the commit message, it will trigger the CI on this branch, so we can see if it works before merging.
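For example, an amended commit message could look like this (the subject line is just the PR title; only the ggml-ci keyword matters):

git commit --amend -m "ci: [SYCL] Use main GPU and enable sysman" -m "ggml-ci"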

@qnixsynapse
Collaborator Author

Ah. It is broken!

@ggerganov
Member

It could be related to the shapes of the specific model. Here is how to get and convert the model:

llama.cpp/ci/run.sh

Lines 411 to 421 in 48d7021

# pythia_1.4b
function gg_run_pythia_1_4b {
    cd ${SRC}
    gg_wget models-mnt/pythia/1.4B/ https://huggingface.co/EleutherAI/pythia-1.4b/raw/main/config.json
    gg_wget models-mnt/pythia/1.4B/ https://huggingface.co/EleutherAI/pythia-1.4b/raw/main/tokenizer.json
    gg_wget models-mnt/pythia/1.4B/ https://huggingface.co/EleutherAI/pythia-1.4b/raw/main/tokenizer_config.json
    gg_wget models-mnt/pythia/1.4B/ https://huggingface.co/EleutherAI/pythia-1.4b/raw/main/special_tokens_map.json
    gg_wget models-mnt/pythia/1.4B/ https://huggingface.co/EleutherAI/pythia-1.4b/resolve/main/pytorch_model.bin

llama.cpp/ci/run.sh

Lines 435 to 437 in 48d7021

python3 ../convert_hf_to_gguf.py ${path_models} --outfile ${path_models}/ggml-model-f16.gguf

@qnixsynapse
Collaborator Author

qnixsynapse commented Mar 24, 2025

@Alcpz @ggerganov Confirmed Q4_0 broken by 08d5986

To quickly test, run llama-cli on this specific model with GGML_SYCL_DISABLE_OPT=1.
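For reference, a quick check might look like the following (the model path and Q4_0 file name are assumptions based on the ci/run.sh layout above, not a command taken from this thread):

# Hypothetical reproduction; compare output with and without the override
GGML_SYCL_DISABLE_OPT=1 ./build/bin/llama-cli \
    -m models-mnt/pythia/1.4B/ggml-model-q4_0.gguf \
    -p "Hello, my name is" -n 32 -ngl 99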

@Alcpz
Collaborator

Alcpz commented Mar 24, 2025

Thanks for pinpointing this. We may want to disable the GGML_SYCL_DISABLE_OPT path by default until it is fixed.
@qnixsynapse It's probably worth starting a discussion in the SYCL discussion thread so everyone using the backend is aware.

@qnixsynapse
Collaborator Author

> We may want to disable the GGML_SYCL_DISABLE_OPT path by default until it is fixed.

Probably in a separate PR. This PR should be merged IMO (it enables the necessary flags).

I will enable SYCL CI on whisper.cpp and ggml repos tomorrow.

@ggerganov ggerganov merged commit c95fa36 into master Mar 24, 2025
10 checks passed
@qnixsynapse qnixsynapse deleted the sycl/ci branch March 25, 2025 02:14