Name and Version
vulkan:
version: 6719 (aa4711d)
built with cc (GCC) 15.2.1 20250813 for x86_64-pc-linux-gnu
sycl:
version: 6719 (aa4711d)
built with Intel(R) oneAPI DPC++/C++ Compiler 2025.0.4 (2025.0.4.20241205) for x86_64-unknown-linux-gnu
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
libllama (core library)
Command line
vulkan:
llama-server --threads 12 --prio 2 --ctx-size 12288 --gpu-layers 100 --model xxxx --host 0.0.0.0 --port 9091 --no-webui --props --no-slots
sycl:
ONEAPI_DEVICE_SELECTOR="level_zero:0" ZES_ENABLE_SYSMAN=1 llama-server --threads 12 --prio 2 --ctx-size 12288 --gpu-layers 100 --model xxxx --host 0.0.0.0 --port 9091 --no-webui --props --no-slots
Problem description & steps to reproduce
1:
Compared to Vulkan, SYCL fits less context given the exact same model and parameters:
SYCL can handle ~9200 tokens before crashing with an OOM error, while Vulkan can handle the full 12288 tokens.
CUDA on an NVIDIA card with the same amount of VRAM can also handle the full 12288 tokens.
That's a difference of ~3000 tokens, even though all three runs used 12GB cards with no desktop environment or other GPU-using programs running (a rough way to probe the limit is sketched below).
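For reference, a rough way to probe where the SYCL build falls over, assuming the SYCL server from the command line above is already running on port 9091; the repeated word is only a crude stand-in for a prompt of roughly that many tokens, and /completion is the standard llama-server endpoint:
# build a long throwaway prompt (~9000 repetitions of "word ")
PROMPT=$(printf 'word %.0s' $(seq 1 9000))
# send it with a single predicted token and watch the server log for the OOM crash
curl -s -H "Content-Type: application/json" http://127.0.0.1:9091/completion -d "{\"prompt\": \"$PROMPT\", \"n_predict\": 1}"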
2:
"ext_intel_free_memory is not supported" is printed about four times, suggesting to set ZES_ENABLE_SYSMAN, even though ZES_ENABLE_SYSMAN=1 is already set (a quick check is sketched below).
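A quick double-check, assuming log-sycl.txt (attached below) is the captured server output from the SYCL run above:
# confirm the variable really reaches the environment the server is started with
ZES_ENABLE_SYSMAN=1 env | grep ZES_ENABLE_SYSMAN
# count how often the warning appears in the attached SYCL log
grep -c "ext_intel_free_memory is not supported" log-sycl.txt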
log-vulkan.txt
log-sycl.txt
build-commands.txt
os.txt
hw.txt
First Bad Commit
No response
Relevant log output
logs attached as files above