Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Misc. bug: performance drop with 2x SYCL GPUs #12575

Open
ky438 opened this issue Mar 25, 2025 · 4 comments
Open

Misc. bug: performance drop with 2x SYCL GPUs #12575

ky438 opened this issue Mar 25, 2025 · 4 comments

Comments

@ky438
Copy link

ky438 commented Mar 25, 2025

Name and Version

version: 4956 (e2f56017)
built with Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) for x86_64-unknown-linux-gnu

Operating systems

Linux

Which llama.cpp modules do you know to be affected?

llama-bench

Command line

bin/llama-bench -m models/llama-2-7b.Q4_0.gguf -mmp 0
bin/llama-bench -m models/llama-2-7b.Q4_0.gguf -mmp 0 -sm none

Problem description & steps to reproduce

I notice that performance drops drastically, and variance explodes, if two Intel Arc B580 GPUs are used instead of one:

2x GPUs:

| model                          |       size |     params | backend    | ngl | mmap |          test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ---: | ------------: | -------------------: |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | SYCL       |  99 |    0 |         pp512 |      2114.89 ± 19.10 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | SYCL       |  99 |    0 |         tg128 |        18.81 ± 13.86 |

1x GPU with -sm none

| model                          |       size |     params | backend    | ngl |    sm | mmap |          test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ----: | ---: | ------------: | -------------------: |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | SYCL       |  99 |  none |    0 |         pp512 |       2233.09 ± 2.90 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | SYCL       |  99 |  none |    0 |         tg128 |         41.76 ± 0.04 |

Why is this?

First Bad Commit

No response

Relevant log output

@NeoZhangJianyu
Copy link
Collaborator

could you share the whole log?

@NeoZhangJianyu
Copy link
Collaborator

@ky438
I can't see any profile info of this github account.
I see some several comments of different PRs/issues created by this account in same day.

Could you share background of this issue?

@ky438
Copy link
Author

ky438 commented Mar 26, 2025 via email

@NeoZhangJianyu
Copy link
Collaborator

OK! I think you should provide whole log of this issue.
So that we could help you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants