ggml : use svcntb() for SVE vector length detection #17474

angt · 2025-11-24T15:16:58Z

This allows SVE on FreeBSD and others

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt · 2025-11-24T16:30:49Z

I dont know why this code exists in its current form, I guess it is mostly historical/legacy.
However, afaik runtime detection does not work on aarch64, and the current implementation fails to compile on FreeBSD.

angt · 2025-11-25T09:52:27Z

I did some git archaeology this morning to understand why we use prctl, and to be honest, I’m even more confused than yesterday 😵‍💫 (starting from #8709 as a reference).

Anyway, I reopen the PR so we can at least discuss it and move things forward

angt · 2025-12-02T08:49:47Z

@ggerganov do you have any insights on why the code is currently in this state? I'm having trouble understanding how we got there, and right now llama.cpp just doesn’t build on many OS.

ggerganov · 2025-12-02T08:56:58Z

ggml/src/ggml-cpu/ggml-cpu.c

 int ggml_cpu_get_sve_cnt(void) {
 #if defined(__ARM_ARCH) && defined(__ARM_FEATURE_SVE)
-    return ggml_arm_arch_features.sve_cnt;
+    return svcntb();
 #else
    return 0;
 #endif


This is a system call that we want to avoid in hot loops. See the discussion starting here: #8709 (comment)

@angt Does this answer the question? I am not very familiar with the Arm feature set logic. Probably it can be improved/extended. I believe the main difficulty is that Arm hardware is not very ubiquitous among regular users (apart from Macs).

Yes, I saw the PR (I reopened this PR after reading it). svcntb() is not a syscall it's a SVE instruction (cntb) and I would be very surprised if a global var read were faster.

But more importantly, it’s already used in several hot paths without any issues:

ggml_vec_dot_f16:

llama.cpp/ggml/src/ggml-cpu/vec.cpp

Line 219 in 4574f29

const int sve_register_length = svcntb() * 8; //get vector length

ggml_vec_dot_q2_K_q8_K:

llama.cpp/ggml/src/ggml-cpu/arch/arm/quants.c

Line 1431 in 682e665

const int vector_length = svcntb()*8;

ggml_vec_dot_q3_K_q8_K:

llama.cpp/ggml/src/ggml-cpu/arch/arm/quants.c

Line 1773 in 4574f29

const int vector_length = svcntb()*8;

And no one wanted to replace it so far.

@ggerganov If I remove the last commit to keep the global var, would this be mergeable for you?

Sounds good. Even if it not a system call, the person there measured a significant difference, so unless someone else measures another result, I would go with that information. These svcntb() calls should be replaced - probably were overlooked.

Done, let's go the safe way, just replacing prctl() for svcntb() to set the global var.

ggml : use svcntb() for SVE vector length detection

115f0b1

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt requested review from ggerganov and slaren as code owners November 24, 2025 15:16

loci-dev mentioned this pull request Nov 24, 2025

UPSTREAM PR #17474: ggml : use svcntb() for SVE vector length detection auroralabs-loci/llama.cpp#307

Open

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Nov 24, 2025

angt closed this Nov 24, 2025

angt reopened this Nov 25, 2025

angt mentioned this pull request Dec 2, 2025

ggml-cpu: BMI2 is only available on amd64 #17528

Open

ggerganov reviewed Dec 2, 2025

View reviewed changes

angt force-pushed the ggml-use-svcntb-for-sve-vector-length-detection branch from 502cc78 to 115f0b1 Compare December 2, 2025 10:04

ggerganov approved these changes Dec 2, 2025

View reviewed changes

ggerganov merged commit e148380 into ggml-org:master Dec 2, 2025
70 of 140 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml : use svcntb() for SVE vector length detection #17474

ggml : use svcntb() for SVE vector length detection #17474

Uh oh!

angt commented Nov 24, 2025

Uh oh!

angt commented Nov 24, 2025

Uh oh!

angt commented Nov 25, 2025

Uh oh!

angt commented Dec 2, 2025

Uh oh!

ggerganov Dec 2, 2025

Uh oh!

angt Dec 2, 2025

Uh oh!

angt Dec 2, 2025

Uh oh!

ggerganov Dec 2, 2025

Uh oh!

angt Dec 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ggml : use svcntb() for SVE vector length detection #17474

ggml : use svcntb() for SVE vector length detection #17474

Uh oh!

Conversation

angt commented Nov 24, 2025

Uh oh!

angt commented Nov 24, 2025

Uh oh!

angt commented Nov 25, 2025

Uh oh!

angt commented Dec 2, 2025

Uh oh!

ggerganov Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

angt Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

angt Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

ggerganov Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

angt Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants