Sync master with upstream release b7951 by jan-service-account · Pull Request #415 · janhq/llama.cpp

jan-service-account · 2026-02-06T00:44:33Z

Updates dev branch with latest release (b7951) from ggml-org/llama.cpp

* add missing llama_add_compile_flags * disable all warnings for ssl, crypto and fipsmodule

* vulkan: fix GPU deduplication logic. As reported in ggml-org#19221, the (same uuid, same driver) logic is problematic for windows+intel igpu. Let's just avoid filtering for MoltenVK which is apple-specific, and keep the logic the same as before 88d23ad - just dedup based on UUID. Verified that MacOS + 4xVega still reports 4 GPUs with this version. * vulkan: only skip dedup when both drivers are moltenVk

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* bench : update script * benches : update numbers

…org#19281) Write out a 2-bit code per block and avoid loading the mask when it matches these two common cases. Apply this optimization when the mask is relatively large (i.e. prompt processing).

…g#19369)

CISC and others added 11 commits February 5, 2026 02:27

vendor : add missing llama_add_compile_flags (ggml-org#19322)

11fb327

* add missing llama_add_compile_flags * disable all warnings for ssl, crypto and fipsmodule

metal : add missing includes (ggml-org#19348)

af252d0

vulkan: fix non-contig rope (ggml-org#19299)

c342c3b

vulkan: Set k_load_shmem to false when K is too large (ggml-org#19301)

3409ab8

metal : add diag (ggml-org#19330)

7a4f97d

vendor : update BoringSSL to 0.20260204.0 (ggml-org#19333)

a4ea7a1

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

docker : fix vulkan build (ggml-org#19352)

b828e18

benches : update models + numbers (ggml-org#19359)

3795cc1

* bench : update script * benches : update numbers

vulkan: Preprocess FA mask to detect all-neg-inf and all-zero. (ggml-…

449ec2a

…org#19281) Write out a 2-bit code per block and avoid loading the mask when it matches these two common cases. Apply this optimization when the mask is relatively large (i.e. prompt processing).

metal : adaptive CPU/GPU interleave based on number of nodes (ggml-or…

22cae83

…g#19369)

jan-service-account merged commit 9b083a5 into dev Feb 6, 2026
3 checks passed

jan-service-account deleted the update-dev-from-master-2026-02-06-00-44 branch February 6, 2026 00:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync master with upstream release b7951#415

Sync master with upstream release b7951#415
jan-service-account merged 11 commits intodevfrom
update-dev-from-master-2026-02-06-00-44

jan-service-account commented Feb 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

jan-service-account commented Feb 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants