Vulkan: Fix the memory allocation change in #17110 by 0cc4m · Pull Request #17122 · ggml-org/llama.cpp

0cc4m · 2025-11-09T14:52:00Z

I forgot to exit the second loop early in #17110. This should fix #17117.

Goldenkoron · 2025-11-09T17:12:01Z

On strix halo, trying to load any model on windows with 96gb set in UMA will give a memory buffer error, seemingly no matter the allocation size.

With 64gb set, models will try to load up to 96gb but if the memory actually exceeds 64gb it will give a device out of memory error (even if there is room in Windows shared memory, it never uses it).

0cc4m requested a review from ngxson as a code owner November 9, 2025 14:52

0cc4m changed the title ~~Vulkan: Fix the memory allocation change in~~ Vulkan: Fix the memory allocation change in #17110 Nov 9, 2025

vulkan: fix memory allocations

0d5e75d

0cc4m force-pushed the 0cc4m/vulkan-memory-reporting-fix-fix branch from de6d775 to 0d5e75d Compare November 9, 2025 14:54

0cc4m requested review from jeffbolznv and removed request for ngxson November 9, 2025 14:54

This was referenced Nov 9, 2025

vulkan: iGPU memory reporting fix #17110

Merged

Misc. bug: b6996 need too much memory #17117

Closed

jeffbolznv approved these changes Nov 9, 2025

View reviewed changes

github-actions Bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Nov 9, 2025

0cc4m merged commit 392e09a into master Nov 9, 2025
59 of 63 checks passed

0cc4m deleted the 0cc4m/vulkan-memory-reporting-fix-fix branch November 9, 2025 15:14

0cc4m mentioned this pull request Nov 9, 2025

Eval bug: Vulkan llama.cpp > 64GB Graphics card load bug. #16575

Closed

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

vulkan: fix memory allocations (ggml-org#17122)

0193200

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

vulkan: fix memory allocations (#17122)

701967d

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

vulkan: fix memory allocations (ggml-org#17122)

a7dcbf1

ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026

vulkan: fix memory allocations (ggml-org#17122)

3514a93

my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026

vulkan: fix memory allocations (ggml-org#17122)

a87cbc2

my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026

vulkan: fix memory allocations (ggml-org#17122)

d1874e1

phibya pushed a commit to ziee-ai/llama.cpp that referenced this pull request May 29, 2026

vulkan: fix memory allocations (ggml-org#17122)

c537840

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

vulkan: fix memory allocations (ggml-org#17122)

ec7d978

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vulkan: Fix the memory allocation change in #17110#17122

Vulkan: Fix the memory allocation change in #17110#17122
0cc4m merged 1 commit into
masterfrom
0cc4m/vulkan-memory-reporting-fix-fix

0cc4m commented Nov 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Goldenkoron commented Nov 9, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

0cc4m commented Nov 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Goldenkoron commented Nov 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

0cc4m commented Nov 9, 2025 •

edited

Loading

Goldenkoron commented Nov 9, 2025 •

edited

Loading