ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler #17617

ggerganov · 2025-11-30T11:23:05Z

Graph reallocations by the backend scheduler can be expected in various cases when the graph topology becomes different from the one that was used initially after constructing the scheduler. For example, the scheduler of llama_context will likely reallocate when:

Changing LoRAs
Switching from token to embedding input in llama_batch
Attaching different set of samplers (sampling : add support for backend sampling #17004)
etc.

The less expected cases are the ones similar to the case in #17143 where it's more difficult to predict that a reallocation would occur since the graph topology remains the same.

The GGML_SCHED_NO_REALLOC macro is now targeted towards detecting the "unexpected" reallocations.

Also, can now override the reallocation debug behavior via a new environment variable GGML_SCHED_DEBUG_REALLOC:

# abort only on unexpected reallocations (i.e. same as building with -DGGML_SCHED_NO_REALLOC=ON)
GGML_SCHED_DEBUG_REALLOC=1 ...

# abort on all reallocations
GGML_SCHED_DEBUG_REALLOC=2 ...

ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler

43f7063

ggerganov force-pushed the gg/sched-debug-realloc branch from 37be189 to 43f7063 Compare November 30, 2025 11:23

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Nov 30, 2025

danbev approved these changes Dec 1, 2025

View reviewed changes

ggerganov merged commit 90c72a6 into master Dec 1, 2025
72 of 74 checks passed

ggerganov deleted the gg/sched-debug-realloc branch December 1, 2025 10:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler #17617

ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler #17617

Uh oh!

ggerganov commented Nov 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler #17617

ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler #17617

Uh oh!

Conversation

ggerganov commented Nov 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ggerganov commented Nov 30, 2025 •

edited

Loading