Release b9860 · ggml-org/llama.cpp

llama : add llama_model_ftype_name() (#25134)

llama : add llama_model_ftype_name()

Expose the model file type (quantization) name, e.g. "Q8_0" or
"Q4_K - Medium", through a new public C API. The returned pointer is
valid for the lifetime of the model and nullptr when the model is
invalid or the file type is unknown.

Signed-off-by: Adrien Gallouët angt@huggingface.co

Export enum

Signed-off-by: Adrien Gallouët angt@huggingface.co

s/llama_model_ftype_name/llama_ftype_name/

Signed-off-by: Adrien Gallouët angt@huggingface.co

Move "(guessed)" to the front in llama_ftype_name

Prepend the "(guessed)" label instead of appending it. This allows removing
the non-thread-safe static std::string, making the function allocation-free.

Signed-off-by: Adrien Gallouët angt@huggingface.co

Add LLAMA_FTYPE_PREFIX

Signed-off-by: Adrien Gallouët angt@huggingface.co

Dont check for model

Signed-off-by: Adrien Gallouët angt@huggingface.co

macOS/iOS:

macOS Apple Silicon (arm64)
macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED
macOS Intel (x64)
iOS XCFramework

Linux:

Android:

Android arm64 (CPU)

Windows:

openEuler:

DISABLED
openEuler x86 (310p)
openEuler x86 (910b, ACL Graph)
openEuler aarch64 (310p)
openEuler aarch64 (910b, ACL Graph)

UI:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b9860

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!