-
Notifications
You must be signed in to change notification settings - Fork 11.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
llama-vocab : add SuperBPE pre-tokenizer
model
Model specific
python
python script changes
#12532
opened Mar 23, 2025 by
compilade
Loading…
ggml : riscv: add 128-bit RVV support
ggml
changes relating to the ggml tensor library for machine learning
#12530
opened Mar 23, 2025 by
xctan
Loading…
vulkan: fix mul_mat_vec failure in backend tests
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12529
opened Mar 23, 2025 by
jeffbolznv
Loading…
cmake: fix ccache conflict
ggml
changes relating to the ggml tensor library for machine learning
#12522
opened Mar 23, 2025 by
BusyJay
Loading…
llama-map to support hugepage feature of pagesize 2M or 1G which can …
#12521
opened Mar 23, 2025 by
nickhuang99
Loading…
perplexity: Add option to ignore context window overflow errors and continue score calculation
examples
#12512
opened Mar 22, 2025 by
EAddario
Loading…
quantize: Handle user-defined quantization levels for additional tensors
examples
#12511
opened Mar 22, 2025 by
EAddario
Loading…
cmake: Allow to configure GGML_BUILD_NUMBER with file
ggml
changes relating to the ggml tensor library for machine learning
opencl: simplify kernel embedding logic in CMakeLists.txt
ggml
changes relating to the ggml tensor library for machine learning
#12503
opened Mar 21, 2025 by
lhez
Loading…
rpc : send hash when tensor data is above some fixed threshold
examples
ggml
changes relating to the ggml tensor library for machine learning
#12496
opened Mar 21, 2025 by
rgerganov
Loading…
llamafile : ppc64le MMA implementation for Q4_0.
ggml
changes relating to the ggml tensor library for machine learning
#12489
opened Mar 21, 2025 by
amritahs-ibm
Loading…
Evenly and stably pinning thread pool
ggml
changes relating to the ggml tensor library for machine learning
Metal TQ2_0
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12485
opened Mar 20, 2025 by
dmahurin
Loading…
[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588
ggml
changes relating to the ggml tensor library for machine learning
#12459
opened Mar 19, 2025 by
Corsair-cxs
Loading…
Add PLM GGUF Conversion & Inference Support
python
python script changes
#12457
opened Mar 18, 2025 by
Si1w
Loading…
2 of 4 tasks
SYCL: Remove misleading ggml_sycl_op_flatten function
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12387
opened Mar 14, 2025 by
qnixsynapse
•
Draft
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
server
: streaming of tool calls and thoughts when --jinja
is on
documentation
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-20.