-
Notifications
You must be signed in to change notification settings - Fork 12.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
webui : add a preset feature to the settings
examples
server
#14649
opened Jul 12, 2025 by
gabriellarson
Loading…
Add CUDA non-contiguous Unary Ops support
build
Compilation issues
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14639
opened Jul 11, 2025 by
YavorGIvanov
Loading…
OpenCL: add changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
mul_mat_f16_f32_image
kernel
ggml
#14635
opened Jul 11, 2025 by
rmatif
Loading…
Add EXAONE 4.0 model architecture
python
python script changes
#14630
opened Jul 11, 2025 by
lgai-exaone
Loading…
HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14624
opened Jul 10, 2025 by
deepsek
Loading…
tool: add convertation of text/parquet to custom format
build
Compilation issues
examples
#14622
opened Jul 10, 2025 by
lexasub
Loading…
webui: Change Download function to download the full text of the conversation
examples
server
#14619
opened Jul 10, 2025 by
michaelmarziani
Loading…
SYCL: use 1D kernel for set_rows
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14618
opened Jul 10, 2025 by
qnixsynapse
Loading…
sycl: Batched mulmat rework for oneDNN dispatch
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14617
opened Jul 10, 2025 by
ShanoToni
Loading…
metal : fuse add
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
docker : add cann build pipline
devops
improvements to build systems and github actions
#14591
opened Jul 9, 2025 by
diannaojiang
Loading…
metal : reuse graphs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
demo
Demonstrate some concept or idea, not intended to be merged
ggml
changes relating to the ggml tensor library for machine learning
model : add PLaMo-2 model
examples
python
python script changes
#14560
opened Jul 7, 2025 by
mitmul
Loading…
common: detect and prefer big cores on AArch64 hybrid CPU on linux
#14532
opened Jul 4, 2025 by
kiwi142857
Loading…
train: add simple loading already tokenized data from parquet dataset
build
Compilation issues
examples
#14522
opened Jul 3, 2025 by
lexasub
Loading…
ggml: Add initial WebGPU backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
hot
Something that is hot
python
python script changes
#14521
opened Jul 3, 2025 by
reeselevine
Loading…
mtmd : Fix 32-bit narrowing issue in export-lora and mtmd clip
examples
#14503
opened Jul 2, 2025 by
kiwi142857
Loading…
MUSA: upgrade musa sdk to <<TBD>>
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14498
opened Jul 2, 2025 by
yeahdongcn
•
Draft
Compute buffer and KV-cache aware layer distribution for multi-GPU inference
#14484
opened Jul 1, 2025 by
borebot
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-07-09.