Pull requests: ggml-org/llama.cpp
HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14624
opened Jul 10, 2025 by
deepsek
tool: add convertation of text/parquet to custom format
build
Compilation issues
examples
#14622
opened Jul 10, 2025 by
lexasub
llama : support LiquidAI LFM2 hybrid model family
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#14620
opened Jul 10, 2025 by
tdakhran
webui: Change Download function to download the full text of the conversation
examples
server
#14619
opened Jul 10, 2025 by
michaelmarziani
SYCL: use 1D kernel for set_rows
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14618
opened Jul 10, 2025 by
qnixsynapse
sycl: Batched mulmat rework for oneDNN dispatch
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14617
opened Jul 10, 2025 by
ShanoToni
metal : fuse add
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
docker : add cann build pipline
devops
improvements to build systems and github actions
#14591
opened Jul 9, 2025 by
diannaojiang
vulkan: support SET_ROWS
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14587
opened Jul 9, 2025 by
jeffbolznv
metal : reuse graphs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
demo
Demonstrate some concept or idea, not intended to be merged
ggml
changes relating to the ggml tensor library for machine learning
model : add PLaMo-2 model
examples
python
python script changes
#14560
opened Jul 7, 2025 by
mitmul
vulkan: optimizations for deepseek prompt processing
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14555
opened Jul 6, 2025 by
jeffbolznv
CUDA: add set rows for f32 and f16
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14551
opened Jul 6, 2025 by
am17an
common: detect and prefer big cores on AArch64 hybrid CPU on linux
#14532
opened Jul 4, 2025 by
kiwi142857
webui : add a preset feature to the settings
examples
server
#14523
opened Jul 3, 2025 by
gabriellarson
train: add simple loading already tokenized data from parquet dataset
build
Compilation issues
examples
#14522
opened Jul 3, 2025 by
lexasub
ggml: Add initial WebGPU backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#14521
opened Jul 3, 2025 by
reeselevine
mtmd : Fix 32-bit narrowing issue in export-lora and mtmd clip
examples
#14503
opened Jul 2, 2025 by
kiwi142857
MUSA: upgrade musa sdk to <<TBD>>
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14498
opened Jul 2, 2025 by
yeahdongcn
Draft
Compute buffer and KV-cache aware layer distribution for multi-GPU inference
#14484
opened Jul 1, 2025 by
borebot
server : (webui) let server send locally-defined default webui settings
examples
server
#14468
opened Jun 30, 2025 by
woof-dog