Pull requests: ggml-org/llama.cpp

Pull requests list

llama : remove ggml_cont where possible
#14568 opened Jul 7, 2025 by CISC
CUDA: add bilinear interpolation for upscale [ggml] [Nvidia GPU]
#14563 opened Jul 7, 2025 by am17an
SYCL: Initial set_rows kernel implementation [ggml] [SYCL]
#14562 opened Jul 7, 2025 by qnixsynapse
musa: fix build warnings (unused variable) [ggml] [Nvidia GPU]
#14561 opened Jul 7, 2025 by yeahdongcn
Add PLaMo-2 model [examples] [python]
#14560 opened Jul 7, 2025 by mitmul
vulkan: optimizations for deepseek prompt processing [ggml] [Vulkan]
#14555 opened Jul 6, 2025 by jeffbolznv
vulkan: optimize flash attention split_k_reduce [ggml] [Vulkan]
#14554 opened Jul 6, 2025 by jeffbolznv
CUDA: add set rows for f32 and f16 [ggml] [Nvidia GPU]
#14551 opened Jul 6, 2025 by am17an
opencl: add set_rows for f16 and f32 [ggml] [OpenCL]
#14547 opened Jul 6, 2025 by lhez
OpenCL: add tiled mul_mat_f16_f32 [ggml] [OpenCL]
#14535 opened Jul 4, 2025 by rmatif
llama: add initial support for Falcon-H1 model family [python]
#14534 opened Jul 4, 2025 by ibrahimkhadraoui
ggml: fix typo in ggml.c [ggml]
#14531 opened Jul 4, 2025 by zhouwg
ggml: Add initial WebGPU backend [devops] [documentation] [ggml] [python]
#14521 opened Jul 3, 2025 by reeselevine
kv-cache : prepare K/V buffers for separation
#14517 opened Jul 3, 2025 by ggerganov
MUSA: upgrade musa sdk to <<TBD>> [ggml] [Nvidia GPU]
#14498 opened Jul 2, 2025 by yeahdongcn (Draft)
Allow truncation when embedding [examples] [server]
#14493 opened Jul 2, 2025 by huydt84
llama : reuse compute graphs [examples]
#14482 opened Jul 1, 2025 by ggerganov (6 of 15 tasks)