Pull requests: ggml-org/llama.cpp


Pull requests list

opencl: add set_rows for f16 and f32
Labels: ggml, OpenCL
#14547 opened Jul 6, 2025 by lhez Draft
OpenCL: add tiled mul_mat_f16_f32
Labels: ggml, OpenCL
#14535 opened Jul 4, 2025 by rmatif
ggml: fix typo in ggml.c
Labels: ggml
#14531 opened Jul 4, 2025 by zhouwg
CUDA: add bf16 and i32 to getrows
Labels: ggml, Nvidia GPU
#14529 opened Jul 4, 2025 by am17an
ggml: Add initial WebGPU backend
Labels: devops, documentation, ggml, python
#14521 opened Jul 3, 2025 by reeselevine
kv-cache : prepare K/V buffers for separation
#14517 opened Jul 3, 2025 by ggerganov
MUSA: upgrade musa sdk to <<TBD>>
Labels: ggml, Nvidia GPU
#14498 opened Jul 2, 2025 by yeahdongcn Draft
Allow truncation when embedding
Labels: examples, server
#14493 opened Jul 2, 2025 by huydt84
vulkan: unpack more values at a time for iquants mat mul
Labels: ggml, Vulkan
#14485 opened Jul 1, 2025 by netrunnereve
llama : reuse compute graphs
#14482 opened Jul 1, 2025 by ggerganov Draft
6 of 15 tasks
Pr/7191
Labels: build, devops, python
#14447 opened Jun 29, 2025 by esrakorkmz
Added CI with RISC-V RVV1.0 Hardware
Labels: devops
#14439 opened Jun 29, 2025 by alitariq4589
model : add hunyuan moe
Labels: python
#14425 opened Jun 27, 2025 by ngxson
4 tasks done
ggml : add ggml_scale_bias
Labels: Apple Metal, ggml, testing
#14417 opened Jun 27, 2025 by ngxson Draft
[CANN] weight format to nz for Ascend310P3
Labels: Ascend NPU, ggml
#14407 opened Jun 27, 2025 by tqgy6
OpenCL: add conv2d kernel
Labels: ggml, OpenCL
#14403 opened Jun 26, 2025 by rmatif