Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add Vulkan images to docker.md documentation Improvements or additions to documentation
#14472 opened Jul 1, 2025 by xek Loading…
vulkan: Split large mul_mat_id to fit in shared memory ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#14451 opened Jun 29, 2025 by jeffbolznv Loading…
convert : correct gemma 3n conversion python python script changes
#14450 opened Jun 29, 2025 by ngxson Loading…
vulkan: support softmax/FA batch and broadcast ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14449 opened Jun 29, 2025 by jeffbolznv Loading…
Pr/7191 build Compilation issues devops improvements to build systems and github actions python python script changes
#14447 opened Jun 29, 2025 by esrakorkmz Loading…
ggml : implement GEGLU_ERF and GEGLU_QUICK ops Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#14445 opened Jun 29, 2025 by CISC Loading…
Added CI with RISC-V RVV1.0 Hardware devops improvements to build systems and github actions
#14439 opened Jun 29, 2025 by alitariq4589 Loading…
ggml : support broadcast for ggml_soft_max_ext and ggml_flash_attn_ext Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#14435 opened Jun 28, 2025 by ggerganov Loading…
2 of 5 tasks
model : add hunyuan moe python python script changes
#14425 opened Jun 27, 2025 by ngxson Loading…
4 tasks done
ggml : add ggml_scale_bias Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#14417 opened Jun 27, 2025 by ngxson Draft
[CANN]update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14411 opened Jun 27, 2025 by noemotiovon Loading…
[CANN] weight format to nz for Ascend310P3 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14407 opened Jun 27, 2025 by tqgy6 Loading…
OpenCL: add conv2d kernel ggml changes relating to the ggml tensor library for machine learning
#14403 opened Jun 26, 2025 by rmatif Loading…
ggml : add pointer to attach user data ggml changes relating to the ggml tensor library for machine learning
#14397 opened Jun 26, 2025 by koush Loading…
compare-commits.sh: support both llama-bench and test-backend-ops python python script changes script Script related
#14392 opened Jun 26, 2025 by yeahdongcn Loading…
ggml-cpu: Build variant targeting Neoverse-V2 ggml changes relating to the ggml tensor library for machine learning
#14380 opened Jun 25, 2025 by ckastner Loading…
Q2k interleaving implementation - x86/x64 SIMD ggml changes relating to the ggml tensor library for machine learning
#14373 opened Jun 25, 2025 by Srihari-mcw Loading…
docs: fix broken url in main readme
#14371 opened Jun 25, 2025 by justinclift-prvidr Loading…
test-backend-ops: add support for specifying output format testing Everything test related
#14368 opened Jun 25, 2025 by yeahdongcn Loading…
llama : add high-throughput mode Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning
#14363 opened Jun 24, 2025 by ggerganov Draft
11 of 19 tasks
llama : expose C API to get layer device type
#14358 opened Jun 24, 2025 by okaris Loading…
ProTip! no:milestone will show everything without a milestone.