Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture python python script changes
#12466 opened Mar 19, 2025 by manyoso Loading…
[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588 ggml changes relating to the ggml tensor library for machine learning
#12459 opened Mar 19, 2025 by Corsair-cxs Loading…
Add PLM GGUF Conversion & Inference Support python python script changes
#12457 opened Mar 18, 2025 by Si1w Loading…
2 of 4 tasks
ci: add Linux cross-compile build devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12428 opened Mar 17, 2025 by bandoti Loading…
vulkan: optimize iq1 coopmat2 dequant functions ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12427 opened Mar 17, 2025 by jeffbolznv Loading…
Add Qwen2.5VL support examples python python script changes
#12402 opened Mar 15, 2025 by HimariO Draft
2 of 4 tasks
SYCL: Remove misleading ggml_sycl_op_flatten function ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12387 opened Mar 14, 2025 by qnixsynapse Draft
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12383 opened Mar 14, 2025 by BodhiHu Draft
server: streaming of tool calls and thoughts when --jinja is on documentation Improvements or additions to documentation examples python python script changes server testing Everything test related
#12379 opened Mar 14, 2025 by ochafik Draft
4 of 10 tasks
Add support for new gfx1200 and gfx1201 targets documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12372 opened Mar 13, 2025 by slojosic-amd Loading…
Block interleaving support for Q4_K quantization for x86 AVX2 architecture ggml changes relating to the ggml tensor library for machine learning
#12332 opened Mar 11, 2025 by Srihari-mcw Loading…
Fixed Eval Bug: 12163 : Fallback to CPU when loading model: vk::PhysicalDevice::createDevice: ErrorExtensionNotPresent. ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12329 opened Mar 11, 2025 by ashwini778 Loading…
PR: Refine ggml-qnn backend(QNN, Qualcomm Neural Network,aka Qualcomm AI Engine Direct) for latest ggml,whisper.cpp,llama.cpp build Compilation issues ggml changes relating to the ggml tensor library for machine learning script Script related testing Everything test related
#12326 opened Mar 11, 2025 by zhouwg Loading…
1 task done
ggml : fix quantized cpy op ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#12310 opened Mar 10, 2025 by ggerganov Loading…
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
vulkan: fix coopmat shader generation when cross-compiling ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12272 opened Mar 8, 2025 by Icenowy Loading…
Add simple-tts example examples
#12261 opened Mar 8, 2025 by danemadsen Loading…
vulkan: optimization proposals for coopmat1 mul_mm ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12260 opened Mar 7, 2025 by remyoudompheng Draft
server : Add verbose output to OAI compatible chat endpoint. android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment Nvidia GPU Issues specific to Nvidia GPUs python python script changes script Script related server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12246 opened Mar 7, 2025 by mglambda Loading…
Fix rocWMMA build documentation documentation Improvements or additions to documentation
#12243 opened Mar 7, 2025 by Headcrabed Loading…
tests: use adaptive number of threads testing Everything test related
#12236 opened Mar 6, 2025 by JohannesGaessler Loading…
SYCL: Rename oneMKL to oneMath documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192 opened Mar 5, 2025 by Rbiessy Loading…
ProTip! Updated in the last three days: updated:>2025-03-16.