-
Notifications
You must be signed in to change notification settings - Fork 11.1k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture
python
python script changes
#12466
opened Mar 19, 2025 by
manyoso
Loading…
[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588
ggml
changes relating to the ggml tensor library for machine learning
#12459
opened Mar 19, 2025 by
Corsair-cxs
Loading…
Add PLM GGUF Conversion & Inference Support
python
python script changes
#12457
opened Mar 18, 2025 by
Si1w
Loading…
2 of 4 tasks
vulkan: optimize iq1 coopmat2 dequant functions
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12427
opened Mar 17, 2025 by
jeffbolznv
Loading…
SYCL: Remove misleading ggml_sycl_op_flatten function
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12387
opened Mar 14, 2025 by
qnixsynapse
•
Draft
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
server
: streaming of tool calls and thoughts when --jinja
is on
documentation
Add support for new gfx1200 and gfx1201 targets
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12372
opened Mar 13, 2025 by
slojosic-amd
Loading…
Block interleaving support for Q4_K quantization for x86 AVX2 architecture
ggml
changes relating to the ggml tensor library for machine learning
#12332
opened Mar 11, 2025 by
Srihari-mcw
Loading…
Fixed Eval Bug: 12163 : Fallback to CPU when loading model: vk::PhysicalDevice::createDevice: ErrorExtensionNotPresent.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12329
opened Mar 11, 2025 by
ashwini778
Loading…
PR: Refine ggml-qnn backend(QNN, Qualcomm Neural Network,aka Qualcomm AI Engine Direct) for latest ggml,whisper.cpp,llama.cpp
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
testing
Everything test related
#12326
opened Mar 11, 2025 by
zhouwg
Loading…
1 task done
ggml : fix quantized cpy op
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#12310
opened Mar 10, 2025 by
ggerganov
Loading…
tool-call
: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
Loading…
vulkan: fix coopmat shader generation when cross-compiling
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12272
opened Mar 8, 2025 by
Icenowy
Loading…
vulkan: optimization proposals for coopmat1 mul_mm
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12260
opened Mar 7, 2025 by
remyoudompheng
•
Draft
server : Add verbose output to OAI compatible chat endpoint.
android
Issues specific to Android
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
script
Script related
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12246
opened Mar 7, 2025 by
mglambda
Loading…
Fix rocWMMA build documentation
documentation
Improvements or additions to documentation
#12243
opened Mar 7, 2025 by
Headcrabed
Loading…
tests: use adaptive number of threads
testing
Everything test related
#12236
opened Mar 6, 2025 by
JohannesGaessler
Loading…
SYCL: Rename oneMKL to oneMath
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192
opened Mar 5, 2025 by
Rbiessy
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-16.