ggml-org / llama.cpp Public

Pull requests: ggml-org/llama.cpp

Labels 72 Milestones 0

400 Open 5,183 Closed

Pull requests list

llama-vocab : add SuperBPE pre-tokenizer
Labels: model (Model specific), python (python script changes)
#12532 opened Mar 23, 2025 by compilade

ggml : riscv: add 128-bit RVV support
Labels: ggml (changes relating to the ggml tensor library for machine learning)
#12530 opened Mar 23, 2025 by xctan

vulkan: fix mul_mat_vec failure in backend tests
Labels: ggml, testing (Everything test related), Vulkan (Issues specific to the Vulkan backend)
#12529 opened Mar 23, 2025 by jeffbolznv

cmake: fix ccache conflict
Labels: ggml
#12522 opened Mar 23, 2025 by BusyJay

llama-map to support hugepage feature of pagesize 2M or 1G which can …
#12521 opened Mar 23, 2025 by nickhuang99

Vulkan: Remove dedicated aligned matrix matrix multiplication shaders
Labels: ggml, testing, Vulkan
#12515 opened Mar 22, 2025 by 0cc4m • Draft

llama-tts : precompute irFFT theta
Labels: examples
#12514 opened Mar 22, 2025 by marcoStocchi

perplexity: Add option to ignore context window overflow errors and continue score calculation
Labels: examples
#12512 opened Mar 22, 2025 by EAddario

quantize: Handle user-defined quantization levels for additional tensors
Labels: examples
#12511 opened Mar 22, 2025 by EAddario

cmake: Allow to configure GGML_BUILD_NUMBER with file
Labels: ggml
#12509 opened Mar 22, 2025 by booxter • Draft

opencl: simplify kernel embedding logic in CMakeLists.txt
Labels: ggml
#12503 opened Mar 21, 2025 by lhez

llama: support Qwen3
Labels: python
#12501 opened Mar 21, 2025 by CISC • Draft

rpc : send hash when tensor data is above some fixed threshold
Labels: examples, ggml
#12496 opened Mar 21, 2025 by rgerganov

llamafile : ppc64le MMA implementation for Q4_0.
Labels: ggml
#12489 opened Mar 21, 2025 by amritahs-ibm

Evenly and stably pinning thread pool
Labels: ggml
#12488 opened Mar 21, 2025 by zts9989 • Draft

(draft) tts: Orpheus support
Labels: ggml, python
#12487 opened Mar 21, 2025 by jamorphy • Draft

Metal TQ2_0
Labels: Apple Metal (https://en.wikipedia.org/wiki/Metal_(API)), ggml
#12485 opened Mar 20, 2025 by dmahurin

Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture
Labels: python
#12466 opened Mar 19, 2025 by manyoso • Draft

[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588
Labels: ggml
#12459 opened Mar 19, 2025 by Corsair-cxs

Add PLM GGUF Conversion & Inference Support
Labels: python
#12457 opened Mar 18, 2025 by Si1w
2 of 4 tasks

ci: add Linux cross-compile build
Labels: devops (improvements to build systems and github actions), ggml, Vulkan
#12428 opened Mar 17, 2025 by bandoti

Add Qwen2.5VL support
Labels: examples, python
#12402 opened Mar 15, 2025 by HimariO • Draft
2 of 4 tasks

SYCL: Remove misleading ggml_sycl_op_flatten function
Labels: ggml, SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language)
#12387 opened Mar 14, 2025 by qnixsynapse • Draft

[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
Labels: ggml, Nvidia GPU (Issues specific to Nvidia GPUs)
#12383 opened Mar 14, 2025 by BodhiHu • Draft

server: streaming of tool calls and thoughts when --jinja is on
Labels: documentation (Improvements or additions to documentation), examples, python, server, testing
#12379 opened Mar 14, 2025 by ochafik • Draft
4 of 10 tasks
