Pull requests: ggml-org/llama.cpp
sync : ggml
Labels: ggml (changes relating to the ggml tensor library for machine learning), script (Script related)
#12645 opened Mar 29, 2025 by ggerganov
llama-server : implement universal assisted decoding
Labels: examples, server
#12635 opened Mar 28, 2025 by g2mt
llama : support BailingMoE (Ling)
Labels: model (Model specific), python (python script changes)
#12634 opened Mar 28, 2025 by CISC
vulkan: Hybrid waitForFences/getFenceStatus to reduce fence latency
Labels: ggml, Vulkan (Issues specific to the Vulkan backend)
#12630 opened Mar 28, 2025 by jeffbolznv
vulkan: Implement split_k for coopmat2 flash attention.
Labels: ggml, testing (Everything test related), Vulkan
#12627 opened Mar 28, 2025 by jeffbolznv
opencl: remove a self-referential macro
Labels: ggml
#12626 opened Mar 28, 2025 by linehill
sycl: allow ggml-sycl configuration and compilation using Visual Studio project/solution
Labels: documentation (Improvements or additions to documentation), ggml, SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language)
#12625 opened Mar 28, 2025 by s-Nick
opencl: Add support for multiple devices
Labels: ggml
Add Yandex instruct model template support
Labels: testing
#12621 opened Mar 28, 2025 by vorobyov01 (1 of 3 tasks)
musa: fix all warnings, re-enable -DLLAMA_FATAL_WARNINGS=ON in ci and update doc
Labels: ggml, Nvidia GPU (Issues specific to Nvidia GPUs), devops (improvements to build systems and github actions)
#12611 opened Mar 27, 2025 by yeahdongcn (2 tasks done)
Enable MMA for BF16 data types on Powerpc
Labels: ggml
#12565 opened Mar 25, 2025 by shalinib-ibm (Draft)
vulkan: Implement grouped query attention in the coopmat2 FA shader
Labels: ggml, Vulkan
#12559 opened Mar 25, 2025 by jeffbolznv
ggml-quants : weighted rounding algorithms with cumulative search
Labels: generation quality (Quality of model output), ggml, Less than 4 bits (Efforts related to viable quantized models using <4 bits), research 🔬, Review Complexity : Medium (Generally require more time to grok but manageable by beginner to medium expertise level), Tensor Encoding Scheme (https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes)
#12557 opened Mar 25, 2025 by compilade
Add Trillion 7B model support
Labels: python
#12556 opened Mar 25, 2025 by juyoung-trl (1 of 3 tasks)
Draft: vulkan: Add bfloat16 support
Labels: ggml, Vulkan
#12554 opened Mar 24, 2025 by jeffbolznv
llama-map to support hugepage feature of pagesize 2M or 1G which can …
#12552 opened Mar 24, 2025 by nickhuang99
perplexity: Add option to ignore context window overflow errors and continue score calculation
Labels: examples
#12512 opened Mar 22, 2025 by EAddario
quantize: Handle user-defined quantization levels for additional tensors
Labels: examples
#12511 opened Mar 22, 2025 by EAddario
cmake: Allow to configure GGML_BUILD_NUMBER with file
Labels: ggml