ggml-org / llama.cpp Public

Pull requests: ggml-org/llama.cpp

474 Open 6,229 Closed

HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3 ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14624 opened Jul 10, 2025 by deepsek

tool: add convertation of text/parquet to custom format build

Compilation issues

examples

#14622 opened Jul 10, 2025 by lexasub

llama : support LiquidAI LFM2 hybrid model family ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

python

python script changes

#14620 opened Jul 10, 2025 by tdakhran

webui: Change Download function to download the full text of the conversation examples server

#14619 opened Jul 10, 2025 by michaelmarziani

SYCL: use 1D kernel for set_rows ggml

changes relating to the ggml tensor library for machine learning

SYCL

https://en.wikipedia.org/wiki/SYCL - GPU programming language

#14618 opened Jul 10, 2025 by qnixsynapse

sycl: Batched mulmat rework for oneDNN dispatch ggml

changes relating to the ggml tensor library for machine learning

SYCL

https://en.wikipedia.org/wiki/SYCL - GPU programming language

#14617 opened Jul 10, 2025 by ShanoToni

kv-cache : opt mask set input

#14600 opened Jul 9, 2025 by ggerganov

metal : fuse add Apple Metal

https://en.wikipedia.org/wiki/Metal_(API)

ggml

changes relating to the ggml tensor library for machine learning

#14596 opened Jul 9, 2025 by ggerganov • Draft

docker : add cann build pipline devops

improvements to build systems and github actions

#14591 opened Jul 9, 2025 by diannaojiang

vulkan: support SET_ROWS ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#14587 opened Jul 9, 2025 by jeffbolznv

quantize: fix minor logic flaw in --tensor-type

#14572 opened Jul 7, 2025 by EAddario • Draft

metal : reuse graphs Apple Metal

https://en.wikipedia.org/wiki/Metal_(API)

demo

Demonstrate some concept or idea, not intended to be merged

ggml

changes relating to the ggml tensor library for machine learning

#14570 opened Jul 7, 2025 by ggerganov • Draft

model : add PLaMo-2 model examples python

python script changes

#14560 opened Jul 7, 2025 by mitmul

vulkan: optimizations for deepseek prompt processing ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#14555 opened Jul 6, 2025 by jeffbolznv

CUDA: add set rows for f32 and f16 examples ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14551 opened Jul 6, 2025 by am17an

common: detect and prefer big cores on AArch64 hybrid CPU on linux

#14532 opened Jul 4, 2025 by kiwi142857

webui : add a preset feature to the settings examples server

#14523 opened Jul 3, 2025 by gabriellarson

train: add simple loading already tokenized data from parquet dataset build

Compilation issues

examples

#14522 opened Jul 3, 2025 by lexasub

ggml: Add initial WebGPU backend devops

improvements to build systems and github actions

documentation

Improvements or additions to documentation

ggml

changes relating to the ggml tensor library for machine learning

python

python script changes

#14521 opened Jul 3, 2025 by reeselevine

mtmd : Fix 32-bit narrowing issue in export-lora and mtmd clip examples

#14503 opened Jul 2, 2025 by kiwi142857

MUSA: upgrade musa sdk to <<TBD>> ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14498 opened Jul 2, 2025 by yeahdongcn • Draft

Allow truncation when embedding examples server

#14493 opened Jul 2, 2025 by huydt84

Compute buffer and KV-cache aware layer distribution for multi-GPU inference

#14484 opened Jul 1, 2025 by borebot

llama : reuse compute graphs examples

#14482 opened Jul 1, 2025 by ggerganov

8 of 15 tasks

server : (webui) let server send locally-defined default webui settings examples server

#14468 opened Jun 30, 2025 by woof-dog
