Pull requests: ggml-org/llama.cpp
convert : fix squeeze for ssm_conv tensors
python
python script changes
#12573
opened Mar 25, 2025 by
ggerganov
metal : refactor mat-vec code
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12569
opened Mar 25, 2025 by
ggerganov
clip: Fix llama-llava-clip-quantize-cli quantization error under CUDA backend
examples
#12566
opened Mar 25, 2025 by
Ivy233
Enable MMA for BF16 data types on Powerpc
ggml
changes relating to the ggml tensor library for machine learning
#12565
opened Mar 25, 2025 by
shalinib-ibm
vulkan: Implement grouped query attention in the coopmat2 FA shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12559
opened Mar 25, 2025 by
jeffbolznv
ggml-quants : weighted rounding algorithms with cumulative search
generation quality
Quality of model output
ggml
changes relating to the ggml tensor library for machine learning
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research 🔬
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Tensor Encoding Scheme
https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
#12557
opened Mar 25, 2025 by
compilade
Add Trillion 7B model support
python
python script changes
#12556
opened Mar 25, 2025 by
juyoung-trl
1 of 3 tasks
Draft: vulkan: Add bfloat16 support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12554
opened Mar 24, 2025 by
jeffbolznv
llama-map to support hugepage feature of pagesize 2M or 1G which can …
#12552
opened Mar 24, 2025 by
nickhuang99
ggml : fix MUL_MAT_ID repack with Q8_K
ggml
changes relating to the ggml tensor library for machine learning
#12544
opened Mar 24, 2025 by
ggerganov
ggml : riscv: add 128-bit RVV support
ggml
changes relating to the ggml tensor library for machine learning
#12530
opened Mar 23, 2025 by
xctan
cmake: fix ccache conflict
ggml
changes relating to the ggml tensor library for machine learning
#12522
opened Mar 23, 2025 by
BusyJay
perplexity: Add option to ignore context window overflow errors and continue score calculation
examples
#12512
opened Mar 22, 2025 by
EAddario
quantize: Handle user-defined quantization levels for additional tensors
examples
#12511
opened Mar 22, 2025 by
EAddario
cmake: Allow to configure GGML_BUILD_NUMBER with file
ggml
changes relating to the ggml tensor library for machine learning
rpc : send hash when tensor data is above some fixed threshold
examples
ggml
changes relating to the ggml tensor library for machine learning
#12496
opened Mar 21, 2025 by
rgerganov
llamafile : ppc64le MMA implementation for Q4_0.
ggml
changes relating to the ggml tensor library for machine learning
#12489
opened Mar 21, 2025 by
amritahs-ibm
Evenly and stably pinning thread pool
ggml
changes relating to the ggml tensor library for machine learning
Metal TQ2_0
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12485
opened Mar 20, 2025 by
dmahurin