Releases: ggml-org/llama.cpp

b4856

08 Mar 16:46
6fefc05
ggml-backend : make path_str compatible with C++20 (#12269)
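For context: in C++17 `std::filesystem::path::u8string()` returns `std::string`, while in C++20 it returns `std::u8string`, so code that assigns the result to a `std::string` stops compiling. The sketch below shows one way to bridge the two standards. The helper name `path_to_utf8` is hypothetical and this is not the code from the commit.

```cpp
#include <filesystem>
#include <string>

namespace fs = std::filesystem;

// Hypothetical helper (not the actual ggml-backend code): returns the path
// as UTF-8 bytes in a std::string under both C++17 and C++20.
static std::string path_to_utf8(const fs::path & p) {
#if defined(__cpp_lib_char8_t)
    // C++20: u8string() yields std::u8string, so copy the bytes over.
    const std::u8string s = p.u8string();
    return std::string(s.begin(), s.end());
#else
    // C++17: u8string() already yields std::string.
    return p.u8string();
#endif
}
```

The feature-test macro `__cpp_lib_char8_t` is used rather than a check on `__cplusplus` so that the branch follows the library's actual `char8_t` support.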

b4855

07 Mar 19:40
7ab3643
server : infill gen ends on new line (#12254)

b4854

07 Mar 14:01
7c7f3b7
ggml : skip intermediate .air file when compiling .metallib (#12247)

This commit updates the compilation of default.metallib to skip the
intermediate .air (Apple Intermediate Representation) file.

The motivation for this change is to simplify the custom command a
little and avoid generating and then removing the .air file.

b4853

07 Mar 13:40
sync : ggml

ggml-ci

b4851

07 Mar 12:46
68d0027
ggml-cpu: faster AVX2 variant for IQ1_M (#12216)

b4849

07 Mar 11:05
8fad3c7
server : Log original chat template parsing error (#12233)

b4848

07 Mar 10:16
7cf64f6
sync: minja - support QwQ-32B (#12235)

https://github.com/google/minja/commit/8a76f7815e8a3ae00bd233c2b5a8b7d4e86564ec

b4847

07 Mar 08:22
5e2d57b
metal : simplify kernel arguments using a struct (#3229) (#12194)

* metal : refactor im2col parameters into a struct

* metal: Change im2col offset types from int32_t to uint64_t to support larger memory offsets

* metal : refactor sum_rows parameters into a struct

* metal : refactor soft_max parameters into a struct

* metal : refactor diag_mask_inf parameters into a struct

* metal : refactor ssm_conv parameters into a struct

* metal : refactor ssm_scan parameters into a struct

* metal : refactor get_rows parameters into a struct

* metal : refactor group_norm parameters into a struct

* metal : refactor conv_transpose_1d parameters into a struct

* metal : refactor upscale parameters into a struct

* metal : refactor pad parameters into a struct

* metal : refactor pad_reflect_1d parameters into a struct

* metal : refactor arange parameters into a struct

* metal : refactor timestep_embedding parameters into a struct

* metal : refactor argsort parameters into a struct

* metal : refactor leaky_relu parameters into a struct

* metal : refactor pool_2d parameters into a struct

* metal : fix trailing whitespace

---------

Co-authored-by: alexju <alexju@tencent.com>
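The refactor pattern listed above (per-kernel parameters gathered into one struct, with offsets widened from `int32_t` to `uint64_t`) can be sketched on the host side as follows. The struct name, its fields, and `pack_args` are assumptions made for illustration; they are not copied from the ggml Metal backend.

```cpp
#include <cstdint>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical argument block for an im2col-style kernel. Field names are
// illustrative only. Offsets are 64-bit so they can exceed the int32_t range.
struct im2col_args_sketch {
    uint64_t ofs0;  // byte offset into the source buffer
    uint64_t ofs1;  // byte offset into the destination buffer
    int32_t  IW;    // input width
    int32_t  IH;    // input height
    int32_t  KW;    // kernel width
    int32_t  KH;    // kernel height
};

// The struct must be plain data so it can be uploaded as raw bytes.
static_assert(std::is_trivially_copyable<im2col_args_sketch>::value,
              "argument block must be trivially copyable");

// Stand-in for binding the arguments to a kernel: one struct is copied as a
// single blob instead of setting each scalar parameter separately.
static std::vector<uint8_t> pack_args(const im2col_args_sketch & args) {
    std::vector<uint8_t> bytes(sizeof(args));
    std::memcpy(bytes.data(), &args, sizeof(args));
    return bytes;
}
```

With this layout, adding a parameter means adding one field in the shared struct rather than renumbering a sequence of individually bound arguments.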

b4846

07 Mar 07:51
f1648e9
HIP: fix rocWMMA build flags under Windows (#12230)

b4837

06 Mar 08:01
e721c05
HIP/CUDA: set the parameter value in maintain_cuda_graph instead of …