@jan-service-account

Updates dev branch with latest release (b4966) from ggml-org/llama.cpp

CISC and others added 8 commits March 25, 2025 23:03
* Fix Mistral3/Gemma3 model hparams init

* set positional args correctly

* use existing hparams if passed
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
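
The three bullets above describe a small but easy-to-miss pattern in the conversion scripts. A minimal sketch of that pattern, with hypothetical class and helper names rather than the actual convert_hf_to_gguf.py code:

```python
# Hypothetical sketch: forward positional args unchanged and reuse hparams
# when the caller already supplies them, instead of reloading from disk.
import json
from pathlib import Path
from typing import Any


def load_hparams(dir_model: Path) -> dict[str, Any]:
    # Stand-in for reading the model's config.json.
    with open(dir_model / "config.json", encoding="utf-8") as f:
        return json.load(f)


class TextModel:
    def __init__(self, dir_model: Path, *args: Any,
                 hparams: dict[str, Any] | None = None, **kwargs: Any):
        # Only hit the disk when no hparams were passed in.
        self.hparams = hparams if hparams is not None else load_hparams(dir_model)
        self.dir_model = dir_model


class Mistral3Model(TextModel):
    def __init__(self, dir_model: Path, *args: Any, **kwargs: Any):
        hparams = kwargs.pop("hparams", None) or load_hparams(dir_model)
        # A multimodal config may nest the text settings; flatten them before
        # forwarding, and pass the positional args through in order.
        if "text_config" in hparams:
            hparams = {**hparams, **hparams["text_config"]}
        super().__init__(dir_model, *args, hparams=hparams, **kwargs)
```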
* ggml : fix MUL_MAT_ID repack with Q8_K

ggml-ci

* ggml : improve repack templates

ggml-ci
* convert : fix squeeze for ssm_conv tensors

* convert : match ssm_conv tensors by type

---------

Co-authored-by: Francis Couture-Harpin <git@compilade.net>
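
As context for the two convert fixes above, a hedged sketch of squeezing ssm_conv weights by their mapped tensor type rather than by raw name; the enum, the mapping helper, and the (d_inner, 1, d_conv) layout are assumptions for illustration, not the actual converter code:

```python
# Hypothetical sketch: decide whether to squeeze based on what the tensor
# maps to, not on fragile substring checks against the source name.
from enum import Enum

import numpy as np


class TensorType(Enum):
    SSM_CONV1D = "ssm_conv1d"
    OTHER = "other"


def map_tensor_type(name: str) -> TensorType:
    # Stand-in for the converter's source-name -> tensor-type mapping.
    return TensorType.SSM_CONV1D if "conv1d.weight" in name else TensorType.OTHER


def maybe_squeeze(name: str, data: np.ndarray) -> np.ndarray:
    # Assumed layout: conv kernels arrive as (d_inner, 1, d_conv); drop the
    # singleton middle axis so downstream code sees a 2D tensor.
    if (map_tensor_type(name) is TensorType.SSM_CONV1D
            and data.ndim == 3 and data.shape[1] == 1):
        data = np.squeeze(data, axis=1)
    return data
```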
… backend (ggml-org#12566)

* [Fix] Building clip-quantize-cli and running it in a CUDA environment causes ggml_fp16_to_fp32 to fail when it tries to access video memory; quantization has to run on the CPU backend instead.
After the fix, quantize automatically runs on the CPU backend and is no longer bound to CUDA.

* [Fix] Roll back the signature and implementation of clip_model_load, and change the call in clip_model_quantize to clip_init.
* metal : refactor mat-vec code

ggml-ci

* metal : rename all_sum -> sum_all

ggml-ci

* metal : fix comments [no ci]

* metal : fix nr constant [no ci]

* metal : mv q6_K support nr0 > 1

ggml-ci

* metal : reduce register pressure

ggml-ci

* metal : fix typo [no ci]

* metal : reduce register pressure

ggml-ci
jan-service-account merged commit fcf9298 into dev on Mar 27, 2025
16 checks passed
jan-service-account deleted the update-dev-from-master-2025-03-27-00-08 branch on March 27, 2025 at 00:18