Sync master with upstream release b5013 #38

jan-service-account · 2025-04-01T00:09:31Z

Updates dev branch with latest release (b5013) from ggml-org/llama.cpp

* tts.cpp : llama tokens console output is done using LOG_INF instead of printf(). Therefore the options '--log-disable' and '--log-file' have now uniform impact on all output.

* SYCL: Remove misleading ggml_sycl_op_flatten function * remove trailing whitespace * Fix L2 norm from rebase * remove try catch block from element_wise.cpp * remove comment from common.hp * ggml-sycl.cpp: Add try catch sycl::exception block in compute_forward * norm.cpp: remove try catch exception block

Co-authored-by: Sandro Hanea <me@sandro.rocks>

ggml-ci

* Vulkan: Add DP4A MMQ and Q8_1 quantization shader * Add q4_0 x q8_1 matrix matrix multiplication support * Vulkan: Add int8 coopmat MMQ support * Vulkan: Add q4_1, q5_0 and q5_1 quants, improve integer dot code * Add GL_EXT_integer_dot_product check * Remove ggml changes, fix mmq pipeline picker * Remove ggml changes, restore Intel coopmat behaviour * Fix glsl compile attempt when integer vec dot is not supported * Remove redundant code, use non-saturating integer dot, enable all matmul sizes for mmq * Remove redundant comment * Fix integer dot check * Fix compile issue with unsupported int dot glslc * Update Windows build Vulkan SDK version

…esent (ggml-org#12667)

* faster ssm_scan * delete unused commnet * clang format * add space * modify unnecessary calculations * faster ssm conv implementatioin * modify file name with dash

* vocab : add special infill tokens for CodeLlama The commit adds the following special tokens for CodeLlama infill: - `▁<PRE>` - `▁<SUF>` - `▁<MID>` The motivation for this is that currently the infill example uses CodeLlama as a suggested model. But when using this model the following error is generated: ```console /llama.cpp-debug/examples/infill/infill.cpp:165: GGML_ASSERT(llama_vocab_fim_pre(vocab) >= 0) failed Could not attach to process. If your uid matches the uid of the target process, check the setting of /proc/sys/kernel/yama/ptrace_scope, or try again as the root user. For more details, see /etc/sysctl.d/10-ptrace.conf ptrace: Operation not permitted. No stack. The program is not being run. 305251 Aborted (core dumped) ./build/bin/llama-infill -t 10 -ngl 0 -m models/codellama-13b.Q5_K_S.gguf \ -c 4096 --temp 0.7 --repeat_penalty 1.1 -n 20 \ --in-prefix "def helloworld():\n print(\"hell" \ --in-suffix "\n print(\"goodbye world\")\n " ``` * squash! vocab : add special infill tokens for CodeLlama Add _<EOT> as well.

marcoStocchi and others added 11 commits March 31, 2025 11:20

tts : remove printfs (ggml-org#12640)

52de2e5

* tts.cpp : llama tokens console output is done using LOG_INF instead of printf(). Therefore the options '--log-disable' and '--log-file' have now uniform impact on all output.

llava : fix clip loading GGUFs with missing description (ggml-org#12660)

f52d59d

llava : proper description fix (ggml-org#12668)

1a85949

cmake: improve Vulkan cooperative matrix support checks (whisper/2966)

a772448

Co-authored-by: Sandro Hanea <me@sandro.rocks>

sync : ggml

0114a32

ggml-ci

cmake : fix whitespace (#0)

1790e73

convert : Qwerky : use lora_rank_tokenshift and lora_rank_decay if pr…

403fbac

…esent (ggml-org#12667)

ggml : faster ssm scan (ggml-org#10558)

250d795

* faster ssm_scan * delete unused commnet * clang format * add space * modify unnecessary calculations * faster ssm conv implementatioin * modify file name with dash

github-actions bot added devops SYCL Nvidia GPU Vulkan examples python ggml script labels Apr 1, 2025

jan-service-account merged commit c80a775 into dev Apr 2, 2025
23 of 24 checks passed

jan-service-account deleted the update-dev-from-master-2025-04-01-00-09 branch April 2, 2025 00:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sync master with upstream release b5013 #38

Sync master with upstream release b5013 #38

Uh oh!

jan-service-account commented Apr 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Sync master with upstream release b5013 #38

Sync master with upstream release b5013 #38

Uh oh!

Conversation

jan-service-account commented Apr 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants