ggml : fix scalar path for computing norm #16558

ggerganov · 2025-10-13T07:11:02Z

This bug was introduced in #15953 (cc @duduta)

LostRuins · 2025-10-13T07:32:23Z

seems to be working from a quick test

duduta · 2025-10-13T11:20:01Z

Ouch, thank you @ggerganov for spotting the issue

* origin/master: (32 commits)
  metal : FA support F32 K and V and head size = 32 (ggml-org#16531)
  graph : support cacheless embeddings with FA and iSWA (ggml-org#16528)
  opencl: fix build targeting CL 2 (ggml-org#16554)
  CUDA: fix numerical issues in tile FA kernel (ggml-org#16540)
  ggml : fix build broken with -march=armv9-a on MacOS (ggml-org#16520)
  CANN: fix CPU memory leak in CANN backend (ggml-org#16549)
  fix: add remark plugin to render raw HTML as literal text (ggml-org#16505)
  metal: add support for opt_step_sgd (ggml-org#16539)
  ggml : fix scalar path for computing norm (ggml-org#16558)
  CANN: Update several operators to support FP16 data format (ggml-org#16251)
  metal : add opt_step_adamw and op_sum (ggml-org#16529)
  webui: remove client-side context pre-check and rely on backend for limits (ggml-org#16506)
  [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (ggml-org#16521)
  ci : add Vulkan on Ubuntu with default packages build (ggml-org#16532)
  common : handle unicode during partial json parsing (ggml-org#16526)
  common : update presets (ggml-org#16504)
  ggml : Fix FP16 ELU positive branch (ggml-org#16519)
  hparams : add check for layer index in is_recurrent (ggml-org#16511)
  ggml: Correct SVE implementation in ggml_vec_dot_f16_unroll (ggml-org#16518)
  CUDA: faster tile FA, add oob checks, more HSs (ggml-org#16492)
  ...

ggml : fix scalar path for computing norm

7402b74

ggerganov requested a review from slaren as a code owner October 13, 2025 07:11

ggerganov mentioned this pull request Oct 13, 2025

ggml-cpu: optimize the ggml NORM operation #15953

Merged

github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) Oct 13, 2025

LostRuins added a commit to LostRuins/koboldcpp that referenced this pull request Oct 13, 2025

apply fix from ggml-org#16558

3a42c6b

ggerganov merged commit c515fc5 into master Oct 13, 2025
70 checks passed

ggerganov deleted the gg/ggml-fix-cpu-norm-scalar branch October 13, 2025 08:22
