Skip to content

Conversation

ggerganov
Copy link
Member

ref #15953 (comment)

This bug was introduced in #15953 (cc @duduta)

@ggerganov ggerganov requested a review from slaren as a code owner October 13, 2025 07:11
@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Oct 13, 2025
LostRuins added a commit to LostRuins/koboldcpp that referenced this pull request Oct 13, 2025
@LostRuins
Copy link
Collaborator

seems to be working from a quick test

@ggerganov ggerganov merged commit c515fc5 into master Oct 13, 2025
70 checks passed
@ggerganov ggerganov deleted the gg/ggml-fix-cpu-norm-scalar branch October 13, 2025 08:22
@duduta
Copy link
Contributor

duduta commented Oct 13, 2025

auch, thank you @ggerganov for spottiing the issue

gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Oct 13, 2025
* origin/master: (32 commits)
metal : FA support F32 K and V and head size = 32 (ggml-org#16531)
graph : support cacheless embeddings with FA and iSWA (ggml-org#16528)
opencl: fix build targeting CL 2 (ggml-org#16554)
CUDA: fix numerical issues in tile FA kernel (ggml-org#16540)
ggml : fix build broken with -march=armv9-a on MacOS (ggml-org#16520)
CANN: fix CPU memory leak in CANN backend (ggml-org#16549)
fix: add remark plugin to render raw HTML as literal text (ggml-org#16505)
metal: add support for opt_step_sgd (ggml-org#16539)
ggml : fix scalar path for computing norm (ggml-org#16558)
CANN: Update several operators to support FP16 data format (ggml-org#16251)
metal : add opt_step_adamw and op_sum (ggml-org#16529)
webui: remove client-side context pre-check and rely on backend for limits (ggml-org#16506)
[SYCL] fix UT fault cases: count-equal, argsort, pad OPs (ggml-org#16521)
ci : add Vulkan on Ubuntu with default packages build (ggml-org#16532)
common : handle unicode during partial json parsing (ggml-org#16526)
common : update presets (ggml-org#16504)
ggml : Fix FP16 ELU positive branch (ggml-org#16519)
hparams : add check for layer index in is_recurrent (ggml-org#16511)
ggml: Correct SVE implementation in ggml_vec_dot_f16_unroll (ggml-org#16518)
CUDA: faster tile FA, add oob checks, more HSs (ggml-org#16492)
...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants