Skip to content

Sync master with upstream release b8242#447

Merged
jan-service-account merged 9 commits intodevfrom
update-dev-from-master-2026-03-09-00-47
Mar 9, 2026
Merged

Sync master with upstream release b8242#447
jan-service-account merged 9 commits intodevfrom
update-dev-from-master-2026-03-09-00-47

Conversation

@jan-service-account
Copy link
Copy Markdown

Updates dev branch with latest release (b8242) from ggml-org/llama.cpp

arthw and others added 9 commits March 8, 2026 12:00
* support flash-attention for fp32/fp16/Q4/Q5/Q8

* rm warining

* update for JIT
* Revert to OAI-compatible args

* Apply workaround::func_args_not_string
* tests: add end-to-end tests per model architecture

* fixup for rebase

* fix use-after-free in llama-model-loader.cpp

* fix CI

* fix WebGPU

* fix CI

* disable CI for macOS-latest-cmake-arm64

* use expert_weights_scale only if != 0.0f

* comments
* vulkan: Fix data races in coopmat1 mul_mat(_id)

Add barriers between coopmat store and regular loads. We sort of got away with
this because it was the same subgroup accessing the values, but it's still a
race and may not work.

* switch to subgroup control barriers
* ggml-Vulkan: add ELU support

* ggml-Vulkan: remove extra spaces and variables

* ggml-Vulkan: fix format issue

* ggml-Vulkan: fix format issue

* fix whitespace issue

* Update Vulkan.csv and ops.md
* Fix structured outputs

* Update common/chat-auto-parser-generator.cpp

Co-authored-by: Aldehir Rojas <hello@alde.dev>

---------

Co-authored-by: Aldehir Rojas <hello@alde.dev>
* Fix compile bug

* Update common/chat-auto-parser-helpers.cpp

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
@jan-service-account jan-service-account merged commit 4ae95d3 into dev Mar 9, 2026
1 check passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2026-03-09-00-47 branch March 9, 2026 00:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants