Skip to content

Sync master with upstream release b7062#324

Merged
jan-service-account merged 8 commits intodevfrom
update-dev-from-master-2025-11-15-00-34
Nov 15, 2025
Merged

Sync master with upstream release b7062#324
jan-service-account merged 8 commits intodevfrom
update-dev-from-master-2025-11-15-00-34

Conversation

@jan-service-account
Copy link
Copy Markdown

Updates dev branch with latest release (b7062) from ggml-org/llama.cpp

allozaur and others added 8 commits November 14, 2025 01:19
Signed-off-by: Wang Yang <yangwang@iscas.ac.cn>
* metal : refactor argsort

* cont : sort chunks

* cont : merge sorted buckets

* cont : cleanup
…nstruction (ggml-org#17048)

* fix : Dangling pointer for non-empty trigger words in llama_sampler_init_grammar_impl (ggml-org#17047)

* Replace 'static' workaround, with keeping variable in scope for longer

* Create std::array directly and pass into llama_grammar_init_impl

* Add back the trigger pattern

* Missed array include
* Add AFMOE model support

* Update to vocab

* Add model sizing

* Undo Rope change for ARCEE model

* Address review comments

* Update modeling code is_sliding -> use_rope, replace hard-coded logic

* Fix AFMOE tokenizer

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Update AFMoE tokenizer class identification to be more unique

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
@jan-service-account jan-service-account merged commit 6d957f6 into dev Nov 15, 2025
3 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-11-15-00-34 branch November 15, 2025 00:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants