Skip to content

Conversation

@jan-service-account
Copy link

Updates dev branch with latest release (b5488) from ggml-org/llama.cpp

qnixsynapse and others added 10 commits May 25, 2025 10:08
…-org#13752)

Temporarily reverted due to failing fp16 DIV operation

This reverts commit 02cdd2d.

ggml-ci
Co-authored-by: ochafik <ochafik@google.com>
* Multimodal: Added Moondream2 model and fixed ggml.org link

* Apply suggestions from code review

---------

Co-authored-by: name <none@none.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
* mtmd : add Qwen2-Audio support

* small clean up

* update discussion link

* clarify mtmd_get_output_embd

* clarification in multimodal.md

* fix ultravox bug

* ggml_cont
* kv-cache : rework kv_cell

ggml-ci

* kv-cells : use "shift" instead of "delta" consistently

ggml-ci

* llama : add llama_max_parallel_sequences()

ggml-ci

* kv-cells : update comments [no ci]

* context : fail upon construction if sequences exceed max value

ggml-ci

* kv-cells : get_pos() -> pos_get() + comments

ggml-ci

* kv-cells : fix tracking of "used" cells

ggml-ci
… w/ enable_thinking:false) (ggml-org#13771)

---------

Co-authored-by: ochafik <ochafik@google.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
@jan-service-account jan-service-account merged commit de7bfe2 into dev May 26, 2025
15 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-05-26-00-09 branch May 26, 2025 00:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants