@jan-service-account

Updates dev branch with latest release (b5223) from ggml-org/llama.cpp

ngxson and others added 8 commits April 29, 2025 09:45
* llama-graph : fix text position for mrope

* fix typo

* explicitly set 4th dim in the loop

* llava : add clip_n_output_tokens, deprecate clip_n_patches

* mtmd : add qwen2vl and qwen2.5vl

* decode_embd_batch::set_position_...

* working version

* deprecate llama-qwen2vl-cli

* correct order W, H of clip_embd_nbytes_by_img

* edit existing line in hot topics
…org#13174)

* Prefilling assistant message in openai compatible API

* fixed indentation

* fixed code convention

* simplify method usage

* no more than one assistant message at end of messages

* merge checks into prefill code

* Update examples/server/utils.hpp

---------

Co-authored-by: matteo <matteo@naspc.lan>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
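
The prefill commits above describe treating a trailing assistant message as the start of the reply and allowing no more than one such message at the end of `messages`. A minimal Python sketch of that check follows. It is an illustration under assumptions, not the code in examples/server/utils.hpp: the function names `trailing_assistant_count` and `split_prefill` are hypothetical, and the message shape simply follows the usual `role`/`content` chat format.

```python
def trailing_assistant_count(messages):
    """Count consecutive assistant messages at the end of the list."""
    count = 0
    for msg in reversed(messages):
        if msg.get("role") != "assistant":
            break
        count += 1
    return count


def split_prefill(messages):
    """Split a chat into (history, prefill_text).

    Hypothetical helper: prefill_text is the content of a single trailing
    assistant message, or None when the chat does not end with one.
    """
    if trailing_assistant_count(messages) > 1:
        # Mirrors the rule named in the commit list above.
        raise ValueError("no more than one assistant message at end of messages")
    if messages and messages[-1].get("role") == "assistant":
        return messages[:-1], messages[-1].get("content", "")
    return list(messages), None
```

With this sketch, a chat ending in a user message yields no prefill, a chat ending in one assistant message yields that message's content as the text the model would continue from, and two or more trailing assistant messages are rejected.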
@jan-service-account jan-service-account merged commit 7720935 into dev Apr 30, 2025
9 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-04-30-00-08 branch April 30, 2025 00:19

8 participants