Skip to content

Conversation

@jan-service-account
Copy link

Updates dev branch with latest release (b5509) from ggml-org/llama.cpp

ggerganov and others added 8 commits May 27, 2025 09:40
* llama : validate seq id batch input

ggml-ci

* cont : fix the fix

ggml-ci
* sampling : min-p should always return at least one token

ggml-ci

* sampling : same for typical sampling

* tests : sampling tests use min_keep == 0

ggml-ci
…org#13808)

* kv-cells : track min/max used cells and per-sequence positions

ggml-ci

* kv-cells : fix pos-modification updates for seq_pos

ggml-ci

* kv-cells : add comments

ggml-ci
…gml-org#13784)

* mtmd : allow multiple modalities at the same time

* refactor mtmd tokenizer

* fix compile

* ok, missing SinusoidsPositionEmbedding

* first working version

* fix style

* more strict validate of n_embd

* refactor if..else to switch

* fix regression

* add test for 3B

* update docs

* fix tokenizing with add_special

* add more tests

* fix test case "huge"

* rm redundant code

* set_position_mrope_1d rm n_tokens
* ggml : riscv: add xtheadvector support

* ggml : clean up some macro usage
@jan-service-account jan-service-account merged commit cebd471 into dev May 27, 2025
15 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-05-27-14-47 branch May 27, 2025 14:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants