Skip to content

Sync master with upstream release b8262#449

Merged
jan-service-account merged 10 commits intodevfrom
update-dev-from-master-2026-03-11-00-43
Mar 11, 2026
Merged

Sync master with upstream release b8262#449
jan-service-account merged 10 commits intodevfrom
update-dev-from-master-2026-03-11-00-43

Conversation

@jan-service-account
Copy link
Copy Markdown

Updates dev branch with latest release (b8262) from ggml-org/llama.cpp

am17an and others added 10 commits March 9, 2026 16:15
* Map developer role to system
* Simplify
)

* server : fix off-by-1 in server_tokens::size_up_to_pos()

* cont : fix typo [no ci]
Enable mul_mv_ext small-batch kernels (BS 2-8) for BF16, Q2_K,
and Q3_K quantization types. These types previously fell through
to the slower single-row mul_mv path.

BF16 uses the float4 dequantize path (like F16). Q2_K and Q3_K
use the float4x4 K-quant path (like Q4_K/Q5_K/Q6_K).

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
…rg#20208)

* Parse port numbers from MCP server URLs

* Pass scheme to http proxy for determining whether to use SSL

* Fix download on non-standard port and re-add port to logging

* add test

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
@jan-service-account jan-service-account merged commit 9cef3e9 into dev Mar 11, 2026
1 check passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2026-03-11-00-43 branch March 11, 2026 00:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants