Sync master with upstream release b8508 by jan-service-account · Pull Request #464 · janhq/llama.cpp

jan-service-account · 2026-03-25T00:49:07Z

Updates dev branch with latest release (b8508) from ggml-org/llama.cpp

…on and fix gpt-oss (ggml-org#20912)

* llama-fit: fix regex pattern for gate_up tensors * Apply suggestions from code review Co-authored-by: Johannes Gäßler <johannesg@5d6.de> --------- Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* common : add standard Hugging Face cache support - Use HF API to find all files - Migrate all manifests to hugging face cache at startup Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Check with the quant tag Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Cleanup Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Improve error handling and report API errors Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Restore common_cached_model_info and align mmproj filtering Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Prefer main when getting cached ref Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Use cached files when HF API fails Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Use final_path.. Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Check all inputs Signed-off-by: Adrien Gallouët <angt@huggingface.co> --------- Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

Co-authored-by: nryoo <nryoo@nryooui-MacBookPro.local>

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* autoresize textarea on mount * allow textarea to grow to same height as rendered messages * add UI build file

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

…rg#20927)

…20943) * models : move the token embedding norms to the first layer * cont : fix LLM_TENSOR_CONV1D + fix il indexing

aldehir and others added 13 commits March 23, 2026 22:21

common : replace wrap_for_generation with a prefix convenience functi…

312d870

…on and fix gpt-oss (ggml-org#20912)

llama-fit: fix regex pattern for gate_up tensors (ggml-org#20910)

e852eb4

* llama-fit: fix regex pattern for gate_up tensors * Apply suggestions from code review Co-authored-by: Johannes Gäßler <johannesg@5d6.de> --------- Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

issues: add openvino backends (ggml-org#20932)

c2e224d

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

metal : add FA instantiations for HSK=512, HSV=512 (ggml-org#20902)

342d612

metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (ggml-org#20930)

92080b4

Co-authored-by: nryoo <nryoo@nryooui-MacBookPro.local>

common : add a WARNING for HF cache migration (ggml-org#20935)

2d2d9c2

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

readme : clarify MODEL_ENDPOINT usage (ggml-org#20941)

c9dc433

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

WebUI: fix edit msg form textarea height (ggml-org#20830)

a94fdb0

* autoresize textarea on mount * allow textarea to grow to same height as rendered messages * add UI build file

common : fix get_gguf_split_info (ggml-org#20946)

42ebce3

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

vendor : update cpp-httplib to 0.39.0 (ggml-org#20933)

29771a0

ggml-backend: re-enable graph reuse with pipeline parallelism (ggml-o…

3fc6f1a

…rg#20927)

models : move the token embedding norms to the first layer (ggml-org#…

9f102a1

…20943) * models : move the token embedding norms to the first layer * cont : fix LLM_TENSOR_CONV1D + fix il indexing

jan-service-account merged commit 972e464 into dev Mar 25, 2026
3 checks passed

jan-service-account deleted the update-dev-from-master-2026-03-25-00-49 branch March 25, 2026 00:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync master with upstream release b8508#464

Sync master with upstream release b8508#464
jan-service-account merged 13 commits intodevfrom
update-dev-from-master-2026-03-25-00-49

jan-service-account commented Mar 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

Conversation

jan-service-account commented Mar 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants