Sync master with upstream release b8508 #464

Merged
jan-service-account merged 13 commits into dev from
update-dev-from-master-2026-03-25-00-49
Mar 25, 2026

@jan-service-account

Updates dev branch with latest release (b8508) from ggml-org/llama.cpp

aldehir and others added 13 commits March 23, 2026 22:21
* llama-fit: fix regex pattern for gate_up tensors

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* common : add standard Hugging Face cache support

- Use HF API to find all files
- Migrate all manifests to hugging face cache at startup

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Check with the quant tag

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Cleanup

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Improve error handling and report API errors

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Restore common_cached_model_info and align mmproj filtering

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Prefer main when getting cached ref

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Use cached files when HF API fails

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Use final_path..

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* Check all inputs

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

---------

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
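
For readers unfamiliar with the standard Hugging Face cache layout that the commit above targets, here is a minimal sketch of the "prefer main when getting cached ref" lookup. It is not the llama.cpp implementation (which is C++ and not shown in this PR page). The function names `repo_dir`, `cached_commit` and `cached_file` are hypothetical. The sketch assumes only the documented hub layout: `models--<org>--<name>/refs/<ref>` holds a commit hash, and files live under `snapshots/<commit>/`.

```python
from pathlib import Path
from typing import Optional


def repo_dir(cache_root: Path, repo_id: str) -> Path:
    # Hub cache folders are named "models--<org>--<name>".
    return cache_root / ("models--" + repo_id.replace("/", "--"))


def cached_commit(cache_root: Path, repo_id: str) -> Optional[str]:
    """Return the cached commit hash, preferring the 'main' ref (hypothetical helper)."""
    refs = repo_dir(cache_root, repo_id) / "refs"
    if not refs.is_dir():
        return None
    main = refs / "main"
    if main.is_file():
        return main.read_text().strip()
    # No 'main' ref cached: fall back to the first other top-level ref, sorted by path.
    for ref in sorted(p for p in refs.iterdir() if p.is_file()):
        return ref.read_text().strip()
    return None


def cached_file(cache_root: Path, repo_id: str, filename: str) -> Optional[Path]:
    """Resolve a file inside the cached snapshot, or None if it is not cached."""
    commit = cached_commit(cache_root, repo_id)
    if commit is None:
        return None
    path = repo_dir(cache_root, repo_id) / "snapshots" / commit / filename
    return path if path.exists() else None
```

A lookup like this needs no network access, which is the property that "Use cached files when HF API fails" appears to rely on: if the API call errors out, a caller can still resolve whatever snapshot is already on disk.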
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
Co-authored-by: nryoo <nryoo@nryooui-MacBookPro.local>
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
* autoresize textarea on mount

* allow textarea to grow to same height as rendered messages

* add UI build file
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
…20943)

* models : move the token embedding norms to the first layer

* cont : fix LLM_TENSOR_CONV1D + fix il indexing
@jan-service-account jan-service-account merged commit 972e464 into dev Mar 25, 2026
3 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2026-03-25-00-49 branch March 25, 2026 00:50
9 participants