
[pull] master from ggml-org:master #770

Merged
pull[bot] merged 3 commits into LongLeCE:master from ggml-org:master
Jan 12, 2026

pull bot commented Jan 12, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

angt and others added 3 commits January 12, 2026 15:58
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
* server : add arg for disabling prompt caching

Disabling prompt caching is useful for clients who are restricted to
sending only OpenAI-compat requests and want deterministic
responses.

* address review comments

* address review comments
pull bot locked and limited conversation to collaborators Jan 12, 2026
pull bot added the ⤵️ pull label Jan 12, 2026
pull bot merged commit bcf7546 into LongLeCE:master Jan 12, 2026

2 participants