Could you add support for llama.cpp? #987

tdzard94 · 2026-05-02T15:47:42Z

tdzard94
May 2, 2026

I’m using llama.cpp to run the Qwen3.6-27B model on my machine with a relatively low context size, and I access it over my LAN because running it through Ollama or LM Studio is too demanding for my system. I noticed that OpenClaude can’t retrieve context usage information from llama.cpp, so I never really know when the context window is getting full and needs compaction. Would it be possible to add this feature?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gitlawb

Could you add support for llama.cpp? #987

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Gitlawb

Could you add support for llama.cpp? #987

Uh oh!

tdzard94 May 2, 2026

Replies: 0 comments

tdzard94
May 2, 2026