Support for llama-swap, llama.cpp and also vllm #441
Replies: 2 comments
-
|
Llama-swap for sure, and very likely others as well, function perfectly fine as OpenAI Compatible providers, which Kanban seems to support well. For Llama-swap specifically, I've had luck creating a new provider in my list and pointing the base URL to my llama-swap instance. Notably, there seems to be a bug at the moment that API key is a required field, even if the provider doesn't actually validate it. If you llama-swap instance isn't configured to use a key, you should be able to put any value in the API key field and have it work. |
Beta Was this translation helpful? Give feedback.
-
|
Local model support is most useful if it is treated as a provider contract rather than a special case for one runtime. Many local stacks can present an OpenAI-compatible API, but their capabilities differ a lot: tool calling, structured outputs, context length, streaming, multimodal support, and prompt caching are not guaranteed. A practical implementation would let users configure:
For agentic workflows, capability detection matters. If a local model does not support reliable tool calling or structured output, the UI should show that clearly instead of failing later during task execution. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Could we please make the new cline compatible with local llm's? local llms are run for very long running tasks and also where privacy is required (e.g. on data containing government id's etc).
Beta Was this translation helpful? Give feedback.
All reactions