Skip to content

v0.7.0

Latest

Choose a tag to compare

@github-actions github-actions released this 03 Jul 21:16
  • feat(adapters): the self-host tier is real — a local adapter for any
    OpenAI-compatible server (Ollama, vLLM, LM Studio, SGLang, DwarfStar/DS4):
    user-declared capabilities and context window (the router's capability floor —
    never guessed), $0 default pricing (overridable, never invented), an explicit
    note when a server reports no token usage, and byte-exact tool-call ID replay
    promoted into the shared conformance battery for every adapter. New
    glam run --local / routing.localOnly fail loud rather than silently
    falling back to a hosted provider. Catalog gains ollama/vllm/lmstudio rows
    plus dwarfstar-ds4 and ornith-1.0-9b/35b with honest beta/unverified flags.
    Live-verified against a real Ollama daemon: keyless glam run tool
    round-trip at cost: $0.000000. (#25)