raggity v0.10.0 — model switching, doctor, provider auto-discovery, personas

IxMxAMAR released this 03 Jul 16:34

rag model — switch generation backend/model from the CLI: rag model gemma3:4b -p ollama edits your raggity.toml in place (comments preserved). Providers: claude/anthropic, openai, ollama, lmstudio, llamacpp, vllm, jan, koboldcpp — local OpenAI-compatible servers get the right base_url automatically and don't demand an API key. rag model --list shows every local LLM runtime discovered on your machine (running / installed / models available).
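As a rough illustration of how a provider name might map to a base URL and an API-key requirement, here is a minimal sketch. It is not raggity's actual code: the names `ProviderDefaults`, `LOCAL_PORTS`, and `provider_defaults` are hypothetical, and the ports are the commonly documented defaults of each runtime, assumed here rather than taken from raggity's source.

```python
from typing import NamedTuple, Optional


class ProviderDefaults(NamedTuple):
    base_url: Optional[str]  # None means "use the hosted API's own endpoint"
    needs_api_key: bool


# Assumption: each runtime's commonly documented default port.
# raggity's real table may differ.
LOCAL_PORTS = {
    "ollama": 11434,
    "lmstudio": 1234,
    "llamacpp": 8080,
    "vllm": 8000,
    "jan": 1337,
    "koboldcpp": 5001,
}

# Hosted providers named in the release notes; these still need a key.
HOSTED = {"claude", "anthropic", "openai"}


def provider_defaults(name: str) -> ProviderDefaults:
    """Return the assumed base_url and key requirement for a provider name."""
    key = name.strip().lower()
    if key in HOSTED:
        return ProviderDefaults(base_url=None, needs_api_key=True)
    if key in LOCAL_PORTS:
        # Local OpenAI-compatible servers conventionally expose a /v1 prefix.
        url = f"http://localhost:{LOCAL_PORTS[key]}/v1"
        return ProviderDefaults(base_url=url, needs_api_key=False)
    raise ValueError(f"unknown provider: {name!r}")
```

With this sketch, `provider_defaults("ollama")` yields a localhost URL and no key requirement, while `provider_defaults("openai")` leaves the URL unset and asks for a key.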

Provider auto-discovery & auto-start — raggity probes known local runtimes and, when your configured backend is Ollama (or LM Studio with its CLI present) and the server isn't running, finds the binary and starts it automatically — no more manual ollama serve. Gated by generation.auto_start (default on).
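The probe-then-start behaviour described above could look roughly like the following. This is a sketch under stated assumptions, not raggity's implementation: `is_listening` and `ensure_ollama` are hypothetical names, port 11434 is Ollama's usual default, and the `auto_start` parameter stands in for the `generation.auto_start` setting.

```python
import shutil
import socket
import subprocess
import time


def is_listening(host: str, port: int, timeout: float = 0.5) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def ensure_ollama(auto_start: bool = True, host: str = "127.0.0.1",
                  port: int = 11434, wait_s: float = 10.0) -> bool:
    """Return True once a server is reachable, starting it if allowed."""
    if is_listening(host, port):
        return True
    if not auto_start:
        return False  # auto-start disabled: report, do not launch
    binary = shutil.which("ollama")  # look the binary up on PATH
    if binary is None:
        return False  # not installed, nothing to start
    # Launch `ollama serve` in the background and discard its output.
    subprocess.Popen([binary, "serve"],
                     stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    deadline = time.monotonic() + wait_s
    while time.monotonic() < deadline:
        if is_listening(host, port):
            return True
        time.sleep(0.25)
    return False
```

Polling the port after launch, rather than returning straight away, is what lets the caller send its first request without racing the server's startup.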

rag doctor — one command that checks your whole setup: config, sources, index, embedding model, generation backend (Claude auth / Ollama reachable and model pulled / OpenAI key), optional extras, write access — with ok/warn/FAIL markers and fix hints. It doesn't just diagnose a stopped Ollama; it starts it.
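One way to picture a doctor-style command is as a list of named checks, each returning a status and an optional fix hint. The sketch below is illustrative only: `run_checks`, the check names, and the output layout are assumptions, and only the ok/warn/FAIL markers come from the notes above.

```python
from typing import Callable, List, Tuple

# A check returns (status, hint); status is "ok", "warn", or "FAIL".
Check = Callable[[], Tuple[str, str]]


def run_checks(checks: List[Tuple[str, Check]]) -> Tuple[List[str], int]:
    """Run every check, format one line per check, return (lines, exit_code)."""
    lines = []
    exit_code = 0
    for name, check in checks:
        try:
            status, hint = check()
        except Exception as exc:  # a crashing check counts as a failure
            status, hint = "FAIL", f"check raised {type(exc).__name__}: {exc}"
        if status == "FAIL":
            exit_code = 1
        line = f"[{status:>4}] {name}"
        if hint:
            line += f" ({hint})"
        lines.append(line)
    return lines, exit_code


if __name__ == "__main__":
    demo = [
        ("config", lambda: ("ok", "")),
        ("optional extras", lambda: ("warn", "install the extras you need")),
        ("index", lambda: ("FAIL", "rebuild the index")),
    ]
    report, code = run_checks(demo)
    print("\n".join(report))
    raise SystemExit(code)
```

Every check runs even after a failure, so a single invocation reports the whole setup instead of stopping at the first problem.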

Opt-in personalization: generation.personal_kb = true makes first-person questions ('who am I', 'my …') treat the knowledge base as belonging to the user; generation.persona adds free-text user context. Both append to the system prompt without weakening citation/abstention rules, and the default prompt is byte-identical to before. Multi-tenant servers can set per-API-key personas ([server.personas]).
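The append-only behaviour can be sketched as follows. The prompt wording, `PERSONAL_KB_NOTE`, `build_system_prompt`, and `persona_for_key` are all hypothetical and are not taken from raggity. The sketch only shows the property described above: with both options off, the base prompt is returned unchanged.

```python
from typing import Dict, Optional

# Hypothetical wording; the real text raggity appends is not in these notes.
PERSONAL_KB_NOTE = (
    "The knowledge base belongs to the user; first-person questions "
    "refer to its owner."
)


def build_system_prompt(base: str, personal_kb: bool = False,
                        persona: Optional[str] = None) -> str:
    """Append opt-in personalization after the base prompt, never editing it."""
    extras = []
    if personal_kb:
        extras.append(PERSONAL_KB_NOTE)
    if persona and persona.strip():
        extras.append("About the user: " + persona.strip())
    if not extras:
        return base  # defaults: byte-identical to the base prompt
    return "\n\n".join([base] + extras)


def persona_for_key(api_key: str, personas: Dict[str, str],
                    default: Optional[str] = None) -> Optional[str]:
    """Pick a per-API-key persona, falling back to the global one."""
    return personas.get(api_key, default)
```

Because the extras are only ever appended, any citation or abstention rules in the base prompt stay in place word for word.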

Also: ASCII-safe log strings on Windows consoles. 425 tests, 0 warnings. New base dep: tomlkit (comment-preserving config edits).