RelyLoop v0.1.0 — MVP1 alpha
RelyLoop v0.1.0 — MVP1 alpha
What's in MVP1
The full Karpathy loop end-to-end on Elasticsearch and OpenSearch, single-tenant, no auth, Docker Compose:
- Engine adapter — one
SearchAdapterProtocol covering both ES 8.11+/9.x and OpenSearch 2.x/3.x. Cluster registration via UI or API. - Optuna optimizer — TPE sampler against a parametrized query template; up to N trials per study; per-trial budget guard;
pytrec_evalmetrics (ndcg@k,map,precision,recall,mrr,err). - LLM-as-judge —
POST /api/v1/judgments/generaterates query-document pairs against a rubric. ~$0.01–$0.05 per query set withgpt-4o-mini. Provider-agnostic: works against any OpenAI-compatible endpoint (Ollama / LM Studio / vLLM / TGI) viaOPENAI_BASE_URL. - Digest — LLM-generated narrative summary of each completed study, plus parameter-importance chart and recommended config.
- GitHub PR worker — winning configs land as Pull Requests against a central search-config Git repo. Operator's CI deploys.
- Chat agent — describe the problem in chat; the agent introspects the cluster, proposes a search-space, and queues the study after operator confirmation. 19-tool surface.
- Operator tutorial + sample data — 1,000 curated Amazon ESCI products + 48 queries + canonical Jinja2 query template.
docs/08_guides/tutorial-first-study.mdwalksgit clone → Open PRin under 30 minutes on a fresh laptop. - CI smoke gate — every PR runs the full Karpathy loop end-to-end against a fresh stack with a budgeted OpenAI key. Same operator path as the tutorial; no degraded variants.
Full feature list: see docs/02_product/mvp1-user-stories.md.
Audience
Technical evaluators, Relevance Engineers, and search-platform teams considering an open-source query-tuning tool. Not yet production-deployable — see docs/01_architecture/deployment.md for the MVP1 → MVP3 → GA v1 deployment maturity ramp.
How to install
Follow the tutorial: docs/08_guides/tutorial-first-study.md.
Operators build images locally via make up. Pre-built GHCR images ship at MVP3 per the canonical release matrix; until then, make up triggers a local Docker build of relyloop/api and relyloop/ui on first run.
Known limitations
This is alpha. Three operator-visible issues are tracked but not blocking:
- Long chat sessions silently drop context after 100 messages. The agent prompt-window cap is brute-force; smarter context management ships in MVP2. Tracked:
bug_chat_long_conversation_truncation. - Query templates created via the API with declared params can't be used for LLM judgment generation. Workaround: use one template with
declared_params={}for judgment generation, a separate template with declared params for the optimization study (this is what the tutorial does). Tracked:bug_judgment_template_default_params_contract. - Worker may need a manual restart after first-run
make migrate. If youmake upand immediately fire a study beforemake migratecompletes, the Arq worker dies on Optuna schema init and stays down. Workaround:docker compose restart workeraftermake migrate. Tracked:bug_worker_optuna_init_race.
How to provide feedback
- GitHub Discussions: https://github.com/SoundMindsAI/relyloop/discussions
- Issues (bug reports, feature requests): https://github.com/SoundMindsAI/relyloop/issues/new/choose
Roadmap
| Release | Theme | Adds |
|---|---|---|
| MVP1 / v0.1.0 (you are here) | "The Loop" | ES + OpenSearch adapter, OpenAI-compatible LLM, GitHub provider, single-tenant, no auth, Docker Compose, 80% coverage gate |
| MVP2 / v0.2 | "Observable" | Langfuse + ClickHouse + SigNoz; canonical event catalog; audit_log + immutability trigger; lineage columns; PII redaction; trace propagation |
| MVP3 / v0.3 | "Production Stacks" | Lucidworks Fusion adapter; multi-Git-provider abstraction (GitLab, Bitbucket); production install (TLS via Caddy + Let's Encrypt, managed Postgres/Redis); AWS managed OpenSearch |
| MVP4 / v0.4 | "Multi-tenant, Multi-LLM" | tenants + tenant_memberships + users + api_keys; tenant_id columns + backfill; SSO via reverse proxy; native non-OpenAI provider SDKs |
| GA v1 | "Production-ready" | LangGraph orchestrator + PostgresSaver; full RFC 7807 errors; Idempotency-Key; Helm chart; container scanning; image signing |
Canonical release matrix: docs/01_architecture/tech-stack.md.