Let's build play.observal.io, a zero-setup way to feel Observal in 60 seconds #437

Apoorvgarg-creator · 2026-04-20T12:12:40Z

Apoorvgarg-creator
Apr 20, 2026
Maintainer

Hey folks 👋

Floating this idea for community input before I spike anything. TL;DR: a hosted, resettable playground at play.observal.io so people can understand Observal without running 8 Docker services on their laptop.

The use case we're trying to kill

Today the "can I try this?" path looks like:

git clone → docker compose up (Postgres + ClickHouse + Redis + OTel + Grafana + API + Web + worker)
→ wait 3 min → .env → observal auth login → scan → pull → finally see a trace

That's fine for someone who already wants Observal. It's brutal for someone who just saw a tweet and has 5 minutes. Most of them bounce before they ever see a trace, a scorecard, or an agent page.

Problem: we're losing people at "can I see it work?" before we ever get to "should I self-host it?"

What I want play.observal.io to be:

👉 Click a link → land on a real Observal web UI with pre-seeded agents, traces, spans, and scorecards
👉 pip install observal && observal auth login --playground → CLI just works, points at the playground, no server needed
👉 Play with pulling an agent, running observal ops traces, seeing a scorecard - end-to-end, in a browser tab
👉 Nothing they do affects anyone else, nothing leaks, and we don't get a $4k LLM bill

The constraint that makes this spicy

Observal is self-hosted by design. The whole point is that your keys, your traces, your agents live on your infrastructure. A playground contradicts that since we'd be running the one managed Observal instance that exists.

So the rule I'd like us to hold: zero forks, zero if playground: ... branches in the core code. Anything we need should be configurable via env vars or feature flags we already have, or a thin wrapper service in front. If we start special-casing "playground mode" inside the API, it becomes a maintenance tax on every future PR.

🌱 A small starting sketch (so we have something to pick apart)

Here's the simplest thing I think could work - please tear it apart:

1. One shared server, ephemeral per-visitor "workspace"

Every new browser session gets a synthetic user auto-provisioned (behind the scenes) inside a dedicated "playground" org
Session lives ~30 min. Nightly cron wipes all playground orgs and re-seeds from a fixture
No email, no password - session cookie only. Grafana Play does something similar with anonymous access

2. CLI --playground flag issues a scoped demo key

observal auth login --playground → opens browser → backend mints a short-lived API key tied to that same ephemeral workspace
CLI now talks to play.observal.io exactly like it would to a self-hosted server. No CLI code changes beyond reading a flag

3. Pre-seeded fixtures, not real telemetry

Ship a playground-seed.yaml - 5 agents, 3 MCP servers, 2 sandboxes, a few hundred fake traces with realistic-looking spans and scorecards
This is the demo data everyone lands on. Fresh install feel, without waiting for anyone to actually run an agent

4. Eval engine gets a mock provider in playground mode only

EVAL_MODEL_PROVIDER=mock → returns canned scorecards instead of hitting Bedrock/OpenAI
This is the big cost protector. Real eval + public access = LLM bill from hell
Already env-var driven, so no code change

5. Nightly reset via docker compose down -v && up

Accept that it's a disposable demo and periodically prune
One cron, full wipe, fresh fixtures. Easier than enforcing a thousand per-tenant limits

💸 Where to host without going broke

Rough cost napkin - open to better suggestions:

Piece	Option A (cheap)	Option B (nicer)
App/API/Web	Hetzner CX22 (€4/mo) or Fly.io free tier	Railway / Render small instance
Postgres	Neon free tier	Supabase free
ClickHouse	Self-host on same VM, 14-day TTL	ClickHouse Cloud dev tier
Redis	Upstash free tier	—
LLM for eval	Mocked (see above)	—
Domain + TLS	Cloudflare (free)	—

My gut: we can run this for <$15/mo if we're disciplined about retention and mocking the eval engine. The second we let real LLM calls through, costs become unbounded.

🔒 Keeping user data safe without touching core code

This is the part I want the most opinions on. Things I'd lean on:

Everything runs in a "playground" org - standard RBAC we already have keeps cross-tenant data from leaking
Reverse proxy in front (Caddy / Cloudflare Worker) enforces stricter rate limits, strips admin endpoints, blocks /api/v1/admin/* entirely for public traffic
DEPLOYMENT_MODE=local turned off, registration disabled, SSO off, bootstrap off. All accounts are minted server-side by the session provisioner
Sandbox MCPs only - pre-approved safe MCP servers in the registry. No arbitrary MCP execution; the sandbox component type stays disabled or heavily restricted
No outbound secrets - eval is mocked, no real API keys on the server, so even a full breach reveals nothing but synthetic traces
TTL on ClickHouse - we already have DATA_RETENTION_DAYS, just turn it down to 1

If there's data worth leaking on play.observal.io, we've already done something wrong upstream.

❓ What I'd love the community to weigh in on

Ephemeral-session vs always-on shared demo - do we want per-visitor isolation, or one big shared org where everyone sees each other's toy edits (more "fun", way weirder threat model)?
Should the CLI ship with a built-in --playground or should we just document the URL? First is smoother UX, second is less coupling
What's the first "aha!" moment a new visitor should hit? Seeing a scorecard? Pulling an agent? A live trace stream via GraphQL subscription? We should design the fixtures around that one moment
Anyone have cheap ClickHouse hosting war stories? That's the one component I'm least sure about
Do we need an "arcade"-style guided tour, or is a well-seeded UI enough?

Drop thoughts below 👇 - I'd rather merge the community's ideas than ship my first draft.

Sources used while drafting:

Kaushik-Kumar-CEG · 2026-04-20T13:14:15Z

Kaushik-Kumar-CEG
Apr 20, 2026
Collaborator

this seems nice
ill try working on this and ask you for any queries

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Let's build play.observal.io, a zero-setup way to feel Observal in 60 seconds #437

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

Let's build play.observal.io, a zero-setup way to feel Observal in 60 seconds #437

Uh oh!

Uh oh!

Apoorvgarg-creator Apr 20, 2026 Maintainer

The use case we're trying to kill

The constraint that makes this spicy

🌱 A small starting sketch (so we have something to pick apart)

💸 Where to host without going broke

🔒 Keeping user data safe without touching core code

❓ What I'd love the community to weigh in on

Replies: 1 comment

Uh oh!

Kaushik-Kumar-CEG Apr 20, 2026 Collaborator

Apoorvgarg-creator
Apr 20, 2026
Maintainer

Kaushik-Kumar-CEG
Apr 20, 2026
Collaborator