An autonomous AI agent that monitors other AI applications — and takes action when their quality degrades.
Built for the Google Cloud Rapid Agent Hackathon · Arize Track.
LLM applications fail silently. A prompt change ships, a model goes stale on new information, an answer quietly gets vague — and nobody notices until users complain. Observability platforms collect the traces, but someone still has to look at them.
flightcheck is that someone — an agent instead of a person.
flightcheck monitors a real, live LLM application and runs an autonomous quality-check loop on demand:
- Pulls project stats from the Arize Phoenix observability platform.
- Pulls recent traces — the actual user questions and assistant answers.
- Reasons over the answers to judge quality: are they specific and helpful, or vague, evasive, and degraded?
- If quality has degraded, files a severity-tagged alert to a Discord channel — autonomously, no human in the loop.
- Reports a clear verdict.
It is a true agent: it plans a multi-step mission, calls tools, and takes a real action — it doesn't just chat.
To monitor something real, this project includes a real application: Lineup, a Valorant strategy coach chatbot. Lineup answers tactical questions (agent comps, site executes, ability lineups), uses live web search to stay current on new agents and maps, and emits a trace to Phoenix on every message. flightcheck watches Lineup the same way it would watch any production LLM app.
Lineup chatbot (Gemini + Tavily web search)
│ every conversation traced
▼
Arize Phoenix ── observability / trace storage
│
▼
flightcheck MCP server (FastMCP, on Cloud Run)
│ exposes trace-reading + alerting tools over MCP
▼
flightcheck agent (Gemini, Google Cloud Agent Builder)
│ plans · inspects · judges · acts
▼
Discord alert ── autonomous notification when quality degrades
- Gemini — powers both the flightcheck agent and the Lineup chatbot
- Google Cloud Agent Builder (CX Agent Studio) — hosts the flightcheck agent
- Model Context Protocol (MCP) — a custom FastMCP server bridges the agent and Arize Phoenix
- Arize Phoenix — observability platform storing the monitored app's traces
- Google Cloud Run — hosts the MCP server and the Lineup backend
- Tavily — live web search, keeping Lineup current
- Discord — receives autonomous quality alerts
- flightcheck (the monitoring agent): https://divergent99.github.io/flightcheck/
- Lineup (the monitored chatbot): https://divergent99.github.io/flightcheck/lineup.html
server.py— the flightcheck MCP server: exposesget_recent_traces,get_project_stats, andfile_quality_alerttools to the agentlineup-bot/— the Lineup chatbot backend (FastAPI, Gemini, Tavily, Phoenix tracing)docs/— the hosted front-ends (index.html,lineup.html)Dockerfile,requirements.txt— deployment for the MCP server
- Copy
.env.exampleto.envand fill in your values. pip install -r requirements.txtpython server.py
MIT — see LICENSE.