EvalForge: CI/CD for reliable AI agents. RAGAS-shape evals, Langfuse-shape traces, inline guardrails, and a PASS/FAIL deploy gate that compares a baseline RAG chatbot to an engineered one. Built for Hack the Tech 2026.
python typescript hackathon nextjs devtools rag pydantic fastapi guardrails vercel ai-engineering anthropic langfuse llm-evaluation ragas deploy-gate
-
Updated
May 25, 2026 - Python