The ultimate benchmark for AI coding agents. Give an AI $500, an empty vending machine, and 90 days — race Claude, Codex, Gemini head-to-head with a live dashboard.
python flask simulation websocket gemini codex vending-machine ai-agents claude race-mode llm-evaluation ai-benchmark
-
Updated
Feb 27, 2026 - Python