Public artifact for the paper "The Economics of Coding Agents: Calibrating the Cost-Quality Frontier" (Aninda Ray, 2026). This release marks the exact code state cited in the paper.
Reproduce the paper's claims offline, with no Docker, API keys, or model calls:
python3 benchmarks/verify_criteria.py  # PASS/FAIL per claim; exits 0 iff all 12 reproduce
Browse the evidence interactively:
bash benchmarks/explore.sh
See CLAIMS.md for per-claim evidence and paper/paper.pdf for full methodology.
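Because verify_criteria.py exits 0 only when all 12 claims reproduce, its exit status can gate a script or CI step. The following is a minimal sketch, not part of the artifact: the helper name check_claims and its messages are hypothetical, and it assumes only the exit-status behavior stated above.

```shell
# Hypothetical helper (not shipped with the artifact): run a verification
# command and report on its exit status, passing a failing status through.
check_claims() {
    if "$@"; then
        echo "all claims reproduced"
    else
        status=$?   # exit status of the command that just failed
        echo "verification failed (exit status $status)" >&2
        return "$status"
    fi
}

# Intended use, from the repository root:
#   check_claims python3 benchmarks/verify_criteria.py
```

Wrapping the command this way keeps the per-claim PASS/FAIL output intact while making the overall result usable as a single pass/fail condition.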
Pre-release pending arXiv submission; the arXiv id and any final paper.pdf update will follow.