Release v1.0-arxiv · aray-17/code-capsules

Public artifact for the paper The Economics of Coding Agents: Calibrating the Cost-Quality Frontier (Aninda Ray, 2026). This release marks the exact code state cited in the paper.

Reproduce the paper's claims offline — no Docker, API keys, or model calls:

python3 benchmarks/verify_criteria.py   # PASS/FAIL per claim; exits 0 iff all 12 reproduce

Browse the evidence interactively:

bash benchmarks/explore.sh

See CLAIMS.md for per-claim evidence and paper/paper.pdf for full methodology.

Pre-release pending arXiv submission; the arXiv id and any final paper.pdf update will follow.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.0-arxiv

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!