feat: restore cross-language benchmark suite (ILO-65) by danieljohnmorris · Pull Request #608 · ilo-lang/ilo

danieljohnmorris · 2026-05-21T21:53:12Z

Summary

Restores bench/ at repo root with five canonical benchmarks: fib, hof, listproc, pattern-match, sum-loop
Each benchmark has implementations in ilo, Python 3, Node.js (V8), and Rust
bench/run.sh runs all impls, verifies ilo correctness, emits bench/results.json
.github/workflows/bench.yml runs nightly, commits updated results, and warns on >10% regression

What was restored vs rebuilt

The original research/bench/run.sh (1669 lines, removed in commit 1d4fb21) and research/explorations/bench-realistic/ (removed in 2f69316) were recovered from git history. The five benchmarks specified in ILO-65 (fib, hof, listproc, pattern-match, sum-loop) were extracted and adapted to the current ilo syntax (braceless guards for early return; inline lambdas required for map/flt/fld; explicit function-name arg for multi-function files).

Test plan

bash bench/run.sh --no-rust runs end-to-end and produces bench/results.json
All five ilo correctness checks pass (fib=610, hof=332833500, listproc=3417, pattern-match=1386050, sum-loop=1353850)
bash bench/run.sh compiles and runs Rust baselines
CI nightly job triggers on schedule

Follow-ups

ILO-65 mentions website perf table consuming bench/results.json — site/src/content/docs/docs/perf.md not yet wired up
Hardware standardisation for CI runner (currently GitHub-hosted ubuntu-latest)
Statistical methodology (warmups, confidence intervals) not yet addressed

Closes ILO-65

🤖 Generated with Claude Code

codecov · 2026-05-21T21:54:25Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

📢 Thoughts on this report? Let us know!

Add bench/ directory with five canonical benchmarks (fib, hof, listproc, pattern-match, sum-loop) each implemented in ilo + Python / Node.js / Rust. bench/run.sh runs all impls, verifies correctness, and emits bench/results.json. CI nightly workflow (.github/workflows/bench.yml) runs the suite and gates on >10% regression vs previous run. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

danieljohnmorris force-pushed the chore/benchmark-suite-restore branch from 91b2b69 to 72a5469 Compare May 22, 2026 00:46

danieljohnmorris merged commit 1096a00 into main May 22, 2026
5 checks passed

danieljohnmorris deleted the chore/benchmark-suite-restore branch May 22, 2026 01:23

This was referenced May 22, 2026

chore(bench): hardware standardisation for CI runner (ILO-348) #683

Merged

bench: statistical methodology (warmup, confidence intervals) #684

Merged

chore: bench website perf table wiring (ILO-347) #685

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: restore cross-language benchmark suite (ILO-65)#608

feat: restore cross-language benchmark suite (ILO-65)#608
danieljohnmorris merged 1 commit into
mainfrom
chore/benchmark-suite-restore

danieljohnmorris commented May 21, 2026

Uh oh!

codecov Bot commented May 21, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

danieljohnmorris commented May 21, 2026

Summary

What was restored vs rebuilt

Test plan

Follow-ups

Uh oh!

codecov Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codecov Bot commented May 21, 2026 •

edited

Loading