Skip to content

feat: restore cross-language benchmark suite (ILO-65)#608

Merged
danieljohnmorris merged 1 commit into
mainfrom
chore/benchmark-suite-restore
May 22, 2026
Merged

feat: restore cross-language benchmark suite (ILO-65)#608
danieljohnmorris merged 1 commit into
mainfrom
chore/benchmark-suite-restore

Conversation

@danieljohnmorris
Copy link
Copy Markdown
Collaborator

Summary

  • Restores bench/ at repo root with five canonical benchmarks: fib, hof, listproc, pattern-match, sum-loop
  • Each benchmark has implementations in ilo, Python 3, Node.js (V8), and Rust
  • bench/run.sh runs all impls, verifies ilo correctness, emits bench/results.json
  • .github/workflows/bench.yml runs nightly, commits updated results, and warns on >10% regression

What was restored vs rebuilt

The original research/bench/run.sh (1669 lines, removed in commit 1d4fb21) and research/explorations/bench-realistic/ (removed in 2f69316) were recovered from git history. The five benchmarks specified in ILO-65 (fib, hof, listproc, pattern-match, sum-loop) were extracted and adapted to the current ilo syntax (braceless guards for early return; inline lambdas required for map/flt/fld; explicit function-name arg for multi-function files).

Test plan

  • bash bench/run.sh --no-rust runs end-to-end and produces bench/results.json
  • All five ilo correctness checks pass (fib=610, hof=332833500, listproc=3417, pattern-match=1386050, sum-loop=1353850)
  • bash bench/run.sh compiles and runs Rust baselines
  • CI nightly job triggers on schedule

Follow-ups

  • ILO-65 mentions website perf table consuming bench/results.jsonsite/src/content/docs/docs/perf.md not yet wired up
  • Hardware standardisation for CI runner (currently GitHub-hosted ubuntu-latest)
  • Statistical methodology (warmups, confidence intervals) not yet addressed

Closes ILO-65

🤖 Generated with Claude Code

@codecov
Copy link
Copy Markdown

codecov Bot commented May 21, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

📢 Thoughts on this report? Let us know!

Add bench/ directory with five canonical benchmarks (fib, hof, listproc,
pattern-match, sum-loop) each implemented in ilo + Python / Node.js / Rust.
bench/run.sh runs all impls, verifies correctness, and emits bench/results.json.
CI nightly workflow (.github/workflows/bench.yml) runs the suite and gates
on >10% regression vs previous run.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@danieljohnmorris danieljohnmorris force-pushed the chore/benchmark-suite-restore branch from 91b2b69 to 72a5469 Compare May 22, 2026 00:46
@danieljohnmorris danieljohnmorris merged commit 1096a00 into main May 22, 2026
5 checks passed
@danieljohnmorris danieljohnmorris deleted the chore/benchmark-suite-restore branch May 22, 2026 01:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant