fix: sum and avg now work on VM and Cranelift, not just tree by danieljohnmorris · Pull Request #295 · ilo-lang/ilo

danieljohnmorris · 2026-05-16T01:08:26Z

Summary

ilo file.ilo --run-vm and --run-cranelift reported Compile error: undefined function: sum / avg. The tree-walker had both builtins wired up directly; the VM compiler had dedicated arms for median, stdev, variance, quantile, and cumsum but none for sum or avg, so they fell through to the named-function lookup and errored.

Manifesto framing: every agent reaching for sum xs on the VM hits a confusing "undefined function" diagnostic that names a real builtin, then has to rewrite the call as fld + xs 0. Tokens wasted and trust burned. The fix restores symmetry with the rest of the stats family across all three engines.

While fixing this I noticed the existing stats family (median, stdev, variance, quantile) had a Cranelift correctness bug: the per-register F64 shadow used by the OP_*_NN fast paths was never refreshed after a helper call, so any arithmetic over a stats result silently produced 0 on --run-cranelift. Fixed in the same PR with cross-cutting coverage.

Repro

Before:

$ echo 'r>n;avg [1, 2, 3, 4]' > t.ilo
$ ilo t.ilo --run r
2.5
$ ilo t.ilo --run-vm r
Compile error: undefined function: avg
$ ilo t.ilo --run-cranelift r
Compile error: undefined function: avg

After: 2.5 on every engine. Empty list: sum [] returns 0, avg [] errors with avg: cannot average an empty list, matching tree-walker semantics on every backend.

What's in the diff

ae3768c vm: new OP_SUM (165) and OP_AVG (166) opcodes with compile arms in build_call, dispatch handlers, vm_sum / vm_avg helpers, and jit_sum / jit_avg for the Cranelift handoff. Un-ignores vm_sum_basic, vm_sum_empty, vm_avg_basic now that the VM backs them.
393d7a8 cranelift: wire the new opcodes through the JIT and AOT compilers (helper FuncIds, declarations, symbol registrations, codegen blocks). In the same commit, refresh the F64 shadow after every stats helper call (median, quantile, stdev, variance, sum, avg) so subsequent OP_*_NN fast paths read the real numeric result instead of a stale zero. Both numeric-output classification passes (jit + AOT) include the new ops so reg_always_num flows through.
4f191c0 test: 16-test cross-engine regression file pinning the happy paths, the empty-list contracts (sum=0, avg=error), the wrong-type / non-numeric-element errors, and the F64-shadow hazard (including the pre-existing median variant). New examples/sum-avg.ilo picked up by examples_engines so every backend round-trips it.

Test plan

cargo test --release --features cranelift --test regression_sum_avg_cross_engine — 16/16 pass
cargo test --release --features cranelift --test examples_engines — pass
cargo test --release --features cranelift --lib — pass (the only failures with CARGO_TARGET_DIR set are pre-existing AOT tests that look for libilo.a in the default target dir; clean target passes 7/7)
Manual repro across all three engines (tree, VM, cranelift) for: sum/avg happy path, empty list, non-list arg, non-numeric element, arithmetic-on-results (the F64-shadow hazard)
cargo fmt --check clean

Follow-ups

None for this fix. The OP_SUM/OP_AVG dispatch in the VM has the same inline shape as OP_MEDIAN; a future refactor could collapse the six stats codegen blocks behind a small helper, but that's pure cleanup with no behavioural change.

sum and avg fell through to the named-function lookup on the VM and errored as 'undefined function', though both were implemented in the tree-walking interpreter. Pattern-match the median wiring: new OP_SUM (165) and OP_AVG (166), compile arms in build_call, dispatch handlers, vm_sum/vm_avg helpers, and jit_sum/jit_avg for the Cranelift path. Semantics match the tree-walker: sum [] = 0, avg [] errors, non-list input errors, non-number elements error. Un-ignore vm_sum_basic, vm_sum_empty, and vm_avg_basic now that the VM backs them.

Two changes, same hazard: 1. Add OP_SUM/OP_AVG codegen to both the in-process JIT (jit_cranelift) and the AOT compiler (compile_cranelift). New helper FuncIds, helper declarations, symbol registrations, and matching codegen blocks that mirror OP_MEDIAN. 2. Refresh the per-register F64 shadow after every stats helper call (median, quantile, stdev, variance, sum, avg). The JIT keeps an F64 shadow alongside each I64 NanVal so OP_*_NN fast paths can skip the bitcast; helper-call opcodes were only writing the I64 slot, leaving the shadow stale at zero. That made expressions like '-(avg xs) (sum ys)' silently produce 0 instead of the real result. Pre-existing bug for the median family; surfaced by sum/avg landing in the same hot path. Both classification passes (jit_cranelift and compile_cranelift) now list OP_SUM/OP_AVG in the numeric-output set so reg_always_num flows through correctly.

Sixteen regression tests pinning sum and avg behaviour across --run-tree, --run-vm, and (when enabled) --run-cranelift: happy paths, empty-list edge cases (sum returns 0, avg errors), wrong-type and non-numeric-element errors, and the F64-shadow hazard that breaks arithmetic on stats helper results. examples/sum-avg.ilo demonstrates the four common shapes and is picked up by examples_engines so every engine has to round-trip the file.

codecov · 2026-05-16T01:11:53Z

Codecov Report

❌ Patch coverage is 98.03922% with 3 lines in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/vm/mod.rs	97.10%	2 Missing ⚠️
src/vm/compile_cranelift.rs	97.14%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

Three codegen tests using compile_to_object_bytes (no libilo.a needed) so the AOT branches for sum and avg, plus the F64-shadow refresh under a subsequent OP_SUB_NN, are exercised by cargo llvm-cov.

Helpers struct gains min_lst / max_lst FuncIds in both jit_cranelift and compile_cranelift, declared as 2-param/1-return imports resolved from jit_min_lst / jit_max_lst at link time. Both opcodes register in the guaranteed-numeric-output allowlist for reg_always_num analysis, and the JIT site refreshes the F64 shadow on the result so subsequent arithmetic (`max xs - min xs`) reads the fresh value rather than a stale shadow. Same pattern PR #295 used for OP_SUM / OP_AVG / OP_MEDIAN and the rest of the stats helpers.

danieljohnmorris added 3 commits May 16, 2026 02:07

danieljohnmorris added 2 commits May 16, 2026 02:16

test: cover AOT codegen blocks for OP_SUM and OP_AVG

029b490

Three codegen tests using compile_to_object_bytes (no libilo.a needed) so the AOT branches for sum and avg, plus the F64-shadow refresh under a subsequent OP_SUB_NN, are exercised by cargo llvm-cov.

fmt: apply rustfmt to new OP_SUM/OP_AVG codegen tests

f90a70c

danieljohnmorris merged commit c0fee68 into main May 16, 2026
5 checks passed

danieljohnmorris deleted the fix/avg-cross-engine branch May 16, 2026 09:52

danieljohnmorris mentioned this pull request May 16, 2026

fix: accept list form for min and max #302

Merged

7 tasks

danieljohnmorris mentioned this pull request May 16, 2026

fix: flat now works on VM and Cranelift, not just tree #314

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: sum and avg now work on VM and Cranelift, not just tree#295

fix: sum and avg now work on VM and Cranelift, not just tree#295
danieljohnmorris merged 5 commits into
mainfrom
fix/avg-cross-engine

danieljohnmorris commented May 16, 2026

Uh oh!

codecov Bot commented May 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

danieljohnmorris commented May 16, 2026

Summary

Repro

What's in the diff

Test plan

Follow-ups

Uh oh!

codecov Bot commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codecov Bot commented May 16, 2026 •

edited

Loading