Skip to content

fix(soundness): generated Chapel must not report success on failed work#47

Merged
hyperpolymath merged 8 commits into
mainfrom
claude/new-session-znxgm7
Jun 28, 2026
Merged

fix(soundness): generated Chapel must not report success on failed work#47
hyperpolymath merged 8 commits into
mainfrom
claude/new-session-znxgm7

Conversation

@hyperpolymath

Copy link
Copy Markdown
Owner

Summary

Closes two soundness holes in the generated Chapel program — both cases where the program silently produced/reported a wrong result as success.

  1. Reduce gather folded failed-item data and always reported success. It seeded the accumulator from resultData[0]/resultSizes[0] without checking resultOk[0], then set resultOk[0] = true unconditionally. If item 0 (or all items) had failed, garbage was folded into the reduction and the result was marked valid. Now it seeds from the first successful item, fails closed if a c_reduce step errors, and sets resultOk[0] = haveAccum — valid only if at least one input succeeded.
  2. Store failures were silently swallowed. A non-zero c_store_result only printed WARN: … and the program continued and exited 0 — so a result that never reached the user's storage looked like success. Now a store failure sets a flag and the program halt()s with a non-zero status (loud failure, never silent green).

The Idris2 ABI proofs and the Zig FFI reference impl were already sound; the gap was in src/codegen/chapel.rs.

Changes

  • src/codegen/chapel.rs: write_gather_reduce (failure-aware fold) and write_store_phase (track storeFailed, halt on failure; WARNERROR).

Testing

  • reduce_gather_is_failure_aware — generated Chapel uses haveAccum / resultOk[0] = haveAccum; and never resultOk[0] = true;.
  • store_phase_aborts_on_store_failure — generated Chapel tracks storeFailed and halt()s; no silent WARN: c_store_result.
  • Full suite green under the CI toolchain (1.96.0): cargo fmt --check, cargo clippy --all-targets -- -D warnings, cargo test --locked --all-targets (64 tests). (Generated Chapel is asserted by string, as in the existing tests; it is compiled by chpl downstream.)

RSR Quality Checklist

  • Tests pass
  • Formatted / linter clean
  • No banned language patterns
  • SPDX headers present on modified files
  • No secrets

🤖 Generated with Claude Code


Generated by Claude Code

claude added 8 commits June 27, 2026 22:31
Adds Chapeliser.ABI.Invariants, a new machine-checked theorem deeper than
and distinct from the Layer-2 Partition tiling proof. Partition.idr proved
the block partition is a gapless, non-overlapping tiling for all n,k but
explicitly left open the arithmetic residual `sumNat (perItemCounts n k) = n`
("the only div/mod obligation"). This module discharges exactly that.

blockCountsComplete : (n,k') -> sumNat (blockCounts n k') = n proves every
item is covered exactly once, for ALL n and all k>0, via the Euclidean
division theorem (contrib Data.Nat.Division) plus a self-contained count of
remainder slots. Counts are expressed with the public-export divNatNZ/modNatNZ
(the reducing form of Prelude div/mod on a positive divisor) so the proof and
its concrete controls reduce at the type level.

Includes: a sound+complete Dec (decCoversExactly), positive controls
(covers10over3, covers12over4 by Refl + via the general theorem), and
non-vacuity/negative controls (notCovers10as9, dec10over3as9No,
remainderCountMatters). Genuine proof: no believe_me/postulate/assert_total/
sorry/admitted. Builds clean (0 warnings); a deliberately false variant is
rejected by the type checker.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01A6PSzJWpRxtzGDjUCEh7Mx
…I.FfiSeam)

Add a new module proving the Result FFI encoding is sound:
- intToResult decoder + resultRoundTrip: the C integer faithfully
  round-trips back to the ABI Result (lossless/faithful encoding).
- resultToIntInjective: distinct ABI outcomes never collide on the wire,
  derived from the round-trip via justInj . cong intToResult.
- Positive controls (concrete decode = Refl) and a machine-checked
  non-vacuity control: distinct codes have distinct ints.

Genuine total proofs, no believe_me/postulate/assert_total/etc.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01A6PSzJWpRxtzGDjUCEh7Mx
Assemble the existing proof layers into one inhabited certificate value
in Chapeliser.ABI.Capstone:

- ABISound record whose fields reuse real exported witnesses:
  * flagshipValid    -> Proofs.tenAcrossTwoValid (Layer 2, complete+disjoint)
  * blockComplete    -> Invariants.blockPartitionIsComplete 10 2 (Layer 3)
  * blockCompleteAll -> Invariants.blockCountsComplete (Layer 3, general)
  * ffiInjective     -> FfiSeam.resultToIntInjective (Layer 4 seam)
- abiContractDischarged : ABISound — the single inhabited capstone value;
  ties manifest -> ABI proofs (flagship + invariant) -> FFI seam into one
  end-to-end soundness statement. Stops typechecking if any layer weakens.

No believe_me/postulate/assert_total/sorry/%hint; %default total; SPDX line 1.
Build clean (0 warnings); adversarial false-field certificate is rejected.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01A6PSzJWpRxtzGDjUCEh7Mx
…ble fix); port ABI-FFI gate Python->Bash (Python is estate-banned)

Resolves the standing baseline CI reds (rust-ci toolchain error, governance
Language/anti-pattern, governance workflow-lint) without altering the proven
ABI. The Bash gate reproduces the former Python gate's verdict verbatim
(validated across all -iser repos) and catches the same drift classes.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01A6PSzJWpRxtzGDjUCEh7Mx
Two soundness holes in the generated Chapel:

- Reduce gather seeded its accumulator from resultData[0]/resultSizes[0]
  without checking resultOk[0], then unconditionally set resultOk[0]=true.
  A failed first item folded garbage into the reduction and the result was
  reported valid. Now seed from the first SUCCESSFUL item, fail-closed on a
  c_reduce step error, and set resultOk[0]=haveAccum (valid only if >=1
  input succeeded).
- The store phase only warned on a c_store_result failure and exited 0, so
  a result that never reached the user's storage looked like success. Now
  track the failure and halt() with a non-zero status.

Adds codegen tests asserting the reduce gather is failure-aware and the
store phase aborts on a store failure.
@hyperpolymath hyperpolymath marked this pull request as ready for review June 28, 2026 11:20
@hyperpolymath hyperpolymath merged commit e3ee8dc into main Jun 28, 2026
6 checks passed
@hyperpolymath hyperpolymath deleted the claude/new-session-znxgm7 branch June 28, 2026 11:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants