Core: Sequence bounds preconditions and VC-printer fallback fix by fabiomadge · Pull Request #1100 · strata-org/Strata

fabiomadge · 2026-05-02T21:20:27Z

Blocker for Laurel Seq/Array in #1073. Two related Core changes:

1. Fix VC-printer fallback for unknown 0-ary ops

handleZeroaryOps used to emit re.none() as its fallback, so VCs containing any 0-ary op outside the regex set (e.g. Sequence.empty) rendered as re.none() in printer output. Switched to mkGenericCall, matching how handleUnaryOps / handleBinaryOps already handle unknown ops.

2. Bounds preconditions for `Sequence.select` / `update` / `take` / `drop`

Following the Int.SafeDiv pattern:

Op	Precondition
`Sequence.select`	`0 <= i && i < Sequence.length(s)`
`Sequence.update`	`0 <= i && i < Sequence.length(s)`
`Sequence.take`	`0 <= n && n <= Sequence.length(s)`
`Sequence.drop`	`0 <= n && n <= Sequence.length(s)`

Sequence.length / empty / append / build / contains remain total.

PrecondElim picks the obligations up automatically at call sites in imperative code (via transformStmt) and in pure positions like requires / ensures / quantifier bodies / function bodies (via the synthetic $$wf procedures).

Obligations carry propertyType = "outOfBoundsAccess" (new MetaData constant, mirroring divisionByZero), flow through a new PropertyType.outOfBoundsAccess enum variant and the three metadata-to-PropertyType conversion sites, and render as "out-of-bounds-access" in SARIF output.

Incidental fix

While wiring the SARIF classification, I noticed propertyTypeToClassification in SarifOutput.lean was pre-existing dead code: vcResultToSarifResult never set properties.propertyType, so every obligation defaulted to "assert" in SARIF output. This PR wires it up, so divisionByZero and arithmeticOverflow obligations now also classify correctly in SARIF alongside the new outOfBoundsAccess.

Testing

New tests in StrataTest/Transform/PrecondElim.lean:

10 / 10c / 10d — Sequence.select in procedure body, in a requires clause (triggers mkContractWFProc), and in a function body (triggers mkFuncWFStmts).
11 — collectPrecondAsserts attaches outOfBoundsAccess metadata for all four partial ops and a nested Sequence.select(Sequence.update(...)) call. Mirrors OverflowCheckTest.lean.
12 — regression test: Sequence.empty renders as a generic call in VC printer output, not re.none().

Plus new property-classification tests in StrataTest/Languages/Core/Tests/SarifOutputTests.lean covering all five PropertyType variants.

Collateral updates to existing tests reflect the new obligations (Seq.lean), updated requires on Sequence function signatures (ProgramEvalTests.lean), and the 0-ary printer fix (Loops.lean). Note: the bounds obligations in Seq.lean appear as true && 0 < length(...) — the partial evaluator simplifies 0 <= 0 to true but does not further simplify true && X to X. This is a pre-existing evaluator gap, not newly introduced by this PR; the SMT solver discharges the obligation trivially.

Known downstream impact

PR #1073 (Laurel Seq/Array) emits Sequence.select / update / take / drop calls from its translation. Its T18_Sequences test uses #guard_msgs(drop info, error) on a diagnostics-only pipeline, so the test assertions should still pass syntactically — but individual sequence-manipulation programs now require bounds guards to fully verify. PR #1073 will need to adjust any end-to-end verification examples after this one merges. No in-repo Laurel tests are broken by this PR.

Out of scope

Parseable Sequence.empty<T>() syntax in raw Core source — a grammar-level feature for a separate PR. The printer fix here handles the display side; SMT encoding was already correct.

joscoh · 2026-05-04T14:14:59Z

-private def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=
+    and overflow), the precondition index distinguishes them.
+
+    Exposed (not `private`) so that tests can verify the classification for each


This does not need to be a in a comment. You can just change the visibility

Done — reverted to private and dropped the stale "Exposed" docstring. No test actually needs it public (the metadata-check test uses collectPrecondAsserts + metadata inspection instead).

handleZeroaryOps fell back to logging an error and returning re.none() for any 0-ary op outside the regex set. That silently substituted a regex primitive for unrelated ops in VC printer output; users saw re.none() where e.g. Sequence.empty() was intended. Switch the fallback to mkGenericCall, matching how handleUnaryOps and handleBinaryOps already handle unknown ops. The printer now emits the op name as a free-variable reference, preserving the intent. Parseable Sequence.empty<T>() syntax is still a separate grammar-level feature; this commit only fixes the printer-side noise.

polyUneval is the combinator used to declare unevaluated polymorphic functions with axioms. Unlike unaryOp and binaryOp, it had no way to attach preconditions; callers had to hand-build the WFLFunc. Add a 'preconditions' parameter and the matching free-vars proof obligation (subset of the function's input names), defaulting to empty. No behavioral change for existing callers.

atomb

This looks nice. There's a little room to reduce repetition, however.

Sequence.select and Sequence.update now require `0 <= i < length(s)`; Sequence.take and Sequence.drop require `0 <= n <= length(s)`. PrecondElim picks these up and generates VC obligations at call sites, both in statement positions (via transformStmt) and in pure positions (via mkContractWFProc / mkFuncWFProc) — so requires/ensures/quantifier-body subscripts are also covered. Obligations carry the propertyType metadata "outOfBoundsAccess" (new MetaData constant) and flow through a new PropertyType.outOfBoundsAccess enum variant — with matching entries in the statement-eval / obligation-extraction / cmd-eval metadata-to-PropertyType conversion sites — to finally render as "out-of-bounds-access" in SARIF output, matching how divisionByZero and arithmeticOverflow are classified. Side effect: `propertyTypeToClassification` in SarifOutput.lean was previously dead code; `vcResultToSarifResult` never set `properties.propertyType` so the SARIF output defaulted every obligation to "assert". Wiring this up means divisionByZero and arithmeticOverflow obligations now also classify correctly in SARIF — a pre-existing bug this PR incidentally fixes.

New tests in StrataTest/Transform/PrecondElim.lean: - Test 10: Sequence.select in a procedure body emits the bounds assert (PrecondElim is unconditional — it inserts regardless of any surrounding requires guard; the SMT solver discharges). - Test 10c: Sequence.select inside a requires clause triggers the $$wf-procedure path (mkContractWFProc). - Test 10d: Sequence.select inside a function body triggers the function-body $$wf path (mkFuncWFStmts). - Test 11: collectPrecondAsserts attaches outOfBoundsAccess metadata for all four partial ops and a nested call. Mirrors OverflowCheckTest.lean. Also verifies Sequence.length emits no obligation (it is total). - Test 12: Sequence.empty printer regression for the commit-1 fix — renders as a generic call, not re.none(). New property-classification tests in StrataTest/Languages/Core/Tests/SarifOutputTests.lean cover all five PropertyType variants, exercising the SARIF wiring fix in commit 3. Collateral test updates for real behavioral changes: - StrataTest/Languages/Core/Examples/Seq.lean: expected VC output includes the new bounds obligations (all SMT-provable from the surrounding context, except the pre-existing contains_yes unknown). - StrataTest/Languages/Core/Tests/ProgramEvalTests.lean: Sequence func signatures now render with the attached requires clauses. - StrataTest/Languages/Core/Examples/Loops.lean: commit-1 printer fix propagates (re.none() -> top, error message format updated).

tautschnig

Proof-coverage suggestions ("what to prove next"). These are all follow-ups, not blocking.

convertMetaDataPropertyType_inverse_of_classifyPrecondition — for every (fn, idx) that classifyPrecondition maps to some metaString, convertMetaDataPropertyType applied to a MetaData tagged with that string returns a non-.assert PropertyType. Concretely: there shouldn't be a gap where classifyPrecondition tags a call but convertMetaDataPropertyType silently reverts to .assert. Trivial to state, would catch the string-mismatch class of regressions the triplication enables.
PropertyType-to-SARIF round-trip — (propertyTypeToClassification p).fromClassification? = some p for all p : PropertyType, where fromClassification? is a matching deserializer (not yet written). Would make PropertyType the single source of truth and mechanically prune the MetaData.* string constants.
mkSeqBoundsPrecond_form — a structural theorem about (mkSeqBoundsPrecond x k).expr: it equals boolAnd (intLe 0 x) (k.opExpr x (seqLength s)) (or the concrete LExpr shape). Makes Test 10's off-by-one blind spot (see inline comment #2) explicit and proved rather than merely tested. Small.
seqSelect_denote_partial — semantic soundness of the bounds precondition with respect to Lean's List.get?: 0 ≤ i < s.length → (seqSelect s i).denote = .some (s.get! i). This is FactoryCorrect.lean territory (cf. PR #1103) rather than this PR, but worth tracking as the obvious continuation: the preconditions added here are only useful insofar as they correspond to a semantic partiality.
collectPrecondAsserts-spec — a purely structural theorem that for every sub-expression that is a call to a function with N preconditions, collectPrecondAsserts produces exactly N assert statements, each tagged by classifyPrecondition. Test 10 verifies counts 1, 1, 1, 1, and 2 empirically; a theorem would subsume all of that and catch e.g. a future refactor that stops visiting nested calls.

Refactoring opportunities (not blocking).

As noted in finding #3, consolidate MetaData.{divisionByZero,...} / propertyTypeToClassification / convertMetaDataPropertyType into a single table.
The new assertOutOfBoundsObligations helper in the tests is a good pattern; the pre-existing overflow/division tests in the same file (Test 1, Test 3, Test 5) could share a sibling helper, reducing per-test boilerplate uniformly.

tautschnig · 2026-05-05T12:49:10Z

 /-- Classify a function precondition into a property type for SARIF reporting.
    For functions with multiple preconditions (e.g., SafeSDiv has both div-by-zero
-    and overflow), the precondition index distinguishes them. -/
-private def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=
+    and overflow), the precondition index distinguishes them.
+
+    Exposed (not `private`) so that tests can verify the classification for each
+    partial op without having to inspect metadata on generated asserts. -/
+def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=


Carryover from joscoh's thread at this location (#1100 (comment)...). The reply says:

Done — reverted to private and dropped the stale "Exposed" docstring. No test actually needs it public (the metadata-check test uses collectPrecondAsserts + metadata inspection instead).

But the code in bbf0d27f7 still has def classifyPrecondition (public in the module sense) and still carries the "Exposed (not private)" paragraph. I grepped the tree: the only callers of classifyPrecondition are line 113 in this same file — the tests use collectPrecondAsserts + md.getPropertyType as the reply says — so private is genuinely safe. This is presumably an unpushed intent.

Suggested replacement for lines 64–70:

Suggested change

/-- Classify a function precondition into a property type for SARIF reporting.

For functions with multiple preconditions (e.g., SafeSDiv has both div-by-zero

and overflow), the precondition index distinguishes them. -/

private def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=

and overflow), the precondition index distinguishes them.

Exposed (not `private`) so that tests can verify the classification for each

partial op without having to inspect metadata on generated asserts. -/

def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=

/-- Classify a function precondition into a property type for SARIF reporting.

For functions with multiple preconditions (e.g., SafeSDiv has both div-by-zero

and overflow), the precondition index distinguishes them. -/

private def classifyPrecondition (funcName : String) (precondIdx : Nat := 0) : Option String :=

tautschnig · 2026-05-05T12:49:10Z

+private def mkSeqBoundsPrecond
+    (varName : String) (k : SeqBoundKind) :
+    Strata.DL.Util.FuncPrecondition (LExpr CoreLParams.mono) CoreLParams.Metadata :=
+  let sVar  : LExpr CoreLParams.mono := .fvar default "s" (some (seqTy mty[%a]))


Implicit coupling: mkSeqBoundsPrecond hard-codes "s" here, which is only correct because the four callers (seqSelectFunc, seqUpdateFunc, seqTakeFunc, seqDropFunc) all happen to name their sequence input "s". The h_precond proof obligation on polyUneval catches mismatches at definition time — so this isn't a correctness trap, but the constraint isn't documented at the point where it bites.

Either take a second seqVarName parameter (symmetric with the existing varName for the index/count):

Suggested change

let sVar : LExpr CoreLParams.mono := .fvar default "s" (some (seqTy mty[%a]))

/-- Precondition `0 <= varName && varName `k.opExpr` Sequence.length(seqVarName)`. -/

private def mkSeqBoundsPrecond

(seqVarName : String) (varName : String) (k : SeqBoundKind) :

Strata.DL.Util.FuncPrecondition (LExpr CoreLParams.mono) CoreLParams.Metadata :=

let sVar : LExpr CoreLParams.mono := .fvar default seqVarName (some (seqTy mty[%a]))

let xVar : LExpr CoreLParams.mono := .fvar default varName (some mty[int])

and update the four call sites to mkSeqBoundsPrecond "s" "i" .Lt etc. — slightly more verbose at the call sites, but the helper's contract is now self-describing.

Or, if you prefer to keep the helper's signature narrow, add a one-line doc note: Assumes the enclosing function names its sequence input "s".

tautschnig · 2026-05-05T12:49:10Z

+
+/-- Check that `collectPrecondAsserts` produces exactly `expectedCount`
+    obligations from `expr`, each tagged with `outOfBoundsAccess`. -/
+private def assertOutOfBoundsObligations
+    (expr : Core.Expression.Expr) (expectedCount : Nat) : IO Unit := do
+  let stmts := collectPrecondAsserts Core.Factory expr "test" #[]
+  assert! stmts.length == expectedCount
+  for s in stmts do
+    let md : MetaData Core.Expression := match s with
+      | Statement.assert _ _ md => md | _ => #[]
+    assert! md.getPropertyType == some MetaData.outOfBoundsAccess


assertOutOfBoundsObligations checks the obligation count and the metadata tag, but not the obligation content. That means an off-by-one regression in mkSeqBoundsPrecond — say, someone flips intLtFunc to intLeFunc for the .Lt case, producing ... && i <= length(s) for Sequence.select — slides past Test 10 entirely (count and tag both stay right).

The proof-coverage suggestion in the main review body (mkSeqBoundsPrecond_form) is the strong version. A lighter-weight test-level check that would catch this class of regression cheaply:

/-- Check that `collectPrecondAsserts` produces exactly `expectedCount` obligations from `expr`, each tagged with `outOfBoundsAccess` and each having the expected `boolAnd`-of-two-int-comparisons shape (so off-by-one regressions in `mkSeqBoundsPrecond` are caught). -/ private def assertOutOfBoundsObligations (expr : Core.Expression.Expr) (expectedCount : Nat) : IO Unit := do let stmts := collectPrecondAsserts Core.Factory expr "test" #[] assert! stmts.length == expectedCount for s in stmts do match s with | Statement.assert _ e md => assert! md.getPropertyType == some MetaData.outOfBoundsAccess -- The obligation is `boolAnd (intLe 0 _) (intLt|intLe _ (seqLength _))`. match e with | LExpr.app _ (LExpr.app _ andOp _) _ => assert! andOp.isBoolAnd -- pseudo; real predicate will depend on LExpr shape | _ => assert! false | _ => assert! false

(Exact predicate isBoolAnd would need spelling out; the intent is to fail loudly if the top-level constructor stops being boolAnd.) Happy to prepare a concrete patch if you'd like.

tautschnig · 2026-05-05T12:49:10Z

+/-- Convert a `MetaData` entry's property-type classification string to the
+    `PropertyType` enum. Falls back to `.assert` when the metadata carries
+    no classification or an unrecognized string; callers that emit
+    propertyType classifications should add a matching arm here. -/
+def convertMetaDataPropertyType {P : PureExpr} [BEq P.Ident]
+    (md : MetaData P) : PropertyType :=
+  match md.getPropertyType with
+  | some s =>
+    if s == MetaData.divisionByZero then .divisionByZero
+    else if s == MetaData.arithmeticOverflow then .arithmeticOverflow
+    else if s == MetaData.outOfBoundsAccess then .outOfBoundsAccess
+    else .assert
+  | none => .assert


We now have three parallel representations of the same enum:

MetaData.divisionByZero / arithmeticOverflow / outOfBoundsAccess — String constants (MetaData.lean:293–299).

propertyTypeToClassification : PropertyType → String (SarifOutput.lean:92) — the display side, kebab-case.

convertMetaDataPropertyType : MetaData → PropertyType — the decode side, using the constants from (1).

Adding a new PropertyType variant requires changes in all three places plus a new kebab/camel string pair, and forgetting (1)+(3) silently degrades to .assert at runtime — there's no compile-time check that (1) and (3) stay in sync.

One way to consolidate: a single definition that tabulates the serialization plus inverse:

/-- Canonical mapping from `PropertyType` to the pair of (metadata tag, SARIF classification) strings. -/ def PropertyType.serialization : PropertyType → Option (String × String) | .divisionByZero => some ("divisionByZero", "division-by-zero") | .arithmeticOverflow => some ("arithmeticOverflow", "arithmetic-overflow") | .outOfBoundsAccess => some ("outOfBoundsAccess", "out-of-bounds-access") | .cover => some ("cover", "cover") | .assert => none

Then propertyTypeToClassification and convertMetaDataPropertyType both derive from this single source, and the MetaData.* constants can go. A #[simp] round-trip lemma (fromClassification? (toClassification p) = some p) gives compile-time coverage of the set. See the proof-coverage suggestion in the main review body.

MikaelMayer

🤖 Clean PR. The convertMetaDataPropertyType extraction is a good de-duplication, SeqBoundKind is a nice way to prevent nested-precondition accidents, and the SARIF property-type wiring fix is a genuine improvement (was dead code before). Test coverage is solid.

No new issues beyond what's already been flagged in existing threads.

MikaelMayer · 2026-05-05T13:57:44Z

+-- -----------------------------------------------------------------------
+-- Test: unknown 0-ary operation renders as generic call (e.g.
+-- `Sequence.empty`, which is registered in the factory but not yet
+-- parseable in raw Core source). Previously, the `handleZeroaryOps`


🤖 Nit: "Previously, the handleZeroaryOps fallback silently substituted re.none()." — this is a non-stateless comment referencing the old behavior. Consider rephrasing to describe what the test verifies without referencing the prior state, e.g.:

-- Test: unknown 0-ary operation renders as a generic call (e.g. -- `Sequence.empty`, which is registered in the factory but not yet -- parseable in raw Core source).

github-actions Bot added the Core label May 2, 2026

fabiomadge force-pushed the feat/core-sequence-wf branch 2 times, most recently from dacef58 to 5e268e5 Compare May 2, 2026 23:05

fabiomadge marked this pull request as ready for review May 2, 2026 23:24

fabiomadge requested a review from a team May 2, 2026 23:24

github-actions Bot added the Waiting-For-Review label May 2, 2026

fabiomadge mentioned this pull request May 3, 2026

feat(laurel): Seq<T> and Array<T> with bounds-checked subscript and diagnostics #1073

Draft

tautschnig previously approved these changes May 4, 2026

View reviewed changes

github-actions Bot added Has 1 approval and removed Waiting-For-Review labels May 4, 2026

tautschnig assigned atomb and joscoh May 4, 2026

joscoh reviewed May 4, 2026

View reviewed changes

tautschnig assigned fabiomadge and unassigned atomb and joscoh May 4, 2026

fabiomadge added 2 commits May 5, 2026 02:21

fabiomadge dismissed tautschnig’s stale review via 1925770 May 5, 2026 00:22

fabiomadge force-pushed the feat/core-sequence-wf branch from 5e268e5 to 1925770 Compare May 5, 2026 00:22

github-actions Bot added Waiting-For-Review and removed Has 1 approval labels May 5, 2026

atomb previously approved these changes May 5, 2026

View reviewed changes

Comment thread Strata/DL/Imperative/CmdEval.lean Outdated

Comment thread Strata/Languages/Core/StatementEval.lean Outdated

Comment thread Strata/Languages/Core/ObligationExtraction.lean Outdated

Comment thread StrataTest/Transform/PrecondElim.lean

github-actions Bot added Has 1 approval and removed Waiting-For-Review labels May 5, 2026

fabiomadge added 2 commits May 5, 2026 09:57

fabiomadge dismissed atomb’s stale review via bbf0d27 May 5, 2026 07:57

fabiomadge force-pushed the feat/core-sequence-wf branch from 1925770 to bbf0d27 Compare May 5, 2026 07:57

github-actions Bot added the Waiting-For-Review label May 5, 2026

github-actions Bot removed the Has 1 approval label May 5, 2026

tautschnig reviewed May 5, 2026

View reviewed changes

MikaelMayer reviewed May 5, 2026

View reviewed changes

-  let sVar  : LExpr CoreLParams.mono := .fvar default "s" (some (seqTy mty[%a]))
+/-- Precondition `0 <= varName && varName `k.opExpr` Sequence.length(seqVarName)`. -/
+private def mkSeqBoundsPrecond
+    (seqVarName : String) (varName : String) (k : SeqBoundKind) :
+    Strata.DL.Util.FuncPrecondition (LExpr CoreLParams.mono) CoreLParams.Metadata :=
+  let sVar  : LExpr CoreLParams.mono := .fvar default seqVarName (some (seqTy mty[%a]))
+  let xVar  : LExpr CoreLParams.mono := .fvar default varName (some mty[int])

Conversation

fabiomadge commented May 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1. Fix VC-printer fallback for unknown 0-ary ops

2. Bounds preconditions for Sequence.select / update / take / drop

Incidental fix

Testing

Known downstream impact

Out of scope

Uh oh!

Uh oh!

Uh oh!

Uh oh!

joscoh May 4, 2026

Choose a reason for hiding this comment

Uh oh!

fabiomadge May 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

atomb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tautschnig left a comment

Choose a reason for hiding this comment

Uh oh!

tautschnig May 5, 2026

Choose a reason for hiding this comment

Uh oh!

tautschnig May 5, 2026

Choose a reason for hiding this comment

Uh oh!

tautschnig May 5, 2026

Choose a reason for hiding this comment

Uh oh!

tautschnig May 5, 2026

Choose a reason for hiding this comment

Uh oh!

MikaelMayer left a comment

Choose a reason for hiding this comment

Uh oh!

MikaelMayer May 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

fabiomadge commented May 2, 2026 •

edited

Loading

2. Bounds preconditions for `Sequence.select` / `update` / `take` / `drop`