Skip to content

research lead: autoformalize NL requirements → Lean with proved ↔-equivalence as cert evidence #508

@avrabe

Description

@avrabe

From a focused web-research pass — a lead where rivet is plausibly ahead of the literature.

The technique: translate an NL requirement → Lean proposition, then prove the biconditional PA ↔ PB to decide requirement equivalence/consistency (DeepSeek-Prover-V2 on automotive safety reqs — https://arxiv.org/html/2511.11829).

rivet's edge: that work's single biggest manual bottleneck is variable grounding (establishing that differently-named symbols mean the same thing). rivet's typed, pre-grounded requirements largely solve that — turning their POC bottleneck into an automated step.

The open gap (publishable): no one packages autoformalized requirements as DO-178C/ISO 26262 traceability/certification evidence with a machine-checkable artifact (https://arxiv.org/html/2507.14330v1). rivet + gale/scry's Lean could be first.

Guardrails to adopt regardless: round-trip back-translation + symbolic equivalence checks; never trust an LLM-judge equivalence score (autoformalization survey: 97%→67% on manual check — https://arxiv.org/html/2505.23486v1). DeepSeek-Prover-V2 7B is locally runnable as the proving backend.

(Tracking issue from research; not scoped to a release yet.)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions