lean-ground-truth

Lean 4 Proof Quality Verification Corpus

A ground-truth Lean 4 corpus for validating PMAT's proof quality verification pipeline. Contains theorems, lemmas, structures, and intentional sorry markers for Popperian falsification testing.

Overview

This repository is a Popperian falsification corpus — a carefully constructed test fixture that validates PMAT's Lean 4 analysis capabilities. Every theorem, lemma, and sorry marker exists to exercise a specific code path in PMAT's analysis pipeline.

What PMAT Validates Against This Corpus

Capability	Test Target
AST Extraction	`def`, `theorem`, `lemma`, `structure`, `class`, `inductive`, `abbrev`, `axiom`, `opaque`, `instance`, `namespace`
Sorry Detection	3 intentional `sorry` markers in `Incomplete.lean`
Proof Completeness	`(theorems - sorrys) / theorems` ratio = ~88%
Project Detection	`lakefile.lean` + `lean-toolchain` identification
CB-1050 Compliance	Block comment vs theorem disambiguation
CB-1052 Compliance	Sorry ratio threshold checking (< 20%)
Formal Verification Score	14.7/16 (91.9%) for pure Lean projects

PMAT Quality Metrics

Metric	Value
PMAT Grade	A (91.9%)
Formal Verification	14.7/16
Modules	4
Theorems/Lemmas	25+
Sorrys (intentional)	3
Proof Completeness	~88%
Structures	3
Inductive Types	1

Project Structure

lean-ground-truth/
  lakefile.lean              # Lake build configuration
  lean-toolchain             # Lean version pinning
  lib/
    GroundTruth.lean         # Top-level module imports
    GroundTruth/
      Basic.lean             # Boolean logic, pairs, function composition
      Nat.lean               # Natural number arithmetic proofs
      List.lean              # List properties, binary trees
      Incomplete.lean        # Intentional sorry markers (PMAT test target)
  spec/                      # Specification documents
  tests/                     # Test fixtures

Proof Modules

Basic.lean — Boolean Logic & Function Composition

Foundational proofs including:

Boolean identity (b && true = b, b || false = b)
De Morgan's laws
Pair projections and swaps
Function composition associativity

Nat.lean — Natural Number Arithmetic

Inductive proofs over natural numbers:

Commutativity and associativity of addition
Distributivity of multiplication
Zero identity proofs
Successor lemmas

List.lean — List Properties & Binary Trees

Structural proofs including:

List append associativity
Length preservation under reverse
Map-filter commutativity
Binary tree fold properties

Incomplete.lean — Intentional Sorry Markers

Contains exactly 3 sorry markers — these are the primary test targets for PMAT's FormalProofVerification falsification method. Each sorry represents an incomplete proof that PMAT must detect and report via CB-1050.

PMAT Verification

# Compliance check (CB-1050 through CB-1053)
pmat comply check -p /path/to/lean-ground-truth

# Project quality score
pmat rust-project-score -p /path/to/lean-ground-truth

# Context generation with Lean AST
pmat context -p /path/to/lean-ground-truth

# Semantic search for theorems
pmat query "theorem" -p /path/to/lean-ground-truth

# Verify sorry detection
pmat query --literal "sorry" -p /path/to/lean-ground-truth

Expected PMAT Output

CB-1050: 3 errors (sorry markers in Incomplete.lean)
CB-1052: Pass (sorry ratio 12% < 20% threshold)
CB-1053: Pass (all theorems documented)
Formal Verification Score: 14.7/16 (91.9%)

Installation

Prerequisites

Lean 4 (v4.15.0+)
Lake build system (bundled with Lean)
PMAT (v3.4.0+)

Build

# Clone
git clone https://github.com/paiml/lean-ground-truth.git
cd lean-ground-truth

# Build with Lake
lake build

# Check all proofs (will report sorry markers in Incomplete.lean)
lake env lean lib/GroundTruth.lean

Usage

This corpus is designed to be analyzed by PMAT, not used as a library. To use it as a test fixture:

# Run full PMAT analysis
pmat comply check -p . --verbose
pmat rust-project-score -p .

# Verify specific capabilities
pmat comply check -p . --failures-only  # Should show CB-1050 only

Contributing

All changes directly on master branch
Run lake build to verify proofs compile
Run pmat comply check -p . before committing
Maintain exactly 3 sorry markers in Incomplete.lean

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.github		.github
lib		lib
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
lakefile.lean		lakefile.lean
lean-toolchain		lean-toolchain

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lean-ground-truth

Table of Contents

Overview

What PMAT Validates Against This Corpus

PMAT Quality Metrics

Project Structure

Proof Modules

Basic.lean — Boolean Logic & Function Composition

Nat.lean — Natural Number Arithmetic

List.lean — List Properties & Binary Trees

Incomplete.lean — Intentional Sorry Markers

PMAT Verification

Expected PMAT Output

Installation

Prerequisites

Build

Usage

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

lean-ground-truth

Table of Contents

Overview

What PMAT Validates Against This Corpus

PMAT Quality Metrics

Project Structure

Proof Modules

Basic.lean — Boolean Logic & Function Composition

Nat.lean — Natural Number Arithmetic

List.lean — List Properties & Binary Trees

Incomplete.lean — Intentional Sorry Markers

PMAT Verification

Expected PMAT Output

Installation

Prerequisites

Build

Usage

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages