LeanCat: A Benchmark Suite for Formal Category Theory in Lean 4

LeanCat is a benchmark suite of 100 statement-level problems in Lean 4 (mathlib), designed to stress-test abstraction-heavy, library-grounded reasoning in formal mathematics. This repository contains Part I: 1-Category Theory.

Overview

LeanCat addresses a critical gap in automated theorem proving benchmarks by focusing on category theory, the unifying language of modern mathematics, which requires sophisticated abstraction and library-navigation skills. While existing benchmarks target olympiad-style problems or undergraduate mathematics, LeanCat challenges AI systems with research-level categorical reasoning.

Paper

arXiv: https://arxiv.org/abs/2512.24796

> Note: This repository is the public release mirror. Active development happens in a private repo, and we periodically sync public artifacts here.

Repository Structure

LeanCat/
├── CAT_statement/          # Formal Lean 4 statements of benchmark problems
├── problems/               # Natural language problem descriptions (Markdown)
├── .github/               
├── CAT_statement.lean     # Main Lean 4 file containing all statement imports
├── lakefile.lean          
├── lean-toolchain         # Use Lean version 4.19.0
├── metadata.json          # Problem metadata (difficulty, tags, refs)
├── LeanCat.pdf            # PDF version of the benchmark
├── lake-manifest.json    
├── .gitignore            
└── LICENSE
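
To give a sense of what a statement-level problem file under CAT_statement/ might look like, here is a hypothetical Lean 4 sketch. It is not taken from the benchmark: the theorem name and the choice of statement are assumptions made for illustration, and the proof is left as `sorry` because only the statement is the point.

```lean
import Mathlib.CategoryTheory.Category.Basic

open CategoryTheory

universe v u

variable {C : Type u} [Category.{v} C]

/-- Illustrative only (hypothetical name): the composite of two
monomorphisms is again a monomorphism. -/
theorem example_mono_comp {X Y Z : C} (f : X ⟶ Y) (g : Y ⟶ Z)
    [Mono f] [Mono g] : Mono (f ≫ g) := by
  sorry
```

A prover evaluated on a statement of this shape would be expected to replace `sorry` with a proof that compiles against the pinned toolchain.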

Quick Start

Install Lean via elan: https://leanprover-community.github.io/get_started.html
Build the project:

# Clone the repository
git clone https://github.com/sciencraft/LeanCat.git
cd LeanCat

# Build with lake
lake build

Our benchmark is built with Lean + Mathlib @ 4.19.0.

Benchmark Content

Problem Categories (100 problems total for 1-Category Theory)

Basic Category Properties (Problems 1-18): Fundamental results about categories, morphisms, monomorphisms, epimorphisms, initial/terminal objects
Adjunctions (Problems 19-24, 26-29): Adjoint functors, universal properties, comma categories
Reflective and Coreflective Subcategories (Problems 30-33): Subcategory properties and classifications
Concrete Categories (Problems 34-41): Categories with faithful forgetful functors to Set
Limits and Colimits (Problems 42-73): The largest cluster covering limits, colimits, and related constructions
Cocompletions (Problems 74-78): Recent work on cocompletions requiring new definitions
Abelian Categories (Problems 79-90): Homological algebra concepts, kernels, cokernels, exact sequences
Monads (Problems 24, 91-100): Monads, Kleisli and Eilenberg-Moore categories
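
The cluster list above can be turned into a small lookup table. The following is a minimal Python sketch, not part of the repository: `CLUSTERS` and `clusters_of` are hypothetical names, and the ranges are copied from the list as written. Because the list is reproduced literally, the lookup returns two clusters for Problem 24 and none for Problem 25.

```python
# Problem ranges copied verbatim from the cluster list above.
# CLUSTERS and clusters_of are illustrative names, not repository APIs.
CLUSTERS = {
    "Basic Category Properties": [(1, 18)],
    "Adjunctions": [(19, 24), (26, 29)],
    "Reflective and Coreflective Subcategories": [(30, 33)],
    "Concrete Categories": [(34, 41)],
    "Limits and Colimits": [(42, 73)],
    "Cocompletions": [(74, 78)],
    "Abelian Categories": [(79, 90)],
    "Monads": [(24, 24), (91, 100)],
}


def clusters_of(problem: int) -> list[str]:
    """Return every cluster whose listed ranges contain the problem number."""
    return [
        name
        for name, ranges in CLUSTERS.items()
        if any(lo <= problem <= hi for lo, hi in ranges)
    ]


if __name__ == "__main__":
    for n in (1, 24, 25, 50, 100):
        print(n, clusters_of(n))
```

Returning a list rather than a single name keeps the lookup faithful to the list, where a problem may be tagged with more than one cluster.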

Difficulty Distribution

Easy: 20 problems (≤6/10 difficulty score)
Medium: 42 problems (6-8.5/10 difficulty score)
High: 38 problems (≥8.5/10 difficulty score)
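
As a rough illustration of the bands above, the sketch below maps a difficulty score to its band. It is not code from the repository: `difficulty_band` and `REPORTED_COUNTS` are hypothetical names. The stated ranges meet at 6 and 8.5, so the sketch assumes that a score of exactly 6 counts as Easy and a score of exactly 8.5 counts as High.

```python
# Band sizes as reported in the list above.
REPORTED_COUNTS = {"Easy": 20, "Medium": 42, "High": 38}


def difficulty_band(score: float) -> str:
    """Map a difficulty score on a 0-10 scale to Easy, Medium, or High.

    Assumption (not stated in the source): exactly 6 is Easy and
    exactly 8.5 is High, since the listed ranges share those endpoints.
    """
    if not 0 <= score <= 10:
        raise ValueError(f"score must lie in [0, 10], got {score}")
    if score <= 6:
        return "Easy"
    if score < 8.5:
        return "Medium"
    return "High"


if __name__ == "__main__":
    # The three reported band sizes should cover all 100 problems.
    print(sum(REPORTED_COUNTS.values()))
    for s in (3, 6, 7.25, 8.5, 10):
        print(s, difficulty_band(s))
```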

Citation

@article{xu2025leancat,
  title={LeanCat: A Benchmark Suite for Formal Category Theory in Lean (Part I: 1-Categories)},
  author={Xu, Rongge and Dai, Hui and Fu, Yiming and Jiang, Jiedong and Nie, Tianjiao and Wang, Hongwei and Wang, Junkai and Yang, Holiverse and Yang, Jiatong and Zhang, Zhi-Hao},
  journal={arXiv preprint arXiv:2512.24796},
  year={2025}
}
