Autoformalization Pipeline

Multi-agent system for translating LaTeX mathematics into verified Lean 4 proofs using Mathlib.

Setup

make setup    # creates venv, installs deps, builds Lean + REPL (~20 min)

Create .env with the API key for your chosen provider:

ANTHROPIC_API_KEY=your-key-here   # for Claude models
OPENAI_API_KEY=your-key-here      # for GPT models
GEMINI_API_KEY=your-key-here      # for Gemini models

Quick Start

1. Prepare book data — place book.md (and optionally book.pdf) in autoform/data/<name>/. See autoform/data/example/ for a sample.

2. Extract targets:

python -m autoform.statement_extraction run \
    --book-dir=autoform/data/my_book \
    --output=autoform/data/my_book/targets.yaml

3. Create a config (see autoform/bot/configs/ for examples):

workspace:
  path: ../my-workspace
  mathlib_path: submodules/mathlib
  lib_name: My_Book

book:
  path: my_book
  files: [book.md]
  targets: targets.yaml

llm:
  model: Opus 4.6

workers:
  agents_per_node: 5
  num_repls_per_node: 5
  min_agents_per_task: 3
  max_agents_per_task: 5

4. Run:

# Start fresh
python -m autoform.bot.main run --config=path/to/config.yaml --name=my-run --fresh

# Resume an interrupted run (omit --fresh)
python -m autoform.bot.main run --config=path/to/config.yaml --name=my-run

# Multi-node with SLURM
srun --nodes=N --ntasks-per-node=1 python -m autoform.bot.main run --config=... --name=my-run

5. Monitor:

python -m autoform.visualizer.app --runs-dir=../my-workspace --port=8003

6. Evaluate:

python -m autoform.eval run \
    --repo-dir=../my-workspace/my-run/code \
    --task-file=autoform/data/my_book/targets.yaml \
    --book-dir=autoform/data/my_book

Architecture

autoform-pipeline/
├── autoform/
│   ├── bot/                  Multi-agent pipeline (orchestrator, workers, reviewers)
│   ├── eval/                 Evaluation (grading, lean checks, metrics, rubrics)
│   ├── visualizer/           Web dashboard for inspecting runs and traces
│   ├── statement_extraction/ Statement chunking and extraction from LaTeX
│   └── data/                 Book datasets (book.md + targets.yaml)
├── core/                     Framework (agent, inference, trace, coordination)
├── tools/                    MCP tool servers (filesystem, git, bash, Lean REPL/LSP, mathlib)
├── template/                 Lean 4 + Mathlib workspace template
├── submodules/               Git submodules (mathlib, repl, lean-lsp-mcp)
└── docs/                     Documentation

Documentation

Pipeline:

Bot — multi-agent architecture, DAG workflow, multi-node SLURM, agent roles, config reference
Evaluation — matching, axiom checking, LLM grading rubrics, dependency graphs
Statement Extraction — chunking, multi-agent extraction, deduplication
Visualizer — dashboard views, API endpoints, hub mode

Tools:

Tools Overview — MCP tool system, available servers, adding new tools
REPL Reference — Lean REPL architecture, pooled server, Python API

License

This project is licensed under the CC BY-NC 4.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
autoform		autoform
core		core
docs		docs
template		template
tests		tests
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Autoformalization Pipeline

Setup

Quick Start

Architecture

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Autoformalization Pipeline

Setup

Quick Start

Architecture

Documentation

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages