MathAgent

MathAgent is a research repository for agentic discovery of elementary Diophantine proofs and their later formalization in Lean. It is a collaboration between Ishan and Kieren, with room for multiple AI workflows to explore separate branches.

The repo is organized around four core artifact types:

methods/: reusable proof techniques, one method per file.
library/: small high-impact identities, lemmas, and transformations.
problems/: target theorem folders with statements, allowed inputs, attempts, Lean notes, and evaluation records.
examples/: solved demonstrations that teach how a method is used.

Supporting areas include workflows/ for agent protocols, evaluation/ for scoring, formal/ for Lean-facing artifacts, and papers/ for systems literature about theorem-proving agents, Lean tooling, search, refinement, and research automation.

Examples and problems are intentionally separated. Examples may contain solved demonstrations and method skeletons. Problems should usually avoid full standard proofs, so workflow evaluations are not contaminated by copied solutions.

The papers/ folder is for brainstorming coherent software/workflow configurations and benchmarking plans. It should influence workflow design and evaluation infrastructure, not silently expand the mathematical facts available to a problem run.

First Tasks

Fill method files.
Add high-impact Tagebuch identities.
Create target problem statements.
Run workflows on separate branches.
Score attempts using evaluation metrics.
Use papers/ to design and compare workflow configurations.

Run the skeleton check with:

make check
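
As a rough illustration of what a skeleton check over this layout could look like, the sketch below tests that the directories named in this README exist under a repository root. It is not the actual implementation behind `make check`. The names `CORE_DIRS`, `SUPPORT_DIRS`, `missing_dirs`, and `main` are hypothetical, and treating every listed directory as required is an assumption.

```python
from pathlib import Path

# Directory names come from this README. Treating all of them as required
# is an assumption made for this sketch only.
CORE_DIRS = ("methods", "library", "problems", "examples")
SUPPORT_DIRS = ("workflows", "evaluation", "formal", "papers")


def missing_dirs(root, names=CORE_DIRS + SUPPORT_DIRS):
    """Return the entries of `names` that are not directories under `root`."""
    root = Path(root)
    return [name for name in names if not (root / name).is_dir()]


def main(argv):
    """Hypothetical entry point: report missing directories, return an exit code."""
    root = argv[1] if len(argv) > 1 else "."
    missing = missing_dirs(root)
    for name in missing:
        print(f"missing directory: {name}/")
    return 1 if missing else 0
```

A script built on this could end with `sys.exit(main(sys.argv))`, so that a Makefile target fails when a directory is absent.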

