A mathematical proof assistant (Version 2.0)

This project aims to develop (in Python) a lightweight proof assistant that is substantially less powerful than full proof assistants such as Lean, Isabelle or Rocq, but which (hopefully) is easy to use to prove short, tedious tasks, such as verifying that one inequality or estimate follows from others. One specific intention of this assistant is to provide support for asymptotic estimates.

Documentation links (current version)

Older versions and posts

Original blog post explaining the project - Terence Tao, May 1 2025
- A companion post on the algebraic structure of orders of infinity - Terence Tao - May 4 2025

Getting started

Install uv - this will take care of managing dependencies, and will install Python for you as needed:

# On macOS and Linux.
curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows.
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

To start the assistant in an interactive Python session, run:

uvx --with git+https://github.com/teorth/estimates python

This will launch an interactive terminal. From there, enter:

>>> from estimates.main import *

To launch a new proof assistant, type p = ProofAssistant().
Alternatively, to try one of the exercises, such as linarith_exercise(), type p = linarith_exercise(). A list of exercises can be found here.

How the assistant works

The assistant can be in one of two modes: Assumption mode and Tactic mode. We will get to assumption mode later, but let us first discuss tactic mode, which is the mode one ends up in when one tries any of the exercises. The format of this mode is deliberately designed to resemble the tactic mode in modern proof assistant languages such as Lean, Isabelle or Rocq.

Let's start for instance with linarith_exercise(). Informally, this exercise asks to establish the following claim:

Informal version: If $x, y, z$ are positive reals with $x < 2 y$ and $y < 3 z + 1$ , prove that $x < 7 z + 2$ .

If one follows the above quick start instructions, one should now see the following:

>>> from main import *
>>> p = linarith_exercise()
Starting proof.  Current proof state:
x: pos_real
y: pos_real
z: pos_real
h1: x < 2*y
h2: y < 3*z + 1
|- x < 7*z + 2

We are now in Tactic mode, in which we try to establish a desired goal (the assertion after the |- symbol, which in this case is $x < 7 z + 2$ ) from the given hypotheses x, y, z, h1, h2. Hypotheses come in two types:

Variable declarations, such as x: pos_real, which asserts that we have a variable x that is a positive real number.
Predicates, such as h1: x < 2*y, which have a name (in this case, h1), and a boolean-valued assertion involving the variables, in this case $x < 2 y$ .

The goal is also a predicate. The list of hypotheses together with a goal is collectively referred to as a proof state.

In order to obtain the goal from the hypotheses, one usually uses a sequence of tactics, which can transform a given proof state to zero or more further proof states. This can decrease, increase, or hold steady the number of outstanding goals. The "game" is then to keep using tactics until the number of outstanding goals drops to zero, at which point the proof is complete. A full list of tactics can be found here.

In this particular case, there is a "linear arithmetic" tactic Linarith() (inspired by the Lean tactic linarith) that is specifically designed for the task of obtaining a goal as a linear combination of the hypotheses, and it "one-shots" this particular exercise:

>>> p.use(Linarith())
Goal solved by linear arithmetic!
Proof complete!

This may seem suspiciously easy, but one can ask Linarith to give a more detailed explanation:

>>> from main import *
>>> p = linarith_exercise()
Starting proof.  Current proof state:
x: pos_real
y: pos_real
z: pos_real
h1: x < 2*y
h2: y < 3*z + 1
|- x < 7*z + 2
>>> p.use(Linarith(verbose=True))
Checking feasibility of the following inequalities:
1*z > 0
1*x + -7*z >= 2
1*y + -3*z < 1
1*y > 0
1*x > 0
1*x + -2*y < 0
Infeasible by summing the following:
1*z > 0 multiplied by 1/4
1*x + -7*z >= 2 multiplied by 1/4
1*y + -3*z < 1 multiplied by -1/2
1*x + -2*y < 0 multiplied by -1/4
Goal solved by linear arithmetic!
Proof complete!

This gives more details as to what Linarith actually did:

First, it argued by contradiction, by taking the negation $x \geq 7 z + 2$ of the goal $x < 7 z + 2$ and added it to the hypotheses.
Then, it converted all the inequalities that were explicit or implicit in the hypotheses into a "linear programming" form in which the variables are on the left-hand side, and constants on the right-hand side. For instance, the assertion that x was a positive real became $1x>0$, and the assertion $y < 3z$ became $1y + -3*z < 1$.
Finally, it used exact linear programming to seek out a linear combination of these inequalities that would lead to an absurd inequality, in this case $0 < 1$ .

One can also inspect the final proof after solving the problem by using the proof() method, although in this case the proof is extremely simple:

>>> print(p.proof())
example (x: pos_real) (y: pos_real) (z: pos_real) (h1: x < 2*y) (h2: y < 3*z + 1): x < 7*z + 2 := by
  linarith

Here, the original hypotheses and goal are listed in a pseudo-Lean style, followed by the actual proof, which in this case is just one line.

One could ask what happens if Linarith fails to resolve the goal. With the verbose flag, it will give a specific counterexample consistent with all the inequalities it could find:

>>> from main import *
>>> p = linarith_impossible_example()
Starting proof.  Current proof state:
x: pos_real
y: pos_real
z: pos_real
h1: x < 2*y
h2: y < 3*z + 1
|- x < 7*z
>>> p.use(Linarith(verbose=true))
Checking feasibility of the following inequalities:
1*x + -7*z >= 0
1*x > 0
1*y + -3*z < 1
1*x + -2*y < 0
1*z > 0
1*y > 0
Feasible with the following values:
y = 2
x = 7/2
z = 1/2
Linear arithmetic was unable to prove goal.
1 goal remaining.
>>>

Here, the task given was an impossible one: to deduce $x < 7 z$ from the hypotheses that $x, y, z$ are positive reals with $x < 2 y$ and $y < 3 z + 1$ . A specific counterexample $x = 7 / 2$ , $y = 2$ , $z = 1 / 2$ was given to this problem. (In this case, this means that the original problem was impossible to solve; but in general one cannot draw such a conclusion, because it may have been possible to establish the goal by using some non-inequality hypotheses).

Now let us consider a slightly more complicated proof, in which some branching of cases is required.

>>> from main import *
>>> p = case_split_exercise()
Starting proof.  Current proof state:
P: bool
Q: bool
R: bool
S: bool
h1: P | Q
h2: R | S
|- (P & R) | (P & S) | (Q & R) | (Q & S)

Here, we have four atomic propositions (boolean variables) P, Q, R, S, with the hypothesis h1 that either P or Q is true, as well as the hypothesis h2 that either R or S is true. The objective is then to prove that one of the four statements P & R (i.e., P and R are both true), P & S, Q & R, and Q & S is true.

Here we can split the hypothesis h1 : P | Q into two cases:

>>> p.use(Cases("h1"))
Splitting h1: P | Q into cases P, Q.
2 goals remaining.

Let's now look at the current proof state:

>>> print(p)
Proof Assistant is in tactic mode.  Current proof state:
P: bool
Q: bool
R: bool
S: bool
h1: P
h2: R | S
|- (P & R) | (P & S) | (Q & R) | (Q & S)
This is goal 1 of 2.

Note how the hypothesis h1 has changed from P | Q to just P. But this is just one of the two goals. We can see this by looking at the current state of the proof:

>>> print(p.proof())
example (P: bool) (Q: bool) (R: bool) (S: bool) (h1: P | Q) (h2: R | S): (P & R) | (P & S) | (Q & R) | (Q & S) := by
  cases h1
  . **sorry**
  sorry

The proof has now branched into a tree with two leaf nodes (marked ``sorry''), representing the two unresolved goals. We are currently located at the first goal (as indicated by the asterisks). We can move to the next goal:

>>> p.next_goal()
Moved to goal 2 of 2.
>>> print(p.proof())
example (P: bool) (Q: bool) (R: bool) (S: bool) (h1: P | Q) (h2: R | S): (P & R) | (P & S) | (Q & R) | (Q & S) := by
  cases h1
  . sorry
  **sorry**
>>> print(p)
Proof Assistant is in tactic mode.  Current proof state:
P: bool
Q: bool
R: bool
S: bool
h1: Q
h2: R | S
|- (P & R) | (P & S) | (Q & R) | (Q & S)
This is goal 2 of 2.

So we see that in this second branch of the proof tree, h1 is now set to Q. For further ways to navigate the proof tree, see this page.

Now that we know that Q is true, we would like to use this to simplify our goal, for instance simplifying Q & R to Q. This can be done using the SimpAll() tactic:

>>> p.use(SimpAll())
Simplified (P & R) | (P & S) | (Q & R) | (Q & S) to R | S using Q.
Simplified R | S to True using R | S.
Goal solved!
1 goal remaining.

Here, the hypothesis Q was used to simplify the goal (using sympy's powerful simplification tools), all the way down to R | S. But this is precisely hypothesis h2, so on using that hypothesis as well, the conclusion was simplified to True, which of course closes off this goal. This then lands us automatically in the first goal, which can be solved by the same method:

>>> p.use(SimpAll())
Simplified (P & R) | (P & S) | (Q & R) | (Q & S) to R | S using P.
Simplified R | S to True using R | S.
Goal solved!
Proof complete!

And here is the final proof:

>>> print(p.proof())
example (P: bool) (Q: bool) (R: bool) (S: bool) (h1: P | Q) (h2: R | S): (P & R) | (P & S) | (Q & R) | (Q & S) := by
  cases h1
  . simp_all
  simp_all

One can combine propositional tactics with linear arithmetic tactics. Here is one example (using some propositional tactics we have not yet discussed, but whose purpose should be clear, and which one can look up in this page):

>>> from main import *
>>> p = split_exercise()
Starting proof.  Current proof state:
x: real
y: real
h1: (x > -1) & (x < 1)
h2: (y > -2) & (y < 2)
|- (x + y > -3) & (x + y < 3)
>>> p.use(SplitHyp("h1"))
Decomposing h1: (x > -1) & (x < 1) into components x > -1, x < 1.
1 goal remaining.
>>> p.use(SplitHyp("h2"))
Decomposing h2: (y > -2) & (y < 2) into components y > -2, y < 2.
1 goal remaining.
>>> p.use(SplitGoal())
Split into conjunctions: x + y > -3, x + y < 3
2 goals remaining.
>>> p.use(Linarith())
Goal solved by linear arithmetic!
1 goal remaining.
>>> p.use(Linarith())
Goal solved by linear arithmetic!
Proof complete!
>>> print(p.proof())
example (x: real) (y: real) (h1: (x > -1) & (x < 1)) (h2: (y > -2) & (y < 2)): (x + y > -3) & (x + y < 3) := by
  split_hyp h1
  split_hyp h2
  split_goal
  . linarith
  linarith

Creating a new problem

The previous demonstrations of the Proof Assistant used some "canned" examples which placed one directly in Tactic Mode with some pre-made hypotheses and goal. To make one's own problem to solve, one begins with the Proof Assistant constructor:

>>> p = ProofAssistant()

This places the proof assistant in Assumption Mode. Now one can add variables and assumptions. For instance, to introduce a positive real variable x, one can use the var() method to write

>>> x = p.var("real", "x")

This creates a sympy Python variable x, which is real and can be manipulated symbolically using the full range of sympy methods:

>>> x
x
>>> x.is_real
True
>>> x+x
2*x
>>> from sympy import expand
>>> expand((x+2)**2)
x**2 + 4*x + 4
>>> x<5
x < 5
>>> isinstance(x<5, Boolean)
True

One can also use vars() to introduce multiple variables at once:

>>> y,z = p.vars("pos_int", "y", "z")   # "pos_int" means "positive integer"
>>> y.is_positive
True
>>> (y+z).is_positive
True
>>> (x+y).is_positive
>>> (x+y).is_real
True

(Here, (x+y).is_positive returned None, reflecting the fact that the hypotheses do not allow one to easily assert that x+y is positive.)

One can then add additional hypotheses using the assume() command:

>>> p.assume(x+y+z <= 3, "h")
>>> p.assume((x>=y) & (y>=z), "h2")
>>> print(p)
Proof Assistant is in assumption mode.  Current hypotheses:
x: real
y: pos_int
z: pos_int
h: x + y + z <= 3
h2: (x >= y) & (y >= z)

Now, one can start a goal with the begin_proof() command:

>>> p.begin_proof(Eq(z,1))
Starting proof.  Current proof state:
x: real
y: pos_int
z: pos_int
h: x + y + z < 3
h2: (x >= y) & (y >= z)
|- Eq(z, 1)

(Here we are using sympy's symbolic equality relation Eq, because Python has reserved the = and == operators for other purposes.) Now one is in Tactic Mode and can use tactics as before.

For a full list of navigation commands that one can perform in either Assumption Mode or Tactic Mode, see this page.

Lemmas

In addition to general proof tactics, I plan to build a library of lemmas that can be used for more specialized applications. Here is one example, using an arithmetic mean geometric mean lemma $(x_{1} + \dots + x_{n})^{1 / n} \leq \frac{x_{1} + \dots + x_{n}}{n}$ to prove a slight variant of that lemma:

>>> from main import *
>>> p = amgm_exercise()
Starting proof.  Current proof state:
x: nonneg_real
y: nonneg_real
|- 2*x*y <= x**2 + y**2
>>> x,y = p.get_vars("x","y")
>>> p.use_lemma(Amgm(x**2,y**2))
Applying lemma am_gm(x**2, y**2) to conclude this: x**1.0*y**1.0 <= x**2/2 + y**2/2.
1 goal remaining.
>>> p.use(SimpAll())
Goal solved!
Proof complete!

Contributions and feedback

I would be happy to receive contributions and feedback on this tool, either as Github issues and pull requests, or the associated blog post. Examples of such contributions can include

Bug reports and corrections
Suggestions or submissions of exercises (or problems which are currently difficult to solve with the existing tactics and lemmas, but which can suggest new tactics and lemmas to implement)
Suggestions or submissions for new tactics
Suggestions or submissions for new lemmas
Suggestions or submissions for new data types
Suggestions or submissions of new user interfaces

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
docs		docs
src/estimates		src/estimates
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A mathematical proof assistant (Version 2.0)

Documentation links (current version)

Older versions and posts

Getting started

How the assistant works

Creating a new problem

Lemmas

Contributions and feedback

About

Releases

Packages

Contributors 5

Languages

License

teorth/estimates

Folders and files

Latest commit

History

Repository files navigation

A mathematical proof assistant (Version 2.0)

Documentation links (current version)

Older versions and posts

Getting started

How the assistant works

Creating a new problem

Lemmas

Contributions and feedback

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages