
Add still-nanoda checker with some ongoing optimization efforts#38

Merged
nomeata merged 2 commits into leanprover:master from SchrodingerZhu:add-still-nanoda-checker
Apr 26, 2026
Conversation

@SchrodingerZhu
Contributor

still-nanoda is my fork of nanoda/sonanoda to examine some new optimization efforts

@nomeata
Collaborator

nomeata commented Apr 26, 2026

Thanks! Based on the readme this is a similar idea as @datokrat's sonanoda. Maybe you want to join forces? Or is the difference worth keeping both?

(I'm unsure about the value of having several nanoda forks with small variations on the arena. Maybe unless we start seeing people shooting holes into them, and that's how you put up a target for investigation, but so far that hasn't happened a lot)

It would be preferable if the checker description included a list of changes relative to nanoda, or at least the high-level ideas.

@SchrodingerZhu
Contributor Author

SchrodingerZhu commented Apr 26, 2026

This is an experimental fork on top of:

The purpose of this fork is to evaluate some optimizations around the costly
interning cache accesses and other engineering aspects. The changes are not
intended to alter the underlying typechecking logic, and we may contribute them
back once the evaluation stabilizes.

Current Change Set

  1. Local-First Expr Allocation Check: the imported expressions table is so
    large that lookups in it become costly. We adjusted the order of checks to
    consult the local interning cache first, reducing visits to the global cache.
    We also adjusted the default IndexSet API to allow holding an entry
    first and deciding the interning position in a second step. If the expr really
    needs to be allocated locally, this removes one extra lookup. Overall, this
    optimization yields a ~5% speedup on Cedar and Mathlib.

  2. Local-only Expression Filtering: another optimization is to do a filtering
    scan of the expression to skip the global cache lookup if the expression is local-only.

    • Local/Var are apparently local-only;
    • Nodes tied to TcCtx are also local-only.

    This change brings another 10% speedup in Cedar and Mathlib.
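The local-first check described in item 1 can be sketched as below. This is an illustrative reconstruction, not the fork's actual code: the real checker uses `IndexSet` (from the indexmap crate) and arena pointers, while this sketch uses std `HashMap` and made-up names (`Caches`, `ExprKey`); only the shape of `alloc_expr` is taken from the discussion. The point is that the `Entry` API lets a miss in the local cache proceed to insertion without paying for a second hash lookup.

```rust
use std::collections::HashMap;

// Hypothetical structural key of an Expr node; purely illustrative.
type ExprKey = (u8, u32, u32);
type ExprPtr = u32;

struct Caches {
    global: HashMap<ExprKey, ExprPtr>, // large, shared imported-expressions table
    local: HashMap<ExprKey, ExprPtr>,  // small per-thread interning cache
    next_local: ExprPtr,
}

impl Caches {
    // Local-first lookup: consult the small per-thread cache before the
    // large global table; on a local miss, hold the vacant entry so the
    // eventual insert needs no second lookup.
    fn alloc_expr(&mut self, key: ExprKey) -> ExprPtr {
        use std::collections::hash_map::Entry;
        match self.local.entry(key) {
            Entry::Occupied(e) => *e.get(), // local hit: cheapest path
            Entry::Vacant(v) => {
                if let Some(&p) = self.global.get(&key) {
                    // global hit: memoize locally so the next lookup stays local
                    *v.insert(p)
                } else {
                    // true miss: allocate a fresh local slot via the held entry
                    let p = self.next_local;
                    self.next_local += 1;
                    *v.insert(p)
                }
            }
        }
    }
}
```

Whether a global hit should also be memoized locally (as done here) is an assumption on my part; it trades per-thread memory for fewer visits to the contended global table.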
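The filtering scan in item 2 amounts to a predicate like the following. The `Expr` shape here is a minimal stand-in (the real type carries levels, binder info, and cached hashes), and treating `Const` as the only possibly-global leaf is my simplification: if any subterm is local-only (a `Var`/`Local`, or in the real checker a node tied to `TcCtx`), the whole expression cannot appear in the imported table, so the global lookup can be skipped.

```rust
// Minimal expression shape for illustration only.
enum Expr {
    Var(u32),                   // bound variable: never in the global table
    Local(u32),                 // free local: tied to the current context
    Const(&'static str),        // imported constant: may be global
    App(Box<Expr>, Box<Expr>),
}

// Quick pre-scan: an expression containing any local-only leaf cannot be
// in the imported (global) table, so its global lookup can be skipped.
fn is_local_only(e: &Expr) -> bool {
    match e {
        Expr::Var(_) | Expr::Local(_) => true,
        Expr::Const(_) => false,
        Expr::App(f, a) => is_local_only(f) || is_local_only(a),
    }
}
```

In a hash-consed arena this flag would presumably be cached per node at construction time rather than recomputed by a recursive walk, making the check O(1).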

@SchrodingerZhu
Contributor Author

Even after these changes, it appears that alloc_expr alone still takes up to 1/3 of the CPU cycles in each thread. We are still looking into it.

@SchrodingerZhu
Contributor Author

> (I'm unsure about the value of having several nanoda forks with small variations on the arena. Maybe unless we start seeing people shooting holes into them, and that's how you put up a target for investigation, but so far that hasn't happened a lot)

My fork is more of a temporary thing that will turn into a PR if it proves valuable; it is not a variant of the core algorithm. However, I'd like to have a separate tree to evaluate some ideas.

@nomeata
Collaborator

nomeata commented Apr 26, 2026

Ok, so how about we feature it on the arena until it either gets merged, or it gets rejected and is no longer worked on?

@nomeata nomeata merged commit 5fd2bee into leanprover:master Apr 26, 2026
3 checks passed
@SchrodingerZhu SchrodingerZhu deleted the add-still-nanoda-checker branch April 26, 2026 18:17
@SchrodingerZhu
Contributor Author

SchrodingerZhu commented Apr 26, 2026

It seems that I get a wall-time improvement from 6.3m to 4.9m, but not a clear CPU instruction count improvement.

@nomeata
Collaborator

nomeata commented Apr 27, 2026

(screenshot: arena benchmark results)

Looks pretty impressive to me, also the memory footprint improvement. Note that what is shown as time on the arena is actually instruction counts (more reliable to measure).

@nomeata
Collaborator

nomeata commented Apr 27, 2026

And all of that purely from optimizing cache access patterns! Not bad.

@nomeata
Collaborator

nomeata commented Apr 27, 2026

Ah, never mind, you actually forked sonanoda, and that's where it is from.

The wall time improvement is real, though, just not well shown on the arena.

@SchrodingerZhu
Contributor Author

SchrodingerZhu commented Apr 27, 2026

Haha yeah, I mainly want to examine some common patterns that occur in both sonanoda and nanoda. Sorry for not clarifying this.

This is one of the outputs from the trace analysis at our lab. Hopefully, the wall-time improvement still means something helpful.

Another finding we are exploring is that most of the costly lookups are App terms. Adding an additional layer of quick lookup cache helps reduce time by another 1~5%, but that part is relatively small and unstable. Also, the eviction policy is hard to decide...
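One way such a quick App cache could look, under my own assumptions (a small direct-mapped table keyed by the child pointers, with collisions simply overwriting the slot so no explicit eviction policy is needed): the `AppCache` name, the slot count, and the hash mix are all made up for illustration, not taken from the fork.

```rust
// A tiny direct-mapped cache in front of the App interning path.
// Collisions overwrite the slot (implicit eviction), avoiding any
// policy bookkeeping at the cost of occasional false misses.
const WAYS: usize = 1024;

struct AppCache {
    slots: Vec<Option<(u32, u32, u32)>>, // (fun ptr, arg ptr, interned result)
}

impl AppCache {
    fn new() -> Self {
        AppCache { slots: vec![None; WAYS] }
    }

    // Cheap mix of the two child ids; illustrative, not tuned.
    fn index(f: u32, a: u32) -> usize {
        ((f as usize).wrapping_mul(0x9E37_79B9) ^ a as usize) % WAYS
    }

    fn get(&self, f: u32, a: u32) -> Option<u32> {
        match self.slots[Self::index(f, a)] {
            Some((cf, ca, r)) if cf == f && ca == a => Some(r),
            _ => None,
        }
    }

    fn put(&mut self, f: u32, a: u32, r: u32) {
        self.slots[Self::index(f, a)] = Some((f, a, r));
    }
}
```

A direct-mapped design sidesteps the eviction question raised above, but it is also why the gain can be unstable: whether it pays off depends entirely on how often hot App pairs collide in the table.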

@SchrodingerZhu
Contributor Author

Apart from the global imported expr cache, another 10% of the time is spent solely in inst_aux.
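To see why instantiation is hot: substituting bound variables rebuilds every node on the path to each variable, and in a hash-consed arena every rebuilt node is another cache lookup. The sketch below only illustrates that shape; it is not nanoda's actual inst_aux (which works over arena pointers and handles binders by adjusting the offset under each lambda, omitted here since this toy `Expr` has no binders).

```rust
#[derive(Clone, Debug, PartialEq)]
enum Expr {
    Var(u32),                   // de Bruijn index
    Const(&'static str),
    App(Box<Expr>, Box<Expr>),
}

// Replace Var(offset + i) with subst[i]; every App on the way down is
// rebuilt, which in the real checker means one arena allocation (and
// hence one interning lookup) per rebuilt node.
fn inst_aux(e: &Expr, subst: &[Expr], offset: u32) -> Expr {
    match e {
        Expr::Var(i) => {
            if *i >= offset && ((*i - offset) as usize) < subst.len() {
                subst[(*i - offset) as usize].clone()
            } else {
                Expr::Var(*i)
            }
        }
        Expr::Const(c) => Expr::Const(c),
        Expr::App(f, a) => Expr::App(
            Box::new(inst_aux(f, subst, offset)),
            Box::new(inst_aux(a, subst, offset)),
        ),
    }
}
```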

@nomeata
Collaborator

nomeata commented Apr 27, 2026

This reminds me a lot of what Claude worked on when creating https://github.com/nomeata/nanobruijn. I think it even added a special fast cache for creating App nodes.

