Task 109 by jt0202 · Pull Request #171 · leanprover/human-eval-lean

jt0202 · 2025-05-06T04:59:41Z

Currently WIP.

I dislike the given solution, as I feel it should be possible with just a linear number of comparisons for arbitrary data types that have a linear order. I also got a bit confused about what a right shift is and proved a bunch of theorems for left shifts, which are also easier to state. I'm not sure yet whether it is easier to prove an equivalence between the two or to redo it all for a right-shift definition.
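
To make the left/right shift relationship concrete, here is a small Python sketch (not the Lean code from this PR). The helper names `rotate_left` and `rotate_right` are hypothetical and chosen only for illustration. It shows that a right shift by k produces the same list as a left shift by (n - k) mod n, which is the kind of equivalence mentioned above.

```python
def rotate_left(xs, k):
    """Cyclically shift xs to the left by k positions."""
    n = len(xs)
    if n == 0:
        return []
    k %= n
    return xs[k:] + xs[:k]


def rotate_right(xs, k):
    """Cyclically shift xs to the right by k positions."""
    n = len(xs)
    if n == 0:
        return []
    k %= n
    # The last k elements move to the front.
    return xs[n - k:] + xs[:n - k]


def right_as_left(xs, k):
    """Express a right shift through a left shift by (n - k) mod n."""
    n = len(xs)
    if n == 0:
        return []
    return rotate_left(xs, (n - k) % n)
```

With such an identity, statements proved for left shifts could in principle be transferred to right shifts without redoing the theory, although whether that is the easier route in Lean is exactly the open question in the comment.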

jt0202 · 2025-05-06T19:59:18Z

The proof strategy is as follows:

We call an index i of a list l a (global) break point if l[i] > l[i+1], where l wraps around, i.e. l[l.length] = l[0]. A local break point is one that does not use the wrap-around.

A list is sorted iff the number of local break points is zero (the wrap-around might be a break point, but that does not matter), which is already proven in Lean.

The number of global break points is obviously at most one more than the number of local break points.
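
The two counts can be illustrated with a short Python sketch (again not the PR's Lean code; the function names are hypothetical). Here "local" means break points that ignore the wrap-around pair, and "global" means break points that include it.

```python
def local_breaks(xs):
    """Count indices i with xs[i] > xs[i + 1], ignoring the wrap-around."""
    return sum(1 for i in range(len(xs) - 1) if xs[i] > xs[i + 1])


def global_breaks(xs):
    """Count indices i with xs[i] > xs[(i + 1) mod n], including the wrap-around."""
    n = len(xs)
    return sum(1 for i in range(n) if xs[i] > xs[(i + 1) % n])


def is_sorted(xs):
    """A list is sorted (non-decreasing) iff it has no local break point."""
    return local_breaks(xs) == 0
```

For example, `[1, 2, 3]` has zero local break points but one global break point (the wrap-around pair 3 > 1), while `[3, 4, 5, 1, 2]` has one of each.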

The algorithm counts the number of global break points and compares that with two.

If the number of global break points is zero, then the list itself suffices.
If the number of global break points is one, there is a number such that shifting the list by this number results in a sorted list (open).
If the number of global break points is at least two, then there is no solution, because the number of global break points is preserved under shifts (open) and hence the number of local break points never becomes zero.

This can be computed in linear time, but proving the open questions is a bit of a pain. There might be a better intermediate representation, though, that I can't think of right now.
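
The strategy above can be sketched in Python as follows. This is only an illustration under the stated definitions, not the Lean implementation from this PR, and all function names are hypothetical. The linear-time check counts wrap-around break points and compares the count with two; a brute-force version that tries every shift is included so the two can be compared on small inputs.

```python
from itertools import product


def count_global_breaks(xs):
    """Count indices i with xs[i] > xs[(i + 1) mod n]; one pass, n comparisons."""
    n = len(xs)
    return sum(1 for i in range(n) if xs[i] > xs[(i + 1) % n])


def can_sort_by_shifting(xs):
    """Linear-time check: some cyclic shift is sorted iff fewer than two global breaks."""
    return count_global_breaks(xs) < 2


def can_sort_brute_force(xs):
    """Reference check: try every cyclic shift and test it for sortedness."""
    n = len(xs)
    if n == 0:
        return True
    for k in range(n):
        shifted = xs[k:] + xs[:k]
        if all(shifted[i] <= shifted[i + 1] for i in range(n - 1)):
            return True
    return False


def agree_on_small_lists(max_len=5, values=(0, 1, 2)):
    """Compare both checks on all lists over `values` up to length `max_len`."""
    for n in range(max_len + 1):
        for xs in product(values, repeat=n):
            if can_sort_by_shifting(list(xs)) != can_sort_brute_force(list(xs)):
                return False
    return True
```

Since every cyclic shift is also a right shift by some amount, the brute-force version covers the right-shift formulation of the task. Agreement on small inputs is of course only a sanity check and does not replace the two steps marked as open above.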

jt0202 · 2025-05-06T20:54:14Z

Thinking about this further, it is probably clearer to do this with arrays instead of lists.

jt0202 · 2025-05-17T13:09:39Z

This was kind of painful. The algorithmic idea was pretty easy to find, but all the lemmas related to counting the break points and the properties of List.sum were a bit difficult. In some of these cases Mathlib might have been more helpful. omega, however, was a lifesaver.

TwoFX

This is very nice! It would probably be possible to go through this and shorten the proofs in places, but I think apart from the two comments at the top I'll just merge this as is, and we can incrementally shorten it in future PRs as we like (and as better automation becomes available).

TwoFX

Merging this now. It would certainly be interesting to explore how much easier the proofs get when switching to List.countP.

jt0202 added 2 commits May 5, 2025 22:12

Left shift theory

4dee5cf

Algorithm

4b39897

jt0202 commented May 6, 2025

Comment thread lean-toolchain Outdated

jt0202 added 10 commits May 7, 2025 22:06

Next try

fe8aa77

Further

1106cf3

Progress on 0 and 1 cases

f8e1502

Finish zero case

9ef0e36

Finish one case

2d36a1f

Sketch of main theorem

aaf5049

Different try

7553f73

Case for ge 2

abf05c6

Finished

5b03220

simp -> simp only

28419b9

jt0202 marked this pull request as ready for review May 17, 2025 13:04

Update to stable release

58bdd17

TwoFX reviewed Jun 5, 2025

Comment thread HumanEvalLean/HumanEval109.lean Outdated

Comment thread HumanEvalLean/HumanEval109.lean Outdated

Comment thread lean-toolchain Outdated

jt0202 and others added 2 commits June 5, 2025 17:33

Suggestions from review

a8d2935

Merge branch 'master' into 109

cb696a8

TwoFX approved these changes Jun 9, 2025

TwoFX merged commit 2a6757d into leanprover:master Jun 9, 2025
1 check passed

TwoFX mentioned this pull request Jun 9, 2025

HumanEval/109 #109

Closed

TwoFX merged 15 commits into leanprover:master from jt0202:109
