[Merged by Bors] - feat: tactic frontend for slim_check #3114

semorrison · 2023-03-26T22:02:15Z

Adds a tactic front end for slim_check, and provides commands #test and #sample.
Updates all the mathlib3 tests, although many are broken.

There are still some missing parts of Mathlib.Testing.SlimCheck.Sampleable: in particular we can't currently sample from lists, and this explains most of the broken tests.
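
For readers who want to try the new commands, here is a minimal sketch of how they might be invoked. It assumes the module path `Mathlib.Tactic.SlimCheck` introduced by this PR and the command forms `#sample <type>` and `#test <proposition>`. The propositions are made up for illustration, and names may have moved in later Mathlib versions.

```lean
-- Assumption: this import path matches the file added by this PR.
import Mathlib.Tactic.SlimCheck

-- Print a handful of randomly generated values of a sampleable type.
#sample Nat
#sample Int

-- A true statement: testing should find no counterexample.
#test ∀ (x : Nat) (h : 10 < x), 5 < x

-- A false statement: testing is expected to report a counterexample
-- (for instance some `x` between 6 and 10), which shows up as an error.
#test ∀ (x : Nat) (h : 5 < x), 10 < x
```

Lists are deliberately avoided here, since the description above notes that sampling from lists is not yet supported.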

semorrison · 2023-03-26T22:04:10Z

Closes #1103.

hrmacbeth · 2023-04-18T14:23:44Z

I don't actually know anything about slim_check (though now I'm excited to try it) -- the request on the teaching list had been passed on from @robertylewis so I'm requesting his review instead.

hargoniX

I'm sorry the review request slipped past me :/

I think the change looks good. Leo has also told me that
the wider verification community is very interested in counter
example generation so this is certainly a good step towards it.

Regarding the teaching purposes I will write down a basic explanation
of what it does:

You give it some proof goal or proposition.
It tries to generate random examples for the inputs of that proposition.
It checks whether the proposition holds for them. If it holds for a certain number of examples, it will stop and be happy; if some example fails, it will instead try to minimize the counter example using a shrinking strategy.

This general technique is called property testing. It was popularized by a Haskell library called QuickCheck, if you want to look into it more.
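
The generate, check, and shrink loop described above can be sketched in a few lines of plain Lean 4. This is a toy illustration only, not how slim_check is implemented: `findCounterexample` and `shrink` are hypothetical names, the property is restricted to a single `Nat` input, and the shrinking strategy (try `x / 2`, then `x - 1`) is an assumption chosen for simplicity.

```lean
/-- Try `n` random naturals in `[0, bound]`; return the first one that falsifies `p`. -/
def findCounterexample (p : Nat → Bool) (n bound : Nat) : IO (Option Nat) := do
  for _ in [0:n] do
    let x ← IO.rand 0 bound
    if !p x then return some x
  return none

/-- Greedy shrinking: move to a strictly smaller value that still falsifies `p`,
    and stop when neither candidate does. -/
partial def shrink (p : Nat → Bool) (x : Nat) : Nat :=
  match [x / 2, x - 1].find? (fun y => decide (y < x) && !p y) with
  | some y => shrink p y
  | none => x

def main : IO Unit := do
  -- A false claim: every natural number is below 50.
  let p : Nat → Bool := fun x => decide (x < 50)
  match ← findCounterexample p 100 100 with
  | some x => IO.println s!"counterexample {x}, shrunk to {shrink p x}"
  | none => IO.println "no counterexample found in 100 tries"
```

For this particular property, any counterexample that is found shrinks to 50, the smallest value for which the claim fails.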

So this might be pretty useful for students to figure out early on that they are trying to prove False. Note that slim_check in its current state is pretty bare bones though. It has barely any instances for sampling values of types. I think in order to actually use it on a larger scale we would definitely be interested in a deriving handler and in general more base instances.

eric-wieser · 2023-05-04T23:45:51Z

> in particular we can't currently sample from lists, and this explains most of the broken tests.

Can't the mathlib3 instances be forward-ported here?

hargoniX · 2023-05-05T06:17:12Z

They can in principle, but the mathlib3 code here looks vastly different from what you would write in Lean 4. I remember trying for like an hour to get it to work before eventually giving up and just PRing what I already had.

semorrison · 2023-05-05T23:35:14Z

I propose merging as is. Instance and derive handlers can be added as there is energy to do so, but likely won't happen if people can't see that slim_check is available in the first place.

(e.g. it might have been useful already here).

hrmacbeth · 2023-05-06T22:35:12Z

I just tried this out for the first time, it's very cool! Here's a counterexample it couldn't find:

example (a b c d : ℤ) (h1 : a ≤ b) (h2 : d ≤ c) (h4 : 0 ≤ a) (h3 : 0 ≤ d) : a / c ≤ b / d := by
  slim_check -- 2 3 1 0 is a counterexample

Is this to be expected? (e.g. is Int not sampleable yet?)

Anyway, I'm on board with merging now.

semorrison · 2023-05-07T09:59:28Z

example (a b c d : ℤ) (h1 : a ≤ b) (h2 : d ≤ c) (h4 : 0 ≤ a) (h3 : 0 ≤ d) : a / c ≤ b / d := by
  slim_check -- 2 3 1 0 is a counterexample

I'm actually finding that slim_check finds a counterexample here about half the time, and the other half it prints a message about giving up after ~80 tries.

Using slim_check (config := {numRetries := 100}) for various values of 100 does work better. I guess it would be possible to have slim_check watch the maxHeartbeat clock, and keep retrying while it has time...? Not sure if this belongs here, however.

hargoniX · 2023-05-07T11:45:42Z

example (a b c d : ℤ) (h1 : a ≤ b) (h2 : d ≤ c) (h4 : 0 ≤ a) (h3 : 0 ≤ d) : a / c ≤ b / d := by
  slim_check -- 2 3 1 0 is a counterexample

The integers are very much sampleable, the issue is just that slim_check is throwing
randomized values up to a certain (configurable) size (default value 100) at the formula.
That is, unlike something like an SMT solver there is not any intelligence behind
the way that we choose values to test here.

So it might very well be that as soon as the number of variables rises and there are
many combinations that fulfill the property, it does not always find an issue. There
are two ways to try to address this. First, if you are only looking for small counter
examples, you can reduce the maximum size up to which we search:

example (a b c d : ℤ) (h1 : a ≤ b) (h2 : d ≤ c) (h4 : 0 ≤ a) (h3 : 0 ≤ d) : a / c ≤ b / d := by
  slim_check (config := { maxSize := 10 })

This forces it to explore a smaller space with the same number of tries, so
the probability of finding a counter example is higher. To increase this probability
even more, you can also increase the number of tries:

example (a b c d : ℤ) (h1 : a ≤ b) (h2 : d ≤ c) (h4 : 0 ≤ a) (h3 : 0 ≤ d) : a / c ≤ b / d := by
  slim_check (config := { maxSize := 10, numInst := 1000 })

This is now basically guaranteed to find a counter example.
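
To make the probability argument concrete: if each independent try hits a counterexample with probability q, then n tries find at least one with probability 1 - (1 - q)^n. The sketch below just evaluates that formula. `hitProbability` is a hypothetical helper, and the values of q are illustrative assumptions, not measurements of slim_check's actual sampling distribution.

```lean
/-- Probability that at least one of `n` independent tries hits a counterexample,
    when each try hits with probability `q`. -/
def hitProbability (q : Float) (n : Nat) : Float :=
  1 - Float.pow (1 - q) n.toFloat

-- Rare counterexamples, default-sized budget: roughly 0.63.
#eval hitProbability 0.01 100

-- Smaller search space (larger q) and more tries: very close to 1.
#eval hitProbability 0.1 1000
```

Shrinking the size bound raises q, and raising the number of tries raises n, so both changes push the probability toward 1.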

jcommelin

@hargoniX Thanks for the review! 🎉

bors merge

Adds a tactic front end for `slim_check`, and provides commands `#test` and `#sample`. Updates all the mathlib3 tests, although many are broken. There are still some missing parts of `Mathlib.Testing.SlimCheck.Sampleable`: in particular we can't currently sample from lists, and this explains most of the broken tests. Co-authored-by: Scott Morrison <scott.morrison@gmail.com> Co-authored-by: Scott Morrison <scott.morrison@anu.edu.au>

bors · 2023-05-10T06:33:09Z

Pull request successfully merged into master.

Build succeeded!


feat: tactic frontend for slim_check

3af5063

semorrison requested a review from hargoniX March 26, 2023 22:02

import all files

bdaa2ef

hrmacbeth linked an issue Mar 27, 2023 that may be closed by this pull request

slim_check frontend #1103

Closed

semorrison added awaiting-review t-meta Tactics, attributes or user commands labels Mar 27, 2023

semorrison requested a review from hrmacbeth April 18, 2023 03:30

hrmacbeth requested review from robertylewis and removed request for hrmacbeth April 18, 2023 14:22

semorrison added 2 commits May 5, 2023 09:09

Merge remote-tracking branch 'origin/master' into slim_check_frontend

054aea0

import all

6a5210f

hargoniX approved these changes May 4, 2023

Merge remote-tracking branch 'origin/master' into slim_check_frontend

ce5afbc

semorrison mentioned this pull request May 7, 2023

[Merged by Bors] - feat: simplify slim_check, removing proofs #3835

Closed

1 task

semorrison mentioned this pull request May 8, 2023

feat: allow slim_check to do work in MetaM #3838

Open

jcommelin approved these changes May 10, 2023

semorrison added ready-to-merge This PR has been sent to bors. and removed awaiting-review labels May 10, 2023

bors bot changed the title ~~feat: tactic frontend for slim_check~~ [Merged by Bors] - feat: tactic frontend for slim_check May 10, 2023

bors bot closed this May 10, 2023

bors bot deleted the slim_check_frontend branch May 10, 2023 06:33
