Z3 Crashes #4533

wrwg · 2020-06-19T03:58:47Z

We are observing in the Libra Move Prover project some z3 (4.8.8) segmentation faults with smtlib files generated by Boogie (2.6.17.0). We were able to nail this down to specific smtlib verification problems which Boogie generates, which every now and then produce the crash.

How to best deal with passing this info to the Z3 team? I could attach the smtlib file to this issue, or create a PR which gives you a test to repro this. The repro requires the test to run many times, I'm not sure whether there is already some test framework support for this kind of problem.

@shazqadeer @DavidLDill @emmazzz

NikolajBjorner · 2020-06-19T21:23:46Z

You should communicate bug reproductions as SMTLIB2 files.

It is not practical for me to install and run some other tool to reproduce produce bugs even if these other tools are called Boogie. Sometimes I can work with python scripts, Java, c++ programs using the API, but also this adds a level of friction to reproducing bugs. So SMTLIB2 files are much preferred.

If a repro requires multiple runs make a note, I should be able to see.
It may be that you are setting a soft timeout in the script and then resuming check-sat calls after such a soft timeout. Reproducing bugs with soft timeouts is tricky because timeout behavior changes between platforms. A more reliable method is to use rlimit, which isn't a good time indicator, but it behaves deterministically. AdaCore and other users make use of this.

Overall a crash resilient approach to dealing with timeouts is to allow them, but when a timeout happens reset the solver state completely. I recognize this is not the first line of user-experience you want, but it is going to make the client behavior independent of bugs in the cancellation path.
Of course I aim to make sure the cancellation path is dealt with, but it entails a substantial test matrix.

Small files that fit in the edit window should be just pasted in. Use triple quote and to differentiate from formatted text. Larger files can be uploaded to gist or other storage point. If you don't want to share files widely, send them.

wrwg · 2020-06-19T22:17:33Z

Yes, smtlib files appear the best way to communicate this. I was just asking where to put them.

The one in question is quite large because it involves the entire Move language encoding generated from Boogie. So I submitted this into our depot and you can find it in my clone here. There you find z3_crash_200619.{smt,sh}. The later is a script which just runs the former smt file until z3 crashes. It may reproduce quickly, or after a 1000 iterations, or never (I'm executing it on MacOS).

AFAICT, this does not involve any timeouts. We are seeing Z3 crashes in Continuous Integration ever now and then, as well as on our laptops (I would guess we are looking at ~5% of the runs). The likelihood to experience this crash seems to have increased since we switched from a few month older version of z3 to 4.8.8 recently, in case this is helpful.

Thanks for helping with this, and I hope you can reproduce in debug mode!

Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>

NikolajBjorner · 2020-06-19T23:12:28Z

it crashes immediately for me in debug mode in the current master.
This crash is now fixed.
I would actually have expected it to crash very reliably for you as well so maybe we are not out of the woods yet.

wrwg · 2020-06-20T00:08:54Z

Awesome!

I'm wondering what your release model is. Are you creating them frequently? Otherwise we likely need to start versioning z3 via a commit hash. We have CI tests running and little changes in behavior can make some baseline/golden tests fail for many folks, so we need to ensure that everyone and the CI bot runs the exact same version.

NikolajBjorner · 2020-06-20T00:15:07Z

Releases are not too frequent.
Nightly builds are available with hashes
A release can be pushed up based on user request.
We need to stabilize features. A main point of instability is that we are moving to a new arithmetic solver core. The current release still uses the old core and if we were to create another release next week it would still be the old core even as there is good progress on the new core (and a few ongoing regressions such as a fuzz unsoundness filed yesterday).

wrwg · 2020-06-20T03:23:14Z

We are actually interested in the new arithmetic core (@junkil-park is looking at it), and hope that it solves some problems we have. On the other hand we have high stability requirements, since our plan is to run the Move Prover/Boogie/Z3 in continues integration for most Move smart contracts by later in July.

We also have some problems with QI, which we currently try to nail down. So taking up other changes from you frequently is a good thing.

Let me talk beginning of next week with the team to figure out what our best strategy is. We could (a) work on your nightly or some weekly (i.e. we tag some of your automatic daily as "green", based on our tests) or (b) we use your next official release and if there is some urgent issue, we kindly ask for accelerated release.

wrwg · 2020-06-20T04:01:26Z

it crashes immediately for me in debug mode in the current master.
This crash is now fixed.
I would actually have expected it to crash very reliably for you as well so maybe we are not out of the woods yet.

I've built from head and cannot reproduce the crash anymore (running the test driver now since 15 minutes which is approx. 100,000 times of running the repro smtlib file).

It was quite clearly not reliably reproducible on our side. Perhaps that is because we did not use the debug build which may have extra sanity checks (valgrind and so on). Looks like we should have a way to run the debug version of Z3 to help diagnose such problems faster.

NikolajBjorner added a commit that referenced this issue Jun 19, 2020

fix reset order for #4533

274323b

Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>

NikolajBjorner closed this as completed Jun 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Z3 Crashes #4533

Z3 Crashes #4533

wrwg commented Jun 19, 2020

NikolajBjorner commented Jun 19, 2020

wrwg commented Jun 19, 2020

NikolajBjorner commented Jun 19, 2020

wrwg commented Jun 20, 2020

NikolajBjorner commented Jun 20, 2020

wrwg commented Jun 20, 2020 •

edited

Loading

wrwg commented Jun 20, 2020

Z3 Crashes #4533

Z3 Crashes #4533

Comments

wrwg commented Jun 19, 2020

NikolajBjorner commented Jun 19, 2020

wrwg commented Jun 19, 2020

NikolajBjorner commented Jun 19, 2020

wrwg commented Jun 20, 2020

NikolajBjorner commented Jun 20, 2020

wrwg commented Jun 20, 2020 • edited Loading

wrwg commented Jun 20, 2020

wrwg commented Jun 20, 2020 •

edited

Loading