Fix bool lets #791

mezpusz · 2025-12-06T11:06:03Z

This amends #783 from the SMTLIB side with the case of Boolean let binders.

I tested with:

- TPTP Discount/Otter, where 2 problems previously almost timing out with Otter flipped to timeout,
- SMTLIB2 with Discount on ALIA, AUFDTLIA, AUFDTLIRA, AUFDTNIRA, AUFLIA, AUFLIRA, AUFNIA and AUFNIRA, where the 20k crashing benchmarks mentioned in Polymorphic arrays #786 all disappeared: 18951 flipped to being solved, and we lose 6 benchmarks for whatever reason.

Not sure if I should test on more SMTLIB benchmarks or look into the differences, but I eventually want to run longer-term regressions over SMTLIB too, to see if we somehow regress.

MichaelRawson · 2025-12-07T11:15:29Z

> TPTP Discount/Otter where 2 problems previously almost timing out with Otter flipped to timeout,

> we lose 6 benchmarks for whatever reason.

I'm not too bothered by these, we'll doubtless get them some other way and being correct is usually slower than being incorrect.

Will now review - but thanks for looking into SMT-LIB, this is the kind of large bug we should catch.

MichaelRawson · 2025-12-07T11:17:35Z

Parse/SMTLIB2.cpp

+    Term* lhs;
+    if (exprSort == AtomicSort::boolSort()) {
+      // This solution is ugly, but either := has to be special or we have
+      // to wrap this as a formula to preserve the term-formula boundary.


At some point we can re-consider - I think you're of the opinion that more interpreted symbols and fewer special terms are a good thing, which I agree with. But for now this is a good fix.

mezpusz · 2025-12-07

Yes, I would be happy if the special term wrapper disappeared at some point, but let's see if we can get rid of it.

mezpusz · 2025-12-07T19:39:07Z

I ran a regression from f0fb5f9 (before the let-bound PR) to this branch and I found another bug introduced in the let-bound PR. It was because of my seemingly permanent misunderstanding of the functions isBoolean and isLiteral.

Now it still seems a bit suspicious, because we can somehow solve ~500 more benchmarks than before the let-bound PR. ~50 are lost. I will look into it for some time.

mezpusz · 2025-12-07T19:51:37Z

Looking at a couple of benchmarks in the difference, it seems that up to clausification the difference is trivial (for example, using $difference(...,1) instead of $sum(...,$uminus(1)), or having a different order of definitions/namings). 🤷
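To make the "trivial difference" concrete, here is a minimal sketch of why the two encodings denote the same value over the integers. The `tptp_*` helpers are hypothetical stand-ins for the TPTP arithmetic functions. They are not Vampire code, and they only model the semantics, not how the prover represents or orders the terms.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the TPTP integer functions named in the thread.
constexpr std::int64_t tptp_sum(std::int64_t a, std::int64_t b) { return a + b; }
constexpr std::int64_t tptp_uminus(std::int64_t a) { return -a; }
constexpr std::int64_t tptp_difference(std::int64_t a, std::int64_t b) { return a - b; }

// True when $difference(x,1) and $sum(x,$uminus(1)) evaluate to the same integer.
constexpr bool encodings_agree(std::int64_t x) {
  return tptp_difference(x, 1) == tptp_sum(x, tptp_uminus(1));
}

static_assert(encodings_agree(0), "x = 0");
static_assert(encodings_agree(-7), "x = -7");
static_assert(encodings_agree(42), "x = 42");

// Runtime spot check over a small range of values.
inline void check_small_range() {
  for (std::int64_t x = -100; x <= 100; ++x) {
    assert(encodings_agree(x));
  }
}
```

The values agree, but the terms are syntactically different (different symbols, different term depth), so the clause sets after clausification can still differ in shape.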

MichaelRawson · 2025-12-08T07:38:57Z

Do I read that as "we're not unsound because this only affects Vampire up to clausification"?

mezpusz · 2025-12-08T07:44:29Z

The let-bound changes mostly affected everything up to clausification, otherwise I don't know what other changes in between could have caused this result. Any ideas?

MichaelRawson · 2025-12-08T09:30:05Z

> It was because of my seemingly permanent misunderstanding of the functions isBoolean and isLiteral.

I think I also suffered from this. :-/ Any suggestions for a rename? I could go for isBoolean -> isTrueOrFalse.

> The let-bound changes mostly affected everything up to clausification, otherwise I don't know what other changes in between could have caused this result. Any ideas?

I wouldn't be too surprised if this were the case. I'm not sure what it would tell us here, but sometimes I write the clausification output to a file and then run Vampire on the CNF. If you don't see the same effect, it's probably something like term IDs or symbol precedence?

mezpusz · 2025-12-08T11:17:45Z

I have no idea what it should be. Fortunately it is not used in many places (and maybe some of them are incorrect too!) and it looks like it is always used in the context of preprocessing special Boolean terms?

But anyway, this PR is nevertheless an improvement over master. I also ran a TPTP regression, and it now came back without any difference compared to master.

mezpusz · 2025-12-08T11:37:04Z

Btw, the function could probably be simplified by taking getSpecialData()->getSort() instead of the loop+switch, otherwise I'm not even sure if the current one is correct because it yields false for special terms with a variable body.
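The concern about variable bodies can be illustrated with a toy model. Everything below is hypothetical: `Node`, `Kind`, `Sort` and both functions are illustrative stand-ins, not Vampire's classes or the actual implementation of isBoolean. The stored `sort` field only plays the role that getSpecialData()->getSort() plays in the suggestion above.

```cpp
#include <cassert>

// Hypothetical toy model; none of these types are Vampire's real classes.
enum class Sort { Bool, Int };
enum class Kind { Variable, TrueOrFalse, Formula, Ite, Let, Plain };

struct Node {
  Kind kind;
  Sort sort;         // for Ite/Let: the result sort stored with the special term
  const Node* body;  // for Ite/Let: the body to inspect; nullptr otherwise
};

// Structural check: walk through special terms with a loop and a switch.
bool isBooleanByWalk(const Node* n) {
  while (true) {
    switch (n->kind) {
      case Kind::TrueOrFalse:
      case Kind::Formula:
        return true;
      case Kind::Ite:
      case Kind::Let:
        n = n->body;  // descend into the body and keep looking
        break;
      case Kind::Variable:  // a variable's sort is not visible structurally
      case Kind::Plain:
        return false;
    }
  }
}

// Sort-based check: ask the special term for its stored result sort.
bool isBooleanBySort(const Node* n) {
  switch (n->kind) {
    case Kind::TrueOrFalse:
    case Kind::Formula:
      return true;
    case Kind::Ite:
    case Kind::Let:
      return n->sort == Sort::Bool;
    default:
      return false;
  }
}

// Builds an if-then-else of Boolean sort whose body is a bare variable and
// reports whether the two checks disagree on it.
bool walkMissesVariableBody() {
  Node x{Kind::Variable, Sort::Bool, nullptr};
  Node ite{Kind::Ite, Sort::Bool, &x};
  return !isBooleanByWalk(&ite) && isBooleanBySort(&ite);
}
```

In this model the walk gives up as soon as it reaches a variable, so a Boolean-sorted special term with a variable body is reported as non-Boolean, while the sort lookup classifies it correctly and needs no loop.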

mezpusz added 2 commits December 5, 2025 14:41

Cherry pick fix from polymorphic-arrays branch

d26e0d8

Add comment

9cb68bd

mezpusz requested review from MichaelRawson and quickbeam123 December 6, 2025 11:06

MichaelRawson reviewed Dec 7, 2025

MichaelRawson approved these changes Dec 7, 2025

mezpusz added the on hold (don't merge) label Dec 7, 2025

mezpusz added 3 commits December 7, 2025 16:40

Merge branch 'master' into fix-bool-lets

943b3a9

Fix UB reported by sanitizer

f6704c8

Incorrectly used isBoolean in a couple of places; disable tuples when they are not used

efb440d

mezpusz removed the on hold (don't merge) label Dec 9, 2025

quickbeam123 approved these changes Dec 10, 2025

MichaelRawson merged commit 2cf06d2 into master Dec 10, 2025
1 check passed
