Fix Neg clausify #168

brossignol · 2024-04-12T15:34:47Z

Neg clausify is broken, it returns itself as a clause but I don't think it should ever do that.
An example:

from pysat.formula import Atom, And, Or, Neg
f = Or(Atom(1), Neg(Atom(2)), Neg(Atom(2)))
print(list(f))
>>> [[-2], [-2], [1, -2, -2]]

This assume that all models contain -2, but [1, 2] is a valid model. (The correct clause should be only [1, -2, -2], without the [-2]).

I propose a fix where Neg clauses is always empty. An extra variable clausified is used to avoid multiple clausify of the subformula.

alexeyignatiev · 2024-04-12T18:53:10Z

I can't check this behaviour right now - will be back to my computer on Monday. But negation should return a clause if it is a highest level (outer most) term.

alexeyignatiev · 2024-04-12T19:42:17Z

I suppose this is caused by the use of the duplicate literal. I should have handled it properly and this is what requires a fix. I can't try it now without a computer but my guess is it will work if you consider a non-repeated Neg(Atom(2)).

alexeyignatiev · 2024-04-12T19:50:35Z

If my guess is correct then fixing should be done by changing the condition for clauses creation. Instead of checking only if the clauses are empty, it should also take into account whether the name exists already.

brossignol · 2024-04-14T18:31:27Z

When listing the clauses __iter__ is used for the highest level and _iter otherwise.
A solution could be to overwrite Neg.__iter__ to return only subformula and have Neg._iter return itself.

Also, this could be extended to other classes.
This could solves differences in behaviour that appear if you clausify subformula directly:

f = And(Atom(1), Atom(2))
g = And(f, Atom(3))
list(f)
>>> [[1], [2]]

f = And(Atom(1), Atom(2))
g = And(f, Atom(3))
g.clausify()
list(f)
>>> [[1, -4], [2, -4], [4, -1, -2]]

alexeyignatiev · 2024-04-15T01:04:45Z

I agree!

alexeyignatiev · 2024-04-15T01:21:52Z

So if you think you can reimplement the patch implementing these changes and taking into account that Neg should return a clause if that is the outermost negation then I will be happy to merge! Alternatively, I can do it myself but I will have to decline the PR.

brossignol · 2024-04-15T11:21:03Z

I fixed for Neg.

from pysat.formula import Atom, And, Neg

print(list(Neg(Atom(2))))
>>> [[-2]]

f = Or(Atom(1), Neg(Atom(2)), Neg(Atom(2)))
print(list(f))
>>> [[1, -2, -2]]

I will look for other classes too.

alexeyignatiev · 2024-04-15T12:06:33Z

Thanks!

brossignol · 2024-04-15T13:13:02Z

Should a single atom return itself as a clause when it is outermost? (basically a single atom formula)
Currently

list(Atom(1))
>>>[]

add _clauses_tseitin

brossignol · 2024-04-15T17:20:29Z

I finished to fix all classes.

A new attribut _clauses_tseitin is used to store clauses after Tseitin transformation.
The attribut .clauses keeps the raw clauses.
_iter has a new argument outermost to decide witch clauses to return between clauses and _clauses_tseitin.
Add a new test test_clausification.py with simple example reproducing current problems.

This fix avoids all the inner/outermost clause problems and avoid recomputing clausification multiple times.
The only drawback is it double memory footprint.

I made _clauses_tseitin hidden but we can rename it to visible variable clauses_tseitin or even take another name.

edit: The build fails for python 3.11 and 3.12 only, I don't know why.

alexeyignatiev · 2024-04-15T19:25:17Z

But why do you need to have another variable for storing clauses? Why is clauses not enough? All the formula types except for Neg seem have no issue with the outermost case.

Yes, Atom should return a clause if is outermost but I could not find a use for that.

brossignol · 2024-04-15T23:02:53Z

It may not be important, but if you read the subformula clauses, you may get different results.
Currently:

f = And(Atom(1),Atom(2))
g = And(f, Atom(3))

print(list(f))
>>> [[1], [2]]

g.clausify()
print(list(f))
>>> [[1, -4], [2, -4], [4, -1, -2]]

I aimed to have always the same results when called directly (still return Tseitin variables when called as a subformula)

print(list(f))
>>> [[1], [2]]

alexeyignatiev · 2024-04-16T00:34:05Z

I see what you mean. This makes sense although I feel somehow uneasy doubling the number of clauses. Do you make it work uniformly across all the formula types, including CNF?

UPDATE: No worries, I can see now that CNF is among the classes updated.

brossignol · 2024-04-16T01:19:43Z

I think I can make that only the requested clauses (.clauses or .clauses_tseitin) are stored. This way, the subformula won't store any unnecessary clauses.
This will increase calculation time when both types of clauses are called on the same formula. This seems more acceptable.

alexeyignatiev · 2024-04-16T01:22:22Z

Well, re-encoding every time is a problem. 🙂

alexeyignatiev · 2024-04-16T01:31:36Z

So I think it is a lesser evil to do the work once (although it spends more memory).

brossignol · 2024-04-16T01:46:37Z

I was thinking more as re-encoding twice at most. Once for .clauses (if needed) and once for .clauses_tseitin (if needed).
Given that .clauses_tseitin require .clauses, if the call order is .clauses then .clauses_tseitin there is no overcost.

alexeyignatiev · 2024-04-16T01:48:08Z

Yes, but every time a sub-formula appears in a larger formula, you have to "re-tseitinize" it.

brossignol · 2024-04-16T01:49:51Z

But if we store the tseitin result inside .clauses_tseitin there is no problem. Am I wrong?

alexeyignatiev · 2024-04-16T01:52:03Z

My current solution was to keep the clauses representing a non-tseitinized (unnamed) variant of the clauses and then give it a name by applying Tseitin transformation on demand. You correctly pointed out that a user may want to consider clauses representing not only the most complex formula but also its specific sub-formulas, which requires one to keep both versions of the clauses after clausification or re-clausifying on the fly every time. I don't see how you can do it by keeping a single version only and encoding on the fly only once.

alexeyignatiev · 2024-04-16T01:54:50Z

But if we store the tseitin result inside .clauses_tseitin there is no problem. Am I wrong?

In this case, there is no problem of re-encoding but there is a problem of a duplicate of the clauses. 🙂

brossignol · 2024-04-16T01:58:19Z

There is a duplicate only if the user asks both regular clauses and tseistin clauses on the same formula.
My guess is it will not happen very often.

brossignol · 2024-04-16T02:03:55Z

I could aim that clausifying a big formula gives the result without extra computing or memory.
But if regular clauses of subformula are required then there are extra cost of memory and computation. (at most x2).

alexeyignatiev · 2024-04-16T02:05:22Z

I could aim that clausifying a big formula once gives the result without extra computing or memory. But if regular clauses of subformula are required then there are extra cost of memory and computation. (at most x2).

I presume this is what your current patch is doing?

brossignol · 2024-04-16T02:08:07Z

For the moment, it's doubling the memory, but I think I can achieve this goal relatively easily.
I need to tell .clausify to not store .clauses when not required.

alexeyignatiev · 2024-04-16T02:09:39Z

For the moment, it's doubling the memory, but I think I can achieve this goal relatively easily. I need to tell .clausify to not store .clauses when not required.

But the thing is you never know if a user will need that clausal representation of the sub-formula. It is safer to keep it.

brossignol · 2024-04-16T02:15:25Z

They will have access to the tseitin clauses of subformula at no cost, but accessing the regular clause will add cost.

To be fair I have now idea how regular users will use the code (I am not a SAT expert). The most common case can guide how the code is optimized for.

brossignol · 2024-04-16T12:44:13Z

I updated with the memory saving. The results seem convincing to me.

And example to display the new behaviour:

f = And(Atom('1'), Atom('2'))
g = And(f, Atom('3'))

print('f', f.clauses, 'tseitin:', f.clauses_tseitin)
print('g', g.clauses, 'tseitin:', g.clauses_tseitin)
>>> f [] tseitin: []
>>> g [] tseitin: []

g.clausify()
print('f', f.clauses, 'tseitin:', f.clauses_tseitin)
print('g', g.clauses, 'tseitin:', g.clauses_tseitin)
>>>f [] tseitin: [[1, -3], [2, -3], [3, -1, -2]]
>>>g [[3], [4]] tseitin: []

f.clausify()
print('f', f.clauses, 'tseitin:', f.clauses_tseitin)
print('g', g.clauses, 'tseitin:', g.clauses_tseitin)
>>>f [[1], [2]] tseitin: [[1, -3], [2, -3], [3, -1, -2]]
>>>g [[3], [4]] tseitin: []

alexeyignatiev · 2024-04-16T23:01:46Z

This looks good, thank you!

alexeyignatiev · 2024-04-17T12:11:57Z

Just in case you will continue working on something related to these changes, I've renamed the variable self.clauses_tseitin to self.encoded. Again, thanks for the contribution!

brossignol force-pushed the patch-6 branch from 6ce38d5 to 7dde481 Compare April 15, 2024 11:18

brossignol added 11 commits April 15, 2024 18:18

add outermost argument to _iter

9377e03

add _clauses_tseitin

update And clausification

01571be

update Or clausification

b3fc6f3

update Neg clausification

309857c

update Implies clausification

b2d32ab

update Equals clausification

68069d3

update XOr clausification

43074fb

update ITE clausification

b0768a9

update CNF clausification

f8f0520

update Atom clausification

a2d92ec

add test_clausification

9892161

brossignol force-pushed the patch-6 branch from 7dde481 to 9892161 Compare April 15, 2024 16:57

brossignol added 3 commits April 16, 2024 14:17

fix test_clausification

a99bc0a

save clauses only if required

6803a74

rename _clauses_tseitin into clauses_tseitin

8e28201

alexeyignatiev merged commit 9d22f3e into pysathq:master Apr 16, 2024
1 check passed

alexeyignatiev mentioned this pull request Apr 16, 2024

solve returns false, but the formula is satisfiable #169

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Neg clausify #168

Fix Neg clausify #168

brossignol commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

brossignol commented Apr 14, 2024 •

edited

Loading

alexeyignatiev commented Apr 15, 2024

alexeyignatiev commented Apr 15, 2024 •

edited

Loading

brossignol commented Apr 15, 2024

alexeyignatiev commented Apr 15, 2024

brossignol commented Apr 15, 2024

brossignol commented Apr 15, 2024 •

edited

Loading

alexeyignatiev commented Apr 15, 2024

brossignol commented Apr 15, 2024

alexeyignatiev commented Apr 16, 2024 •

edited

Loading

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

brossignol commented Apr 16, 2024 •

edited

Loading

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024 •

edited

Loading

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 17, 2024

Fix Neg clausify #168

Fix Neg clausify #168

Conversation

brossignol commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

alexeyignatiev commented Apr 12, 2024

brossignol commented Apr 14, 2024 • edited Loading

alexeyignatiev commented Apr 15, 2024

alexeyignatiev commented Apr 15, 2024 • edited Loading

brossignol commented Apr 15, 2024

alexeyignatiev commented Apr 15, 2024

brossignol commented Apr 15, 2024

brossignol commented Apr 15, 2024 • edited Loading

alexeyignatiev commented Apr 15, 2024

brossignol commented Apr 15, 2024

alexeyignatiev commented Apr 16, 2024 • edited Loading

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

brossignol commented Apr 16, 2024 • edited Loading

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

brossignol commented Apr 16, 2024 • edited Loading

brossignol commented Apr 16, 2024

alexeyignatiev commented Apr 16, 2024

alexeyignatiev commented Apr 17, 2024

brossignol commented Apr 14, 2024 •

edited

Loading

alexeyignatiev commented Apr 15, 2024 •

edited

Loading

brossignol commented Apr 15, 2024 •

edited

Loading

alexeyignatiev commented Apr 16, 2024 •

edited

Loading

brossignol commented Apr 16, 2024 •

edited

Loading

brossignol commented Apr 16, 2024 •

edited

Loading