Laziness of boolean operators #129

divarvel · 2023-04-24T15:40:41Z

Evaluating some expressions trigger an evaluation failure (for instance ill-typed expressions, or arithmetic operations that trigger over/underflows).

The specification does not explicitly say what is the expected behaviour:

strict evaluation, where having a non-computable expression aborts the whole evaluation
lazy evaluation where having a non-computable expression doesn't necessarily abort the whole evaluation (for instance true || <boom>

The || operator is usually lazy on its second argument, even in strict languages. I think we should do the same, following the principle of least surprise.

The same applies for or (|| is a binary operator allowing to take the disjunction of 2 booleans, whereas or works with rule bodies).

Once the matter is settled, we need to add test cases in the conformance suite to make sure all implementations are consistent.

Currently, in the rust implementation, or is lazy on its second argument, while || is strict on both its arguments.

Making the stack evaluator lazy non-strict

Datalog expressions are evaluated with a stack machine whose semantics are inherently eager: the expression tree is computed from the bottom up, so the boolean operators are evaluated after both sides have been evaluated.

It is possible, however, to make the evaluation non-strict: instead of not evaluating the rhs of true || or false &&, we can evaluate the right hand side, and discard it (even if evaluating it resulted in an error). So the key part is to not abort computation in the case of an error, because a boolean evaluator higher up could discard the error branch altogether. Since datalog expression evaluation cannot have side effects, the outcome is similar to a true lazy evaluation.

One way to do that is to have the stack carry not only sucessfully evaluated values, but also errors. In rust that would mean a Stack<Result<Term,Error>> instead of a Stack<Term>. Evaluating an eager operation on an Err value results in popping this value (or those two values in the case of a binary operation), and pushing it back to the stack. For non-strict operations however (&& and ||), it means popping both left and right values from the stack, then:

if left is Err, push it back to the stack
if left is not err, and right can be discarded, push the result to the stack without inspecting right
if left is not err, and right cannot be discarded, if right is Err, push it back to the stack, if it is not, push the result back to the stack

The expression "test".type() == "int" && "test" > 0, (with $test will translate to the following opcodes: "test", TypeOf, "int", ==, "test", 0, >, &&
Here is how it would be executed:

Op     | stack
       | [ ]
"test" | [ Ok("test") ]
TypeOf | [ Ok("string") ]
"int"  | [ Ok("int"), Ok("string") ]
==     | [ Ok(false) ]
"test" | [ Ok("test"), Ok(false) ]
0      | [ Ok(0), Ok("test"), Ok(false) ]
>      | [ Err(TypeMismatch), Ok(false) ]
&&     | [ Ok(false) ]

The text was updated successfully, but these errors were encountered:

Tipnos · 2023-04-24T16:37:04Z

From my point of view this PR is bound with PR #130 (Heterogeneous == ). I think if you decide to not enforce that "strict evaluation, where having a non-computable expression aborts the whole evaluation". I think it's important also to accept "equality on heterogeneous terms" to stay consistent. The specification will be more intuitive.

Moreover the absence of type checking and compilation is an argument in favor of that. It would be weird to have unexpected "type checking" errors at runtime and no tools to prevent them upstream.

Geal · 2023-04-25T19:47:31Z

I agree, it should be lazy, that's the most commonly expected behaviour

divarvel · 2023-10-11T13:34:09Z

i have a proof of concept here biscuit-auth/biscuit-rust#188

the proof-of-concept makes them non-strict rather than lazy. The difference is subtle (and not observable from the outside in the absence of side-effects), but still important to keep in mind.

Geal mentioned this issue Apr 25, 2023

boolean logic should be lazy biscuit-auth/biscuit-rust#150

Open

divarvel added this to the Datalog update milestone Oct 10, 2023

divarvel mentioned this issue Oct 10, 2023

Datalog breaking changes #147

Open

This was referenced Oct 18, 2023

Roadmap #12

Open

Lazy operators #150

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Laziness of boolean operators #129

Laziness of boolean operators #129

divarvel commented Apr 24, 2023 •

edited

Loading

Tipnos commented Apr 24, 2023 •

edited

Loading

Geal commented Apr 25, 2023

divarvel commented Oct 11, 2023

Laziness of boolean operators #129

Laziness of boolean operators #129

Comments

divarvel commented Apr 24, 2023 • edited Loading

Making the stack evaluator lazy non-strict

Tipnos commented Apr 24, 2023 • edited Loading

Geal commented Apr 25, 2023

divarvel commented Oct 11, 2023

divarvel commented Apr 24, 2023 •

edited

Loading

Tipnos commented Apr 24, 2023 •

edited

Loading