Solving the FP completeness issues in #1723 #3023

mattulbrich · 2023-02-05T15:37:12Z

Related Issue

This pull request answers to issue #1723.

Intended Change

Floats and doubles are currently incorrectly insufficiently handled when they appear in
one expression.

Casts need to be added and casts need to be dealt with. ...

introducing casts where needed
adding rules / SMT support for such casts.

Type of pull request

Bug fix (non-breaking change which fixes an issue)
Refactoring (behaviour should not change or only minimally change)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
There are changes to the (Java) code
There are changes to the taclet rule base
There are changes to the deployment/CI infrastructure (gradle, github, ...)
Other:

Ensuring quality

I made sure that introduced/changed code has been well documented (javadoc).
I added new test case(s) for new functionality.
I have tested the feature as follows: small 2-liners
I have checked that runtime performance has not deteriorated

Additional information and contact(s)

It's still W I P.

@samysweb

The contributions within this pull request will be licensed under GPL2-or-later wihtin KeY.

samysweb · 2023-03-09T15:55:10Z

I just did some testing and the floating point balancing seems to work relatively reliably for binary operations.
I currently still see two issues with the current state:

Edge case for binary expressions

There seems to be one edge case due to the replacement strategy of ProgramElementReplacer which always substitutes the first occurrence.
The bug shows up when analyzing the following program:

/*@ public normal_behavior
  @ requires x1 > 0.0f && x2 > 0.0f;
  @ ensures \result>0.0;
  @*/
public double myFunction05(float x1, float x2) {
    double squared1 = ((double) x1)+x1;
    return squared1;
}

At some point within the proof we get to the expression squared1 = ((double)_x1)+_x1
At this point we can either apply the compound_addition_1 taclet (then everything works out fine) or we can apply the cast taclet which (applied repeatedly) results in:

((double)(double)_x1)+_x1
((double)(double)(double)_x1)+_x1
...

This is because ProgramElementReplacer finds _x1 in the left child of + first and replaces it there.
I'm not sure about the consequences of this:

It seems that this introduces some non-determinism in the symbolic execution meaning if we always apply the wrong rule we could get stuck
From my understanding of how replacements work this does not seem easily fixable without reimplementing a position based version of ProgramElementReplacer

How to do position based replacement?

Assignments/Returns

The current rules did not yet take this case into account, to this end I began drafting rules in #3062 (On my own fork since I don't have writing rights on this branch).
The added taclets allow for the treatment of most widening assignments, however there are still some limitations:

Auto mode currently breaks on the first appearance of such an assignment, as the assignment taclet seems to take precedence but fails upon application due to type incompatibilities
If I understand this correctly, a binary operation between two integral types (e.g. two ints) is not a simple expression and thus the rules added in Floating Point rules for casts #3062 currently do not apply. Currently this means the addition rule for adding 2 ints is applied and fails with an exception due to a type mismatch. I'm not sure if this can be fixed without replicating all assignmentMultiplicationInt,assignmentAdditionInt etc. taclets for float and long

Notes from our conversation just now:

Maybe add type sanity checks to all currently existing assignment rules and only allow assignment if same sort or subtype (currently this is only discovered upon application and prevented through an exception which aborts auto mode)
If we do this then new rules are needed:
- Binary Operation between two simple expression with unbalanced types: Adjust unbalanced_float_expression from this branch
- Assignment d = se0 + se1 where se0 and se1 have same type but d differs: e=se0+se1;d=e where e has type of se0
- d=e with e simple expression: type cast

samysweb · 2023-03-13T09:30:49Z

Huh interesting, so what really happens when you click GitHub's innocently looking "rebase" button is a force-push...

mattulbrich · 2023-03-13T09:33:25Z

Huh interesting, so what really happens when you click GitHub's innocently looking "rebase" button is a force-push...

Ok, anything lost or was it a fast-forward with a more aggressive name?

samysweb · 2023-03-13T09:36:42Z

I don't think anything should have gotten lost.
Probably just a more aggressive name because the main commits are inserted before the commits on this branch.

On that note: Unfortunately, I just realized #3027 is not merged yet

…mu/unbalancedFloats

github-advanced-security · 2023-04-04T16:44:16Z

You have successfully added a new QDJVMC configuration project/qodana. As part of the setup process, we have scanned this repository and found 1 existing alert. Please check the repository Security tab to see all alerts.

mattulbrich · 2023-04-11T13:18:25Z

key.core/src/main/java/de/uka/ilkd/key/rule/conditions/FloatingPointBalancedCondition.java

+
+        BinaryOperator properResultInst = balance(inInst, services);
+        if (properResultInst == null) {
+            return null;


Smells like a false alert?

samysweb · 2023-04-05T11:47:37Z

I'm saving the runtimes of the last test run here so that we can compare runtimes once my pull request (with lots of additional \varcond statements) is merged into this branch:

Job	Run time
optional-tests	32m 35s
integration-tests (testProveRules, ubuntu-latest, 11)	16m 5s
integration-tests (testRunAllFunProofs, ubuntu-latest, 11)	1h 27m 29s
integration-tests (testRunAllInfProofs, ubuntu-latest, 11)	56m 47s
unit-tests (ubuntu-latest, 11)	21m 42s
unit-tests (ubuntu-latest, 17)	20m 44s
Overall	3h 55m 22s

This should fix the floating point cast completeness issues -- let's see whether it breaks anything (check the CI tests and runtimes after this merge)

github-actions · 2023-04-11T16:52:37Z

Thank you for your contribution.

The test artifacts are available on Artiweb.
The newest artifact is here.

samysweb · 2023-04-11T17:32:00Z

Looks like all tests are passing.
Comparison of runtimes:

Job	Run time old	Runtime after merge
optional-tests	32m 35s	--
integration-tests (testProveRules, ubuntu-latest, 11)	16m 5s	11m 46s
integration-tests (testRunAllFunProofs, ubuntu-latest, 11)	1h 27m 29s	1h 46m 50s*
integration-tests (testRunAllInfProofs, ubuntu-latest, 11)	56m 47s	45m 43s
unit-tests (ubuntu-latest, 11)	21m 42s	17m 24s
unit-tests (ubuntu-latest, 17)	20m 44s	25m 18s

*repeating the run resulted in a runtime of 1h 24m 34s -- it seems this was just due to the high variance of runtimes on GitHub (so it's very likely that all the other "better" results are noise as well)

Left to do:

Check once more that rules actually do the job
Check that SMT translation is available (see above)
Is slower run of testRunAllFunProofs due to changes in int rules?

samysweb · 2023-04-14T12:57:13Z

Alternative to variable conditions: Create a new schema variable IntegralVariable and then use #integralLoc (code for Schema Variable class provided by Richard Bubel).

mattulbrich and others added 4 commits February 5, 2023 16:00

this contributed to solving the FP completeness issues in #1723

5fcf646

Merge branch 'main' into mu/unbalancedFloats

f7924c8

First try in fixing cast assignments

bd9d054

Added Int/Long to Double casting rules

89cfded

samysweb mentioned this pull request Mar 9, 2023

Floating Point rules for casts #3062

Merged

this contributed to solving the FP completeness issues in #1723

085614e

samysweb force-pushed the mu/unbalancedFloats branch from f7924c8 to 085614e Compare March 13, 2023 09:29

FliegendeWurst added Calculus Completeness labels Mar 24, 2023

samysweb added 8 commits April 3, 2023 11:44

Merge branch 'main' into steuber/unbalancedFloats

2683767

Added type constraint to assignment and int assignment rules

f5970f1

Might have fixed completeness bug w.r.t. casts

1595a2e

Rebuilt test oracle

b6603d4

Merge branch 'mu/unbalancedFloats' of github.com:KeYProject/key into …

73da788

…mu/unbalancedFloats

Merge branch 'main' into mu/unbalancedFloats

56f918f

Merge branch 'main' into steuber/unbalancedFloats

1a6b938

Merge branch 'mu/unbalancedFloats' into steuber/unbalancedFloats

59ba9ea

github-advanced-security bot found potential problems Apr 4, 2023

View reviewed changes

Fix spotless

d05f402

Floating Point rules for casts (#3062)

02be342

This should fix the floating point cast completeness issues -- let's see whether it breaks anything (check the CI tests and runtimes after this merge)

wadoon assigned mattulbrich Aug 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Solving the FP completeness issues in #1723 #3023

Solving the FP completeness issues in #1723 #3023

mattulbrich commented Feb 5, 2023

samysweb commented Mar 9, 2023

samysweb commented Mar 13, 2023

mattulbrich commented Mar 13, 2023

samysweb commented Mar 13, 2023

github-advanced-security bot commented Apr 4, 2023

mattulbrich Apr 11, 2023

samysweb commented Apr 5, 2023

github-actions bot commented Apr 11, 2023

samysweb commented Apr 11, 2023 •

edited

Loading

samysweb commented Apr 14, 2023

Solving the FP completeness issues in #1723 #3023

Are you sure you want to change the base?

Solving the FP completeness issues in #1723 #3023

Conversation

mattulbrich commented Feb 5, 2023

Related Issue

Intended Change

Type of pull request

Ensuring quality

Additional information and contact(s)

samysweb commented Mar 9, 2023

Edge case for binary expressions

Assignments/Returns

samysweb commented Mar 13, 2023

mattulbrich commented Mar 13, 2023

samysweb commented Mar 13, 2023

github-advanced-security bot commented Apr 4, 2023

mattulbrich Apr 11, 2023

Choose a reason for hiding this comment

samysweb commented Apr 5, 2023

github-actions bot commented Apr 11, 2023

samysweb commented Apr 11, 2023 • edited Loading

samysweb commented Apr 14, 2023

samysweb commented Apr 11, 2023 •

edited

Loading