
OSS: extremely bad replaceKnown performance #3248

Open · FliegendeWurst opened this issue Aug 19, 2023 · 11 comments · May be fixed by #3272

@FliegendeWurst (Member) commented Aug 19, 2023

Description

During automated proof search, replaceKnown accounts for 43% of all CPU time (the profiler view below is zoomed in):

[profiler screenshot]

As far as I can tell, almost no replace_known_left / replace_known_right rules were actually applied.
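For context, replace_known_left/right rewrites occurrences of a formula already known from the sequent to true or false. A rough sketch of why the naive candidate search is expensive, assuming KeY's Term interface (op(), arity(), sub(int)); the helper name is hypothetical, this is not KeY's actual code:

```java
import java.util.Set;
import de.uka.ilkd.key.logic.Term;

// Sketch of the cost model: every subterm of every formula has to be
// checked against the set of formulas known from the sequent. Even with
// a hash set this walk is linear in the sequent's term size, and it is
// repeated after every rule application, which is where the time goes.
final class ReplaceKnownSketch {
    static boolean hasKnownSubterm(Term t, Set<Term> known) {
        if (known.contains(t)) {   // one lookup per subterm
            return true;
        }
        for (int i = 0; i < t.arity(); i++) {
            if (hasKnownSubterm(t.sub(i), known)) {
                return true;
            }
        }
        return false;
    }
}
```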

Reproducible

always

Steps to reproduce

  1. Load a Java file with a hard-to-prove specification
  2. Run one of the strategies

I can provide the Java file if needed.

Expected behaviour: acceptable performance; KeY spends its CPU time on useful work.
Actual behaviour: performance degrades as the proof grows; KeY wastes half its CPU time.

Additional information

I should mention that I modified the Simplifier to consider more rulesets.

@mattulbrich (Member) commented

Thanks for investigating. A number of years back, we investigated this issue, and I can confirm your observation: the majority of time is spent (wasted?) on equality treatment. There were a few suggestions back then to change the treatment.

One plan was to set up a set of the left-hand sides of all known equations, check each newly introduced term for membership in this set, and trigger a rule application on a hit. For some reason the implementation turned out not to be faster, but I still believe it should have been.

I'd be happy if we could revisit this issue.
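A minimal sketch of that plan, assuming KeY's Term type; EquationIndex and its methods are hypothetical names, not existing KeY API:

```java
import java.util.HashMap;
import java.util.Map;
import de.uka.ilkd.key.logic.Term;

// Hypothetical index from left-hand sides of known equations to their
// right-hand sides. Only terms that a rule application newly introduces
// are checked, so each check is a single hash lookup instead of matching
// every known equation against every subterm of the sequent.
final class EquationIndex {
    private final Map<Term, Term> lhsToRhs = new HashMap<>();

    // Called when an equation lhs = rhs becomes known in the sequent.
    void addEquation(Term lhs, Term rhs) {
        lhsToRhs.put(lhs, rhs);
    }

    // Called only for newly introduced terms; returns null if no
    // equation applies, otherwise the replacement to trigger.
    Term rewriteCandidate(Term newlyIntroduced) {
        return lhsToRhs.get(newlyIntroduced);
    }
}
```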

@FliegendeWurst (Member, Author) commented Aug 20, 2023

This is the hash code used for the map:

```java
@Override
public int hashCode() {
    // Allow more conflicts to ensure that naming and term labels are ignored.
    return term.op().hashCode();
}
```

I think performance can be improved by adding more data to the hash code. For example, the term depth must also be equal for two terms to be equal (I believe):

```java
@Override
public int hashCode() {
    // java.util.Objects: combine the operator with the term depth.
    return Objects.hash(term.op(), term.depth());
}
```
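One could go further in the same spirit. A hedged variant (an illustration, not a tested patch) that also mixes in arity and sort, both of which, like the operator and the depth, are unaffected by bound-variable renaming and term labels:

```java
import java.util.Objects;
import de.uka.ilkd.key.logic.Term;

final class TermHashSketch {
    // op, arity, depth and sort are all stable under renaming and term
    // labels, so the label-ignoring equals/hashCode contract should
    // still hold while hash buckets get smaller.
    static int structuralHash(Term term) {
        return Objects.hash(term.op(), term.arity(), term.depth(), term.sort());
    }
}
```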

@mattulbrich (Member) commented

I don't think that this coarse hash code scheme is the problem.

I replaced it with the hashing function in https://gist.github.com/mattulbrich/28c0f53f5a9f608c8c86f2ab1da39546, and it produced proofs of almost the same runtime (and same number of rule apps).

I am not fully sure how it works, but Richard's compiled taclet matcher may be an efficient alternative to checking all subterms with a hashtable ...

/cc @unp1

@FliegendeWurst (Member, Author) commented Aug 20, 2023

> I replaced it with the hashing function in gist.github.com/mattulbrich/28c0f53f5a9f608c8c86f2ab1da39546, and it produced proofs of almost the same runtime (and same number of rule apps).

I do think it has a big impact, at least in some situations. Your proposed hash is much slower (mine: 282 seconds for 100% of the replay; yours: 360 seconds for 19% of the replay).

@mattulbrich (Member) commented

Ok, that obviously depends on the program to load. I apparently looked at a smaller program. Which example do you use for reference?

@mattulbrich (Member) commented

While you are at it: the treatment of equalities (applyEq) was also found to be very slow and inefficient, and turned out to be the major time waster in many proofs. Feel free to fix it if you stumble across it. :-P

@FliegendeWurst (Member, Author) commented

> Ok, that obviously depends on the program to load. I apparently looked at a smaller program. Which example do you use for reference?

I use https://gist.github.com/FliegendeWurst/9be15e20e3bfd0722fcdd7f5c4afebcd. It requires some of my changes in https://github.com/FliegendeWurst/key/tree/testing, most notably the support for \dl_seqGet__int (although by now I think it could simply be written as (int) \dl_seqGet).

@FliegendeWurst (Member, Author) commented Aug 21, 2023

With some more optimizations it is possible to eliminate this particular slowdown:
[profiler screenshot]
Now the other bit of logic in the Simplifier takes approx. 90% of the CPU time. This is actually expected, because I added some more rules to the OSS for this experiment (selectOfStore, selectOfCreate, selectOfAnon, selectOfMemset, dismissNonSelectedField, selectCreatedOfAnon, castDel, sortsDisjointModuloNull, ifthenelse_negated, elementOfUnion, elementOfArrayRange).

In fact, almost all CPU time is spent in refactorLabelsRecursive.
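If refactorLabelsRecursive really dominates, one hedged idea (a sketch with hypothetical names, not a patch against the actual TermLabelManager) is to memoize refactoring results per term object: KeY terms are immutable, so a shared subterm only ever needs to be label-refactored once per pass.

```java
import java.util.IdentityHashMap;
import java.util.Map;
import java.util.function.UnaryOperator;
import de.uka.ilkd.key.logic.Term;

// Hypothetical memoization layer around a single refactoring pass:
// identical (shared) subterm objects are refactored exactly once.
final class LabelRefactoringCache {
    private final Map<Term, Term> cache = new IdentityHashMap<>();

    Term refactor(Term t, UnaryOperator<Term> refactorOnce) {
        return cache.computeIfAbsent(t, refactorOnce);
    }
}
```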

@mattulbrich (Member) commented Aug 21, 2023 via email

@FliegendeWurst (Member, Author) commented Aug 21, 2023

> Regarding selectOfStore etc.: Christoph Scheben implemented a set of taclets which pull out common heap terms so that select-store chains need not be reduced more than once. Adding taclets to the OSS may render these techniques useless.

That's of course even better. I will look into it.

In any case, I have done another profiling session (this time with origin tracking disabled):
[profiler screenshot]
It appears CPU time is spent on:

  • finding the taclet (~75%)
  • rewriting the term (~20%)
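Given that roughly 75% goes to finding the taclet, the obvious lever is narrowing the candidate set before full matching. KeY already maintains a taclet index; the sketch below (hypothetical types, not the real TacletIndex) only illustrates the general idea of bucketing rewrite rules by the top operator of their find pattern:

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import de.uka.ilkd.key.logic.Term;
import de.uka.ilkd.key.logic.op.Operator;

// Illustrative only: rules are bucketed by the top operator of their
// find pattern, so rule lookup is one map access plus matching a short
// candidate list, instead of trying every rule at every position.
final class TopOpRuleIndex<R> {
    private final Map<Operator, List<R>> byTopOp = new HashMap<>();

    void add(Operator findTopOp, R rule) {
        byTopOp.computeIfAbsent(findTopOp, k -> new ArrayList<>()).add(rule);
    }

    List<R> candidatesFor(Term target) {
        return byTopOp.getOrDefault(target.op(), List.of());
    }
}
```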

@FliegendeWurst (Member, Author) commented

I have done some more tests on ShortestPath.java:

| Settings | Proof size | Rule apps (after each macro) | Time (after each macro) |
| --- | --- | --- | --- |
| No origin tracking, OSS with additional rules | 3008 nodes | 226,182 / 462,239 | 71.2 / 92.2 s |
| Origin tracking, OSS with additional rules | 3008 nodes | 226,182 / 462,239 | 110.2 / 136.6 s |
| No origin tracking, OSS without additional rules | >53k nodes | 16,877 / >124k | <25 s / >9 h |

I ran the macros Finish Symbolic Execution and Heap Simplification in each configuration (the rule app counts above are after executing each macro).
The last run did not finish within 9 hours; after stopping the macro the UI became unresponsive, so those numbers are lower bounds.
