Inlining overhaul for test generation #4255

Dargones · 2023-07-06T12:14:02Z

This PR overhauls implementation of inlining, on which test generation relies to generate system-level tests, and which can in the future be used for other purposes, such as bounded model checking. Inlining, in this context, means adjusting the Dafny to Boogie translation in such a way that the resulting Boogie program, when you call Boogie's Inline method on it, merges all specified Dafny functions and methods, including all corresponding well-formedness checks, into a single procedure. The key AST transformation necessary to make this happen is removal of short circuiting expressions from the original program. Most of what this PR does has to do with lifting this short-circuit-removal pass from Boogie to Dafny. This achieves three things:

The new implementation is much more efficient. The old implementation can generate unit tests and inline one or two methods but not entire programs. Once Boogie 3.0 is merged into Dafny, the speed up should increase further.
The new implementation preserves Dafny line/column information to report coverage in terms of Dafny source rather than its Boogie representation. Making this change has been one of the main asks pertaining to test generation.
The new code should be, I think, easier to maintain going forward. It is split into disjoined transformations over Dafny/Boogie ASTs (here is the main coordinating method) and makes as few assumptions about the translation process as possible (whereas the previous implementation would rely very heavily on position and existence of various constructs that Dafny translator creates)

By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license.

atomb

This looks like a great improvement. I have a couple of suggestions about error message wording, and a couple of questions, but nothing major.

atomb · 2023-07-14T21:21:11Z

Source/DafnyTestGeneration.Test/Setup.cs

@@ -26,7 +29,6 @@ public class Setup {

    public static TheoryData<List<Action<DafnyOptions>>> OptionSettings() {
      var optionSettings = new TheoryData<List<Action<DafnyOptions>>>();
-      optionSettings.Add(new() { options => options.TypeEncodingMethod = CoreOptions.TypeEncoding.Arguments });
      optionSettings.Add(new() { options => options.TypeEncodingMethod = CoreOptions.TypeEncoding.Predicates });


Just leaving a note to have a record that we would like to figure out why it doesn't work well with Arguments and move it over to that mode once we do that for Dafny more broadly.

atomb · 2023-07-14T21:29:25Z

Source/DafnyTestGeneration/Main.cs

+                     $"{coveredByTests} should be covered by tests " +
+                     $"(assuming no tests were found to be duplicates of each other). " +
+                     $"Moreover, {coveredByCounterexamples} locations have been found to be reachable " +
+                     $"(i.e. the verifier returned a counterexample and did not timeout). " +


I wonder whether a word other than "counterexample" would be more useful here to folks who don't know as much about the inner workings of how test generation uses SMT.

Done - paraphrased to "...the verifier did not timeout and produced example inputs to reach these locations"

atomb · 2023-07-14T21:48:51Z

Source/DafnyTestGeneration/Inlining/AddByMethodRewriter.cs

+  }
+
+  internal void PreResolve(Program program) {
+    AddByMethod(program.DefaultModule);


Does this skip other modules?

It shouldn't - all modules are submodules of the default module

atomb · 2023-07-14T21:52:23Z

Source/DafnyTestGeneration/Inlining/SeparateByMethodRewriter.cs

+namespace DafnyTestGeneration.Inlining;
+
+/// <summary>
+/// Separates the bodies of function-by-methods into separate methods so that translator will process them accordingly.


It's cool that you can do this. I can think of some other places it could be useful.

atomb · 2023-07-14T22:02:41Z

Source/DafnyTestGeneration/ProgramModification.cs

@@ -131,25 +156,35 @@ private enum Status { Success, Failure, Untested }
        return counterexampleLog;
      }
      var log = writer.ToString();
+      // If Dafny finds several assertion violations (e.g. because of inlining one trap assertion multiple times),
+      // pick the first execution trace and extract the counterexample from it


Is this always the right trace?

The choice of the trace shouldn't matter since I replace all native assertions with assumptions and so the only assertion that can fail is the one that is being targeted. One can imagine an optimization that picks a trace covering the most blocks but in terms correctness the choice should not affect the result.

atomb · 2023-07-14T22:03:18Z

Source/DafnyTestGeneration/ProgramModification.cs

-          $"e.g. if branching is conditional on the result of a trait instance " +
-          $"method call.");
+          $"for {duplicate.uniqueId} - this may occur if the code under test is non-deterministic, " +
+          $"if a method/function is not inlined, or if test generation cannot extract a value from a counterexample.");


Same comment about "counterexample" as before.

atomb · 2023-07-20T23:00:50Z

Those two failures aren't your fault. One will be fixed by #4317 and the other is non-deterministic.

Dargones · 2023-07-20T23:04:43Z

Got it, thank you @atomb!

This PR overhauls implementation of inlining, on which test generation relies to generate system-level tests, and which can in the future be used for other purposes, such as bounded model checking. Inlining, in this context, means adjusting the Dafny to Boogie translation in such a way that the resulting Boogie program, when you call Boogie's `Inline` method on it, merges all specified Dafny functions and methods, including all corresponding well-formedness checks, into a single procedure. The key AST transformation necessary to make this happen is removal of short circuiting expressions from the original program. Most of what this PR does has to do with lifting this short-circuit-removal pass from Boogie to Dafny. This achieves three things: - The new implementation is much more efficient. The old implementation can generate unit tests and inline one or two methods but not entire programs. Once Boogie 3.0 is merged into Dafny, the speed up should increase further. - The new implementation preserves Dafny line/column information to report coverage in terms of Dafny source rather than its Boogie representation. Making this change has been one of the main asks pertaining to test generation. - The new code should be, I think, easier to maintain going forward. It is split into disjoined transformations over Dafny/Boogie ASTs (here is the [main coordinating method](https://github.com/Dargones/dafny/blob/a5004d6d62d324433b57937363605b8ba975dca6/Source/DafnyTestGeneration/Inlining/InliningTranslator.cs#L22-L47)) and makes as few assumptions about the translation process as possible (whereas the previous implementation would rely very heavily on position and existence of various constructs that Dafny translator creates) By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license. --------- Co-authored-by: Aleksandr Fedchin <fedchina@amazon.com>

Dargones and others added 30 commits March 9, 2023 21:14

Fix self-referencing issue

b15343c

Take nullability into account

28be201

Merge branch 'master' into SelfReference

ecc17bc

Use Assert.Single instead of Assert.Equal

4259481

Support constant fields

ea7aa6c

Improve Code Quality

b919fde

Merge branch 'master' into ConstantFields

930c200

Dummy commit ot rerun tests

b1daf73

Fix issue to pass tests

380286e

Add inlining attribute

9881d89

Add another potential error message

02db8b2

Rename div to mod

eb98375

Dummy commit to rerun tests

a6342bc

rerun tests

bf8b7e1

Merge branch 'master' into InliningAttribute

05663c4

Remove --inline option and detect attributes automatically

b382eb2

rerun tests

e44f849

Increase stack size

544d32b

Only transform procedures to be tested/inlined

56d9992

Merge branch 'IncreaseStackSize' into Dev

d04c7d6

Merge branch 'InliningAttribute' into Dev

b69d6ad

Merge branch 'ConstantFields' into Dev

0237c85

Merge branch 'SelfReference' into Dev

6e6dba5

Add changes from currentfixed

8c5b078

User defined constructor methods prototype

1d57e2c

Fix by-method transformation

fd904ea

Merge branch 'master' into Dev

7b327bb

Use cloner in function to method transformation

0d2a846

Merge branch 'FunctionToMethod' into Dev

db16cfa

Minor fix

8a3d35d

Aleksandr Fedchin and others added 13 commits July 5, 2023 23:04

Various fixed

98de29d

Fixed all existing tests

94559dc

Better Test Formatting

0ec9e1b

Done with tests for statements

747ad5d

Final changes

b2427a4

Remove comments about things to do once Booige 3.0 is merged

6256d0e

Release notes and README fix

a5004d6

Merge branch 'dafny-lang:master' into InlinePR

02c9379

Dotnet format

042c2ff

Patch tests

bbf1dd0

Merge branch 'master' into InlinePR

92439c1

See if Test Generation passes windows tests with predicates only

f7be37c

Merge branch 'master' into InlinePR

0c3acd1

atomb requested changes Jul 14, 2023

View reviewed changes

Aleksandr Fedchin added 2 commits July 20, 2023 15:34

Do not reference counterexamples in debug messages

d11b11f

Merge remote-tracking branch 'origin/master' into InlinePR

52e785b

atomb previously approved these changes Jul 20, 2023

View reviewed changes

Dargones and others added 2 commits July 20, 2023 18:29

Merge branch 'master' into InlinePR

9ecf6d4

Merge remote-tracking branch 'origin/master' into InlinePR

198d730

Dargones dismissed atomb’s stale review via 198d730 July 22, 2023 00:05

This was referenced Jul 24, 2023

Dafny Test Coverage Report #4325

Merged

[Test Generation] Change the set of coverage criteria that can be targeted #4326

Merged

atomb approved these changes Jul 24, 2023

View reviewed changes

atomb enabled auto-merge (squash) July 25, 2023 21:32

Merge branch 'master' into InlinePR

eef5efb

atomb merged commit 22d5422 into dafny-lang:master Jul 26, 2023
18 checks passed

BrewTestBot mentioned this pull request Sep 29, 2023

dafny 4.3.0 Homebrew/homebrew-core#146965

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inlining overhaul for test generation #4255

Inlining overhaul for test generation #4255

Dargones commented Jul 6, 2023

atomb left a comment

atomb Jul 14, 2023

atomb Jul 14, 2023

Dargones Jul 20, 2023

atomb Jul 14, 2023

Dargones Jul 20, 2023

atomb Jul 14, 2023

atomb Jul 14, 2023

Dargones Jul 20, 2023

atomb Jul 14, 2023

atomb commented Jul 20, 2023

Dargones commented Jul 20, 2023

Inlining overhaul for test generation #4255

Inlining overhaul for test generation #4255

Conversation

Dargones commented Jul 6, 2023

atomb left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

atomb commented Jul 20, 2023

Dargones commented Jul 20, 2023