Fix EvalAssert transformation to work on coreutil programs #855

jerhard · 2022-10-13T13:30:40Z

The EvalAssert transformation is currently not able to produce C-output for the bench/coreutil programs, caused by several problems.

Todo:

Fix crash that occurs because some globals are not found by during the transformation EvalAssert transformation crashes on some programs #852
Add declarations of functions before function definitions in the Cil-file that is produced. This way, it is ensured that functions are declared, before references to them are made in asserts (e.g. in asserts about function pointers).
Fix generation of asserts about pointers to malloced-memory. With malloced memory, there are (tmp-)variables of type void* that point to it. The assertions that are currently generated for these variables are not type correct, e.g. because a field offset is applied to a de-referenced void-pointer. Avoid generating invariants for pointers to abstract values of type TVoid for now.
Fix error when running gcc on the output of the transformation generated for cp_comb.c: extent-scan.c:107:18: error: ‘struct fiemap’ has no member named ‘f’. Fix generation of invariants for unions.
Add option to not generate asserts for blobs.

Testing this branch on `bench/coreutil` programs

Run goblint on the selected input file, e.g. cp_comb.c

./goblint  --set pre.cppflags[+] "--std=gnu89" --trans.activated '["assert"]' ../bench/coreutils/cp_comb.c

This will generate a transformed.c. One can check whether this compiles as follows:

gcc transformed.c  -lrt --std=gnu89

fixes #852

…ile. Some asserts may contain function identifiers (e.g. to express the value of a function pointer). Inserting the declarations before the function definitions ensures that the asserts can reference the function identifiers.

sim642 · 2022-10-13T13:51:39Z

Fix generation of asserts about pointers to malloced-memory. With malloced memory, there are (tmp-)variables of type void* that point to it. The assertions that are currently generated for these variables are not type correct, e.g. because a field offset is applied to a de-referenced void-pointer.

I didn't check, but this sounds a lot like something I fixed in #796: 73df4ca. If that's the right thing, then you can just cherry pick it.

…ed-to type is void. If some block has void-type in our abstract domain, we cannot produce meaningful invariants, because we cannot insert the necessary casts easily. To generate invariants even for these casees, one may try to determine the type to cast to by looking at the abstract value more closely.

jerhard · 2022-10-13T15:32:44Z

I didn't check, but this sounds a lot like something I fixed in #796: 73df4ca. If that's the right thing, then you can just cherry pick it.

I needed to additionally add a check whether the pointed to abstract block has type TVoid, because otherwise it would still generate invariants without necessary casts. In the future one may want to look at the abstract value more closely, e.g. by inferring from the fields of a Struct which type it has. Then one could also generate invariants for these cases.

…on declarations directly after last type definition.

jerhard · 2022-10-14T09:21:50Z

I tested this with bench/cp_comb.c: when one sets the surroundByAtomic in EvalAssert to false, and adds an #include<assert.h> to the resulting transformed.c, the C-file can be compiled.

However, the executable one obtains crashes with a failed assertion:

a.out: hash.c:606: hash_initialize: Assertion `table___0->tuning == & default_tuning' failed.
Aborted (core dumped)

sim642 · 2022-10-14T11:06:58Z

However, the executable one obtains crashes with a failed assertion:
a.out: hash.c:606: hash_initialize: Assertion `table___0->tuning == & default_tuning' failed.
Aborted (core dumped)

The code is this:

#line 605
  tmp = malloc(sizeof(*table___0));
#line 605
  table___0 = (Hash_table *)tmp;
#line 606
  if ((unsigned long )table___0 == (unsigned long )((void *)0)) {
#line 607
    return ((Hash_table *)((void *)0));
  }
#line 609
  if (! tuning) {
#line 610
    tuning = & default_tuning;
  }

and the tuning argument actually always is a NULL pointer.

I think the issue is due to flow-insensitivity of the global alloc variable, which initially has all bot fields, but then gets default_tuning in there. But we don't distinguish at which point the initialization has happened, so we think the assertion about the global should also hold before.
This is quite an issue that we haven't considered before regarding the invariant generation.

jerhard · 2022-10-14T11:52:51Z

I think the issue is due to flow-insensitivity of the global alloc variable, which initially has all bot fields, but then gets default_tuning in there. But we don't distinguish at which point the initialization has happened, so we think the assertion about the global should also hold before.
This is quite an issue that we haven't considered before regarding the invariant generation.

Yes, indeed, we came to the same conclusion on the Munich side with @michael-schwarz. For the general case, involving possible multithreading, one would have to perform a must-initialized analysis to ensure that any abstract value we have for globals isn't "shadowing" a possible uninitialized value.

Since at least cp_comb is a single-threaded program and we assume that no threads are created, I reran the transformation with --disable sem.unknown_function.spawn. This should hopefully be sufficient to avoid that the analysis enters the multithreaded mode. When compiling the transformed code and running the resulting binary, the program then does crash with a seg_fault, instead of a failed assertion. There may be some assertion that dereferences some pointer that is not set properly.

jerhard · 2022-10-14T16:43:06Z

Looking at this with @michael-schwarz and Helmut, we found that performing a context-insensitive analysis and not generating asserts for blobs gets rid of the asserts that fail during some simple executions of cp.

src/cdomains/valueDomain.ml

src/util/options.schema.json

src/cdomains/unionDomain.ml

…ransformation.

…Assert transformation." This reverts commit 12a7da9.

This reverts commit 29ef774.

…is given." This reverts commit af65ce4.

michael-schwarz · 2022-10-23T13:01:37Z

Something else comes to mind here: Do we need to submit Goblint as a validator as well in order to benefit from us being able to solve such huge tasks in sv-comp?

Otherwise our ability to reverify these will not benefit us when it comes to points.

sim642 · 2022-10-24T08:12:46Z

That's a good point, but it's too late to do anything about it for this year. We don't have legitimate witness validation support ready (and nor are YAML witnesses as a whole). We can't just use Goblint as a validator, which completely ignores any witness and solves from scratch, because it constitutes as blatant cheating: we would self-confirm all our own witnesses without actually confirming anything. CPAchecker and Ultimate have well-defined and -documented invariant generation subsystems that they disable for validation purposes, we don't. And this is supported by the fact that they have unconfirmed witnesses.

We should just focus on getting these benchmarks fixed and merged first. The plan is to publish a non-SV-COMP-tool-paper on a proper approach to witness validation with Goblint and bring that to SV-COMP legitimately, like it was done with the existing GraphML witnesses.

…re do not match.

…tions.

… configurations.

File moved to bench/scripts/generate_asserts_coreutils.py

src/cdomains/valueDomain.ml

jerhard · 2022-10-25T15:40:38Z

There are still three issues in the generated code that required manual fixing:

e.g. here, asserts are generated for table -- which is a type, not a variable.
e.g. here, the introduction of additional function declarations lead to compiler warnings. Since in other places the introduction of additional function declarations was necessary, I would keep it as is, for now.
here, a coreutil specific issue: I had to replace the type definition of gl_recursive_lock_t with pthread_mutex_t, as the code contained (probably) an old (now incompatible) definition.

jerhard added 2 commits October 13, 2022 10:44

Default to bottom values in the ask.global for transformations.

8230749

fixes #852

sim642 and others added 2 commits October 13, 2022 15:56

Fix YAML witness precondition invariant TypeOfError-s

04aa143

jerhard added 4 commits October 14, 2022 10:14

UnionDomain: Fix invariant generation for the case an offset is given.

af65ce4

Fix indentation in UnionDomain.

29ef774

Use List.partition, remove unnecessary function.

1c899eb

Transformation: Change reordering of Cile.file.globals: insert functi…

bd65a86

…on declarations directly after last type definition.

jerhard added 2 commits October 14, 2022 14:56

Format cilfacade.ml

c2fc3db

Add option witness.invariant.blobs (disabled by default.)

efc9f96

This was referenced Oct 15, 2022

Refinement for unsigned long type not working properly #857

Closed

Use correct ikind for result in inv_exp #858

Merged

Change assert to __VERIFIER_assert for assertions created by evalAssert.

139b03a

sim642 added bug sv-comp SV-COMP (analyses, results), witnesses benchmarking labels Oct 18, 2022

sim642 added this to the SV-COMP 2023 milestone Oct 18, 2022

sim642 reviewed Oct 18, 2022

View reviewed changes

src/cdomains/valueDomain.ml Outdated Show resolved Hide resolved

src/util/options.schema.json Outdated Show resolved Hide resolved

src/cdomains/unionDomain.ml Outdated Show resolved Hide resolved

jerhard added 6 commits October 18, 2022 11:33

Pass WitnessUtil.Invariant to transformations; use it in EvalAssert t…

12a7da9

…ransformation.

Revert "Pass WitnessUtil.Invariant to transformations; use it in Eval…

2dd919e

…Assert transformation." This reverts commit 12a7da9.

Rename option witness.invariant.blobs to ana.base.invariant.blob

61e1b09

Merge branch 'master' into fix_evalAssert

e840d56

Remove check von TVoid in i_deref in ad_invariant.

8b338da

Add script to generate sv-comp bencmarks from coreutils

b5e0fd6

sim642 mentioned this pull request Oct 19, 2022

Fix base Invariant query lval and offset handling #861

Merged

jerhard added 3 commits October 19, 2022 12:57

Revert "Fix indentation in UnionDomain."

105b5e2

This reverts commit 29ef774.

Revert "UnionDomain: Fix invariant generation for the case an offset …

cfa07d6

…is given." This reverts commit af65ce4.

Merge #861

d85be1f

This was referenced Oct 21, 2022

Introduce new possible widening strategy sides-local for globals #863

Merged

exception Lattice.Invalid_widen with enum domain & refinement #864

Closed

Add cast to void* in generated asserts about pointers

bb4da03

michael-schwarz mentioned this pull request Oct 24, 2022

Add definitions for library functions used in cp program #865

Merged

3 tasks

jerhard added 4 commits October 25, 2022 13:35

ad_invariant: Add cast to void* to both pointers if their typesignatu…

22da93a

…re do not match.

EvalAssert transformation: Do not add declarations for __builtin func…

949f1e8

…tions.

coreutils.py: Generate assertions with both interval and non-interval…

cc78242

… configurations.

Remove scripts/coreutils.py from analyzer repo.

6599ed8

File moved to bench/scripts/generate_asserts_coreutils.py

github-advanced-security bot found potential problems Oct 25, 2022

View reviewed changes

src/cdomains/valueDomain.ml Fixed Show fixed Hide fixed

jerhard added 3 commits October 25, 2022 16:39

Use Cilfacade.mkCast instead of Cil.mkCast.

c85cc67

Fix indentation in ValueDomain.ad_invariant.

bed9a19

Update comment in test case.

1071e25

jerhard marked this pull request as ready for review October 25, 2022 14:49

sim642 self-requested a review October 25, 2022 17:28

Merge branch 'master' into fix_evalAssert

a929c64

sim642 mentioned this pull request Oct 28, 2022

Automate manual tests #847

Open

3 tasks

sim642 approved these changes Oct 28, 2022

View reviewed changes

jerhard merged commit e9df254 into master Oct 28, 2022

jerhard deleted the fix_evalAssert branch October 28, 2022 14:25

sim642 added a commit that referenced this pull request Oct 28, 2022

Fix indentation (PR #855)

c9e9e38

sim642 added a commit that referenced this pull request Oct 28, 2022

Fix indentation (PR #855)

9b7fc2b

sim642 added a commit that referenced this pull request Oct 29, 2022

Fix String.starts_with pre OCaml 4.13 (PR #855)

74b4f71

michael-schwarz mentioned this pull request Nov 23, 2022

Improve speed of witness generation for context-insensitive analyses #927

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix EvalAssert transformation to work on coreutil programs #855

Fix EvalAssert transformation to work on coreutil programs #855

jerhard commented Oct 13, 2022 •

edited

Loading

sim642 commented Oct 13, 2022

jerhard commented Oct 13, 2022

jerhard commented Oct 14, 2022

sim642 commented Oct 14, 2022

jerhard commented Oct 14, 2022

jerhard commented Oct 14, 2022

michael-schwarz commented Oct 23, 2022

sim642 commented Oct 24, 2022

jerhard commented Oct 25, 2022

Fix EvalAssert transformation to work on coreutil programs #855

Fix EvalAssert transformation to work on coreutil programs #855

Conversation

jerhard commented Oct 13, 2022 • edited Loading

Testing this branch on bench/coreutil programs

sim642 commented Oct 13, 2022

jerhard commented Oct 13, 2022

jerhard commented Oct 14, 2022

sim642 commented Oct 14, 2022

jerhard commented Oct 14, 2022

jerhard commented Oct 14, 2022

michael-schwarz commented Oct 23, 2022

sim642 commented Oct 24, 2022

jerhard commented Oct 25, 2022

jerhard commented Oct 13, 2022 •

edited

Loading

Testing this branch on `bench/coreutil` programs