SSA feature extractor #1811

rafaelsamenezes · 2024-05-03T13:31:24Z

As discussed in the local meetings, we could start checking other types of encoding that could be used. But first we should check how many benchmarks could be affected by this. So this PR basically prints features found in the SSA trace, we can then use benchexec to collect statistics about how many benchmarks are going to be affected.

src/esbmc/options.cpp

Co-authored-by: intrigus-lgtm <60750685+intrigus-lgtm@users.noreply.github.com>

fbrausse

In general looks like a good idea which I do approve of. But I have a few comments, mostly about being too conservative.

fbrausse · 2024-05-07T16:31:09Z

src/esbmc/options.cpp

+     {"symex-trace", NULL, "print instructions during symbolic execution"},
+     {"ssa-trace", NULL, "print SSA during SMT encoding"},
+     {"ssa-smt-trace", NULL, "print generated SMT during SMT encoding"},
+     {"ssa-features-dump", NULL, "print features in the SSA (just before conversion)"},
    {"symex-ssa-trace", NULL, "print generated SSA during symbolic execution"},
    {"goto2c", NULL, "translate the GOTO program to C"},
    {"show-goto-value-sets",


This is weird formatting. Not your PR. Just wondering whether clang-format is having a bad time with this options.cpp file and we should just do it in one go...

src/goto-symex/features.cpp

fbrausse · 2024-05-07T16:35:28Z

src/goto-symex/features.cpp

+  case expr2t::modulus_id:
+    if (!(is_constant_expr(dynamic_cast<const arith_2ops &>(*e).side_1) ||
+          is_constant_expr(dynamic_cast<const arith_2ops &>(*e).side_2)))
+      features.insert(SSA_FEATURES::NON_LINEAR);


Questionable choice: Is (unsigned short)23 * 42 (i.e. a typecast) really a non-linear expression?

Could you clarify? You commented on modulus. Also, shouldn't be simplified by the constant folding?

Sorry, I was referring to mul_id, which is also handled by this switch case, Github just defaults to 4 lines of context. Constant folding should do it, yes. Is it a necessary preprocessing step to this algorithm?

Sorry, I was referring to mul_id, which is also handled by this switch case, Github just defaults to 4 lines of context. Constant folding should do it, yes. Is it a necessary preprocessing step to this algorithm?

Ow I see, you are right. Well, I could add a contains_symbolic(const expr2tc &e) into irep utils and use it then. Guess that is safer.

Well, the question is, what "SSA contains non-linear features" really means. Consider for instance a := b / 0. This cannot be simplified because it's undefined. But if b was a constant, contains_symbolic() would return false. How would that correspond to "contains non-linear features"? Maybe an easier approach is to call the simplification directly and ask whether the result is "entirely constant" meaning, that it positively only contains constant_*2t expressions and defined functions (such as typecasts and no signed overflows). This predicate could be extended later, no need to get it perfect right now in this PR.

src/goto-symex/features.cpp

rafaelsamenezes requested review from fbrausse and emanino May 3, 2024 13:31

intrigus-lgtm reviewed May 3, 2024

View reviewed changes

src/esbmc/options.cpp Outdated Show resolved Hide resolved

rafaelsamenezes and others added 2 commits May 7, 2024 12:38

[ssa] added features extractor algorithm

52bda95

[ssa] fixed typo on option

38d25e5

Co-authored-by: intrigus-lgtm <60750685+intrigus-lgtm@users.noreply.github.com>

rafaelsamenezes force-pushed the ssa_features branch from 8941a20 to 38d25e5 Compare May 7, 2024 11:38

fbrausse reviewed May 7, 2024

View reviewed changes

rafaelsamenezes added 2 commits May 9, 2024 10:54

[ssa] removed header message from features

6dad1a4

[ssa] inlined for loop of arrays

c9995bb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SSA feature extractor #1811

SSA feature extractor #1811

rafaelsamenezes commented May 3, 2024

fbrausse left a comment

fbrausse May 7, 2024

fbrausse May 7, 2024

rafaelsamenezes May 9, 2024

fbrausse May 9, 2024

rafaelsamenezes May 9, 2024 •

edited

fbrausse May 9, 2024

SSA feature extractor #1811

Are you sure you want to change the base?

SSA feature extractor #1811

Conversation

rafaelsamenezes commented May 3, 2024

fbrausse left a comment

Choose a reason for hiding this comment

fbrausse May 7, 2024

Choose a reason for hiding this comment

fbrausse May 7, 2024

Choose a reason for hiding this comment

rafaelsamenezes May 9, 2024

Choose a reason for hiding this comment

fbrausse May 9, 2024

Choose a reason for hiding this comment

rafaelsamenezes May 9, 2024 • edited

Choose a reason for hiding this comment

fbrausse May 9, 2024

Choose a reason for hiding this comment

rafaelsamenezes May 9, 2024 •

edited