Add verifier passes #41

jpsamaroo · 2020-10-27T23:31:02Z

Add dead instruction-detection pass
Add loop-detection pass
Add instruction walker utility

jpsamaroo · 2020-10-27T23:49:02Z

I'm not familiar with how the test framework works (I don't have the Python expertise needed to run it locally), but I'd like to try enabling the verifier passes for all tests that are supposed to be valid programs. If this is desired, could someone help me determine the right invocation to set this up?

jpsamaroo · 2020-10-28T00:38:51Z

I figured out how to run the tests locally; I'll try to get them all running and passing with the verifier enabled.

coveralls · 2020-10-28T00:52:57Z

Coverage remained the same at 96.049% when pulling 459a393 on jpsamaroo:jps/verifier into 089f627 on iovisor:master.

jpsamaroo · 2020-10-28T13:44:51Z

I'm also going to add a pass to ensure all accessed registers are initialized before use, which should be a suitable alternative to #37.

jpsamaroo · 2020-10-29T02:27:55Z

One slight caveat with the uninitialized registers pass: it doesn't confirm that a register is initialized on every branch leading to the instruction that uses it. I'd be happy to fix this later, but I think the current implementation should at least catch the most common cases for now.
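To illustrate the caveat above, here is a minimal sketch of a per-branch check, assuming a simplified instruction model. Every name in it (`ri_inst`, `ri_check`, `RI_REG`, the `reads`/`writes` bitmasks, the 64-instruction cap) is hypothetical and is not part of uBPF or of this PR. The idea is a forward data-flow pass: the set of initialized registers at a join point is the intersection of the sets arriving from each predecessor.

```c
#include <assert.h>
#include <stdint.h>

#define RI_MAX 64                          /* hypothetical program-size cap */
#define RI_REG(n) ((uint16_t)(1u << (n)))  /* bit for register r<n> */

/* Simplified stand-in for an eBPF instruction; NOT uBPF's struct ebpf_inst. */
enum ri_kind { RI_OTHER, RI_JA, RI_JCOND, RI_EXIT };
struct ri_inst {
    enum ri_kind kind;
    int16_t offset;    /* jump target is pc + 1 + offset */
    uint16_t reads;    /* registers read by the instruction */
    uint16_t writes;   /* registers written by the instruction */
};

int ri_successors(const struct ri_inst *insts, int pc, int succ[2])
{
    int n = 0;
    if (insts[pc].kind == RI_EXIT) return 0;
    if (insts[pc].kind != RI_JA) succ[n++] = pc + 1;                       /* fall-through */
    if (insts[pc].kind != RI_OTHER) succ[n++] = pc + 1 + insts[pc].offset; /* jump target */
    return n;
}

/* Returns the first pc that reads a register not initialized on every
 * path reaching it, -1 if there is none, -2 if a path leaves the program. */
int ri_check(const struct ri_inst *insts, int n, uint16_t entry)
{
    uint16_t in[RI_MAX];
    char reached[RI_MAX] = {0}, queued[RI_MAX] = {0};
    int work[RI_MAX], top = 0;

    assert(n > 0 && n <= RI_MAX);
    in[0] = entry;
    reached[0] = 1;
    queued[0] = 1;
    work[top++] = 0;
    while (top > 0) {
        int pc = work[--top];
        int succ[2];
        int nsucc = ri_successors(insts, pc, succ);
        uint16_t out = in[pc] | insts[pc].writes;
        queued[pc] = 0;
        for (int i = 0; i < nsucc; i++) {
            int s = succ[i];
            if (s < 0 || s >= n) return -2;
            /* Join: keep only registers initialized on all incoming paths. */
            uint16_t merged = reached[s] ? (uint16_t)(in[s] & out) : out;
            if (!reached[s] || merged != in[s]) {
                in[s] = merged;
                reached[s] = 1;
                if (!queued[s]) { queued[s] = 1; work[top++] = s; }
            }
        }
    }
    for (int pc = 0; pc < n; pc++)
        if (reached[pc] && (insts[pc].reads & ~in[pc])) return pc;
    return -1;
}

/* r2 is written on the fall-through path only, then read at pc 2. */
int ri_demo_one_branch(void)
{
    const struct ri_inst p[] = {
        {RI_JCOND, 1, RI_REG(1), 0},
        {RI_OTHER, 0, 0, RI_REG(2)},
        {RI_OTHER, 0, RI_REG(2), RI_REG(0)},
        {RI_EXIT, 0, RI_REG(0), 0},
    };
    return ri_check(p, 4, RI_REG(1) | RI_REG(10));
}

/* r2 is written on both paths before the read at pc 4. */
int ri_demo_both_branches(void)
{
    const struct ri_inst p[] = {
        {RI_JCOND, 2, RI_REG(1), 0},
        {RI_OTHER, 0, 0, RI_REG(2)},
        {RI_JA, 1, 0, 0},
        {RI_OTHER, 0, 0, RI_REG(2)},
        {RI_OTHER, 0, RI_REG(2), RI_REG(0)},
        {RI_EXIT, 0, RI_REG(0), 0},
    };
    return ri_check(p, 6, RI_REG(1) | RI_REG(10));
}
```

In this sketch `ri_demo_one_branch()` reports pc 2, because `r2` is only written on one of the two paths, while `ri_demo_both_branches()` reports no problem. The entry mask is passed in as a parameter so the sketch makes no claim about which registers uBPF initializes on entry.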

pchaigno

Thanks for tackling this! It's exciting 😃

vm/test.c

pchaigno · 2021-02-25T12:21:07Z

vm/ubpf_verifier.c

+        }
+        if (visited[next_pc] == 0) {
+            cmd = ubpf_walk_paths(vm, walk_fn, data, next_pc, visited);
+            if (cmd == UBPF_WALK_STOP || cmd == UBPF_WALK_INVALID)


I'd prefer an iterative algorithm, or at least, a tail recursive algorithm.

Agreed, this recursive approach also makes me slightly uncomfortable 😄
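For reference, an iterative walk could look roughly like the sketch below, which replaces recursion with an explicit stack. It assumes a simplified instruction model; `sk_inst`, `sk_walk`, and the 64-instruction cap are hypothetical names for illustration and are not the `ubpf_walk_paths` API from this PR.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define SK_MAX_INSTS 64  /* hypothetical cap so the stack can be a fixed array */

/* Simplified stand-in for an eBPF instruction; NOT uBPF's struct ebpf_inst. */
enum sk_kind { SK_OTHER, SK_JA, SK_JCOND, SK_EXIT };
struct sk_inst {
    enum sk_kind kind;
    int16_t offset;  /* jump target is pc + 1 + offset */
};

/* Marks visited[pc] = 1 for every instruction reachable from pc 0.
 * Returns false if some path jumps or falls outside the program. */
bool sk_walk(const struct sk_inst *insts, int n, char *visited)
{
    int stack[SK_MAX_INSTS];
    int top = 0;

    assert(n > 0 && n <= SK_MAX_INSTS);
    for (int i = 0; i < n; i++) visited[i] = 0;

    visited[0] = 1;
    stack[top++] = 0;
    while (top > 0) {
        int pc = stack[--top];
        int succ[2];
        int nsucc = 0;

        switch (insts[pc].kind) {
        case SK_EXIT:
            break;                                    /* no successors */
        case SK_JA:
            succ[nsucc++] = pc + 1 + insts[pc].offset;
            break;
        case SK_JCOND:
            succ[nsucc++] = pc + 1;                   /* fall-through */
            succ[nsucc++] = pc + 1 + insts[pc].offset;
            break;
        default:
            succ[nsucc++] = pc + 1;
            break;
        }
        for (int i = 0; i < nsucc; i++) {
            if (succ[i] < 0 || succ[i] >= n) return false;
            if (!visited[succ[i]]) {
                /* Mark on push so each pc is pushed at most once,
                 * which bounds the stack depth by n. */
                visited[succ[i]] = 1;
                stack[top++] = succ[i];
            }
        }
    }
    return true;
}

/* Program with one unreachable instruction after the exit. */
int sk_demo_dead_count(void)
{
    const struct sk_inst p[] = {
        {SK_JCOND, 1}, {SK_OTHER, 0}, {SK_EXIT, 0}, {SK_OTHER, 0},
    };
    char visited[4];
    int dead = 0;

    if (!sk_walk(p, 4, visited)) return -1;
    for (int i = 0; i < 4; i++) dead += !visited[i];
    return dead;
}
```

Because each instruction is marked when it is pushed, memory use is bounded by the program length rather than by path depth. Any instruction left unmarked afterwards is dead code, which is what `sk_demo_dead_count()` counts.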

pchaigno · 2021-02-25T14:51:15Z

vm/ubpf_verifier.c

+_walker_no_loops(struct ubpf_vm *vm, struct ebpf_inst inst, void *data, int inst_off, char *visited)
+{
+    if (isjmp(inst) && (inst_off+1+inst.offset < inst_off) && visited[inst_off+1+inst.offset]) {
+        fprintf(stderr, "Loop detected at offset %d\n", inst_off);


I don't think that's correct. Consider the following graph:

[Figure: control-flow graph; image not preserved]

When you follow the edge from 5 to 1, you may have already visited 1, but that doesn't mean there's a cycle. And in general, I don't think there's a relation between the position of the code and the presence of back-edges; the compiler can reorganize code blocks at will.

We should also have a couple test cases for this, with handwritten bytecode.

Good catch! To fix this, I suspect it would be best to construct a CFG so that we can reason about blocks that branch to a predecessor block, right? Then, to determine whether we have a loop, we check if we can get back to our current block by following each edge exiting it. Does this sound reasonable, and if so, should I start constructing a CFG?

Maybe it would be beneficial to describe the algorithm here before diving into its implementation? What did you have in mind?

Last time I had to implement an algorithm to find loops (for Oko, Apache2 license), I only needed to annotate the vertices and edges IIRC. It seems DPDK has a different approach which we may also use (BSD license AFAICT).

I was thinking that for each block, if you walk along each path leaving the block, then for any path you take, you'll end up either at a program terminator (exit) or back at the same block. As for how to do that efficiently (instead of doing a full walk for every block), we should probably borrow an existing approach.

Since it seems like you have a full copy of uBPF in Oko, how would you feel about me copying the relevant loop verifier bits from Oko? It would be great if you didn't need to maintain two copies of uBPF then 😄
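One common way to annotate vertices while searching for loops is a depth-first search with three colors, sketched below. This is a generic technique and not necessarily what Oko or DPDK implement. The instruction model and all names (`lp_inst`, `lp_has_loop`, the 64-instruction cap) are hypothetical. A loop is reported only when an edge reaches a vertex that is still on the current DFS path (grey). Reaching a vertex that was already fully explored (black) is not a cycle, even when the jump goes backward in the code.

```c
#include <assert.h>
#include <stdint.h>

#define LP_MAX 64  /* hypothetical program-size cap */

/* Simplified stand-in for an eBPF instruction; NOT uBPF's struct ebpf_inst. */
enum lp_kind { LP_OTHER, LP_JA, LP_JCOND, LP_EXIT };
struct lp_inst {
    enum lp_kind kind;
    int16_t offset;  /* jump target is pc + 1 + offset */
};
enum lp_color { LP_WHITE, LP_GREY, LP_BLACK };

int lp_successors(const struct lp_inst *insts, int pc, int succ[2])
{
    int n = 0;
    if (insts[pc].kind == LP_EXIT) return 0;
    if (insts[pc].kind != LP_JA) succ[n++] = pc + 1;                       /* fall-through */
    if (insts[pc].kind != LP_OTHER) succ[n++] = pc + 1 + insts[pc].offset; /* jump target */
    return n;
}

/* Returns 1 if a cycle is reachable from pc 0, 0 if not,
 * and -1 if some path leaves the program. */
int lp_has_loop(const struct lp_inst *insts, int n)
{
    enum lp_color color[LP_MAX];
    int next[LP_MAX];   /* next successor index to explore, per pc */
    int stack[LP_MAX];  /* current DFS path */
    int top = 0;

    assert(n > 0 && n <= LP_MAX);
    for (int i = 0; i < n; i++) { color[i] = LP_WHITE; next[i] = 0; }

    color[0] = LP_GREY;
    stack[top++] = 0;
    while (top > 0) {
        int pc = stack[top - 1];
        int succ[2];
        int nsucc = lp_successors(insts, pc, succ);

        if (next[pc] == nsucc) {   /* all edges explored: leave the path */
            color[pc] = LP_BLACK;
            top--;
            continue;
        }
        int to = succ[next[pc]++];
        if (to < 0 || to >= n) return -1;
        if (color[to] == LP_GREY) return 1;  /* back edge: target is on the path */
        if (color[to] == LP_WHITE) {
            color[to] = LP_GREY;
            stack[top++] = to;
        }
        /* LP_BLACK: already fully explored, so this edge closes no cycle. */
    }
    return 0;
}

/* Backward jump (pc 4 -> pc 1) to an already-visited instruction, no cycle. */
int lp_demo_backward_jump(void)
{
    const struct lp_inst p[] = {
        {LP_JCOND, 2}, {LP_OTHER, 0}, {LP_EXIT, 0}, {LP_OTHER, 0}, {LP_JA, -4},
    };
    return lp_has_loop(p, 5);
}

/* Conditional jump from pc 1 back to pc 0: a real loop. */
int lp_demo_real_loop(void)
{
    const struct lp_inst p[] = {
        {LP_OTHER, 0}, {LP_JCOND, -2}, {LP_EXIT, 0},
    };
    return lp_has_loop(p, 3);
}
```

The first demo is the kind of program a check based on "backward jump to a visited offset" would reject, although it contains no cycle. The second contains a genuine back edge. Both could serve as starting points for handwritten bytecode test cases.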

vm/ubpf_verifier.c

test_framework/test_verifier.py

pchaigno · 2021-02-27T16:16:39Z

test_framework/test_verifier.py

+    if 'asm' not in data and 'raw' not in data:
+        raise SkipTest("no asm or raw section in datafile")
+    if 'result' not in data and 'verifier error' not in data:
+        raise SkipTest("no result or verifier error section in datafile")


Shouldn't we also run on test cases that don't result in verifier errors, to check for false positives?

pchaigno · 2021-02-27T16:20:24Z

vm/ubpf_verifier.c

+            return true;
+        }
+    } else {
+        return false;


I would prefer to explicitly list all the cases that return false, and return true by default. We're less likely to have false negatives that way.

pchaigno · 2021-02-27T16:22:40Z

vm/ubpf_verifier.c

+        (cls == EBPF_CLS_JMP)) {
+        return false;
+    }
+    return true;



Alan-Jowett · 2022-10-03T18:20:30Z

Closing out stale PRs. Please re-open the PR if it's still being worked on.

jpsamaroo marked this pull request as draft October 28, 2020 00:03

jpsamaroo force-pushed the jps/verifier branch from 2e34572 to eed1861 Compare October 28, 2020 17:08

jpsamaroo marked this pull request as ready for review October 28, 2020 19:38

pchaigno requested changes Feb 27, 2021


jpsamaroo force-pushed the jps/verifier branch from ec7bdab to c877969 Compare February 27, 2021 17:18

jpsamaroo added 6 commits February 27, 2021 11:18

Add verifier passes

32328ee

Add dead instruction-detection pass Add loop-detection pass Add instruction walker utility

Use verifier for tests

c39bd69

Add uninit regs verifier pass

acb0bd4

Use -v for verifier

f3d87e7

Merge loop and dead inst verification

c877969

Slightly simplify test_verifier.py error printing

1327f35

Alan-Jowett closed this Oct 3, 2022


pchaigno Feb 25, 2021

jpsamaroo Feb 28, 2021

pchaigno Feb 25, 2021

jpsamaroo Feb 27, 2021

pchaigno Feb 28, 2021

jpsamaroo Feb 28, 2021

pchaigno Feb 27, 2021

pchaigno Feb 27, 2021

pchaigno Feb 27, 2021
