🐸 #36

lqd · 2018-05-18T21:06:22Z

This switches from using @frankmcsherry’s differential dataflow to
using @frankmcsherry’s bespoke datalog engine: datafrog.

99% of this commit was authored by the one and only @frankmcsherry.

@frankmcsherry

This switches from using @frankmcsherry’s differential dataflow to using @frankmcsherry’s bespoke datalog engine: datafrog. 99% of this commit was authored by the one and only @frankmcsherry.

5% perf for free

frankmcsherry · 2018-05-19T07:33:02Z

src/output/datafrog_opt.rs

+        let requires_1 = iteration.variable("requires_1");
+        let requires_2 = iteration.variable("requires_2");
+        let requires_bp = iteration.variable("requires_bp");
+        let requires_rp = iteration.variable("requires_rp");


I apologize for not noticing this before, but each of the requires_ variables can be variable_indistinct(). We just need requires itself (without a _) to be a variable() without _indistinct(). This has a solid perf benefit for me, taking the time from 9.43s to 7.99s.

Solid benefits for me as well, 10-15% 👍

Another 10-15% for free

The old code dropped `p` and renamed `q` to `p`, which confused me.

nikomatsakis · 2018-05-18T21:07:42Z

src/cli.rs

@@ -12,8 +12,8 @@ arg_enum! {
    #[derive(Debug, Clone, Copy)]
    pub enum Algorithm {
        Naive,
-        TimelyOpt,
-        LocationInsensitive,
+        DatafrogOpt,


nikomatsakis · 2018-05-19T09:07:15Z

src/output/naive.rs

+            subset.from_join(&subset_2, &region_live_at, |&(r2,q),&r1,&()| (r1,r2,q));
+        }
+
+        subset_r1p.complete()


Just to check my understanding here: it seems to me that the subset variable is not needed. We could just use subset_r1p instead? (That is, nothing directly joins from subset; the only use of it is to be mapped into different arrangements, and we could do those maps from subset_r1p just as easily, right?)

(cc @frankmcsherry)

Answer: yes, I checked elsewhere, and even got a slight win from doing so (in datafrog-opt).

nikomatsakis · 2018-05-19T09:12:17Z

src/output/naive.rs

+
+        // since we're using subset mapped ((r, p), r) we can use it directly out of iteration 1
+        let subset_r1p = iteration2.variable::<((Region, Point), Region)>("subset_r1p");
+        subset_r1p.insert(subset);


This "variable" is invariant, right? Do we really need to 'copy' the values from subset in here, or could we use subset directly?

(cc @frankmcsherry)

Well, let me answer my own question: it won't build if we don't, because from_join wants a Variable and not a Relation: presumably that could be modified in datafrog? (Or is this insert very cheap anyway?)

(I realize in this specific case we ought to collapse the iteration anyway.)

(this is also relevant, it seems to the "redundantly computed index" entries we see below)

Inserting a sorted relation should be pretty cheap. It will still do linear work, confirming that none of subset exists in the empty list of prior tuples, which we could totally optimize. But it shouldn't be too expensive either way.

More generally, there a bunch of cases where the difference between Variable and Relation and between Key and (Key, ()) bite us a little. These can be picked off as appropriate; just some repetition in the library code in the interest of squeaking out some perf. :)

Oh, my mistake. It will not even do the linear work, I think, on account of self.stable should be empty. I could be wrong about that, and we could work through exactly what happens when you install a relation. Not a great amount of care was used putting the wrappers in place. :)

nikomatsakis · 2018-05-19T09:51:35Z

Final measurements:

Version	Workers	Polonius compile time	clap analysis time
⏲️	1	210s	14.191s
⏲️	2	210s	11.035s
⏲️	4	210s	8.898s
⏲️	8	210s	7.953s
🐸	1	11s	6.843s

Amazing! Great work @frankmcsherry and @lqd!

lqd added 2 commits May 18, 2018 23:02

Transition from differential dataflow to datafrog.

d7b55ca

This switches from using @frankmcsherry’s differential dataflow to using @frankmcsherry’s bespoke datalog engine: datafrog. 99% of this commit was authored by the one and only @frankmcsherry.

Remove cfg copy

8691b60

5% perf for free

frankmcsherry reviewed May 19, 2018

View reviewed changes

lqd and others added 5 commits May 19, 2018 11:06

More indistincts

f745d87

Another 10-15% for free

use q consistently instead of p

1f36ada

The old code dropped `p` and renamed `q` to `p`, which confused me.

remove inaccurate FIXME and rename variable

d6ac3a6

remove an intermediate variable for a 5% win

23e4f6b

add comment about dead_can_reach

0f42e30

nikomatsakis approved these changes May 19, 2018

View reviewed changes

nikomatsakis added 2 commits May 19, 2018 05:47

run cargo fmt

c5762fd

since we have a binary, actually, add Cargo.lock

3927471

nikomatsakis merged commit 3b985d0 into rust-lang:master May 19, 2018

lqd deleted the introducing-datafrog branch May 19, 2018 10:40

kennytm mentioned this pull request May 21, 2018

Use different datastructure for MIRI relocations rust-lang/rust#50866

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐸 #36

🐸 #36

lqd commented May 18, 2018

frankmcsherry May 19, 2018

lqd May 19, 2018

nikomatsakis May 18, 2018

nikomatsakis May 19, 2018

nikomatsakis May 19, 2018

nikomatsakis May 19, 2018

nikomatsakis May 19, 2018

nikomatsakis May 19, 2018

nikomatsakis May 19, 2018

frankmcsherry May 19, 2018

frankmcsherry May 19, 2018

nikomatsakis commented May 19, 2018 •

edited

🐸 #36

🐸 #36

Conversation

lqd commented May 18, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nikomatsakis commented May 19, 2018 • edited

nikomatsakis commented May 19, 2018 •

edited