perf: keep directly the intersection of derivations in Memory #33

mpizenberg · 2020-10-11T21:57:39Z

Memory stores a Vec of derivations per package. If we are to evaluate potential packages in a smarter way #32 , we need the intersection of derivation terms.

There are two methods in Memory that can return iterators to individual assignment terms, terms_for_package and potential_packages. The latter is the use case we are discussing for performance improvements in #32 . And the former is to compute relations in incompatibilities that eventually ends up computing the intersection of those terms. Considering that we seem to spend much time in relation computation #27 , this could be a non-negligible speedup.

The text was updated successfully, but these errors were encountered:

Eh2406 · 2020-10-12T03:04:47Z

One thing I was wondering about is how this works with backtracking. If we backtrack to before a decision that introduced that derivation then does that mean that some terms need to come out of that intersection? If so how do we do that? Even if this is a problem, it is not insurmountable, we can keep the Vec of derivations and a pre computed intersection. In the case of backtracking, the vec gets smaller and the intersection needs to be recomputed.

mpizenberg · 2020-10-12T08:20:19Z

That's a very relevant question! Backtracking is currently operated thanks to history which is a chronological vec of all the assignments. Once we backtrack to a given level, we iterate on assignments with a lower level and rebuild the memory.

Conflict resolution is not common I think in a normal setting since newer versions tend to be compatible with older ones, and we choose newer versions by default. Also, once we've found the root cause for a conflict, I don't see a reason we should have another conflict soon. ~~But backtracking happens once per prior cause until the root cause is found so it could typically happen multiple times until we've found the root cause~~. <- woops, that is wrong, there is 1 backtrack per conflict resolution

It's probably worth having two things for memory derivations, a precomputed intersection, and terms that have not been intersected yet in a complementary vec. Something like this in the Memory.assignments.

struct PackageAssignments<V: Version> {
    decision: Option<(V, Term<V>)>,
    derivations_intersected: Term,
    derivations_not_intersected_yet: Vec<Term<V>>,
}

The Memory method terms_for_package(&self, p: &P) would become terms_intersection_for_package(&mut self, p: &P). It should empty the derivations_not_intersected_yet vec into the derivations_intersected term and return that. There should also be changes in the potential_packages.
This would provide both the benefit of never computing that intersection more than once, and not computing the intersection if it's not required yet (for backtracking for example).

Slightly unrelated, but related performance-wise, I couldn't figure how to mutate in place the history and memory and so had to clone the whole history which is quite bad. Is this another case where interior mutability is needed? is there a better way to do that?

Eh2406 · 2020-10-12T14:34:22Z

std::mem::take(&mut self.history) would be the escape hatch that will work there, when that clone becomes significant. (it did not change the runtime of the current benchmarks.)

mpizenberg · 2020-10-12T14:46:57Z

I've had a try at this today. I chose better-heuristic (smart package choice) as a base and ran benchmark large_case_1, then cherry-picked the improvements from dont-readd-deps and finally implemented the improvement discussed here on top. The timings I got for large_case_1 on my computer are the following:

better-heuristic: 160 ms / iter
+ dont-readd-deps: 115 ms / iter
+ this: 60 ms / iter

So on the large_case_1 benchmark, this looks like a 2x speedup. It is not cleaned up (few things are not needed anymore) but the commit is here (f803428) in branch precompute_intersections.

mpizenberg · 2020-10-12T15:01:57Z

And on large_case_2 im seeing:

better-heuristic: 312 ms / iter
+ dont-readd-deps: 220 ms / iter
+ this: 122 ms / iter

Eh2406 · 2020-10-12T15:24:38Z

On my computer the improvements are similarly grate! I look forward to that getting polished into a PR!

Eh2406 · 2020-10-12T15:52:12Z

I was excited, so pushed a commit to your branch with some cleaned ups. If that was rude I am sorry and ignore the commit.

mpizenberg · 2020-10-12T15:56:54Z

ahah don't worry, I didn't even plan to make a PR of this soon. I used the commits for the two new benchmarks, which definitely shouldn't end in Git (we'll have to re-assess that git LFS situation, but that's another subject). I see that branch more as exploration stuff. Feel free to fill it with all the ideas and cleanup you want ^^

mpizenberg · 2020-10-17T21:40:01Z

Done in #37 which is on its way to being merged.

mpizenberg changed the title ~~Optimization: keep directly the intersection of derivations in Memory~~ perf: keep directly the intersection of derivations in Memory Oct 12, 2020

This was referenced Oct 14, 2020

Benchmark v2 #34

Closed

perf: precompute intersections (2x speedup) #37

Merged

mpizenberg closed this as completed Oct 17, 2020

mpizenberg mentioned this issue May 1, 2021

perf: remove not_intersected_yet #87

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: keep directly the intersection of derivations in Memory #33

perf: keep directly the intersection of derivations in Memory #33

mpizenberg commented Oct 11, 2020 •

edited

Loading

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020 •

edited

Loading

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020 •

edited

Loading

mpizenberg commented Oct 12, 2020 •

edited

Loading

Eh2406 commented Oct 12, 2020

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020

mpizenberg commented Oct 17, 2020

perf: keep directly the intersection of derivations in Memory #33

perf: keep directly the intersection of derivations in Memory #33

Comments

mpizenberg commented Oct 11, 2020 • edited Loading

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020 • edited Loading

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020 • edited Loading

mpizenberg commented Oct 12, 2020 • edited Loading

Eh2406 commented Oct 12, 2020

Eh2406 commented Oct 12, 2020

mpizenberg commented Oct 12, 2020

mpizenberg commented Oct 17, 2020

mpizenberg commented Oct 11, 2020 •

edited

Loading

mpizenberg commented Oct 12, 2020 •

edited

Loading

mpizenberg commented Oct 12, 2020 •

edited

Loading

mpizenberg commented Oct 12, 2020 •

edited

Loading