Vastly speed up ancestry check by not re-crawling duplicate histories. #4753
Overview
Noticed when doing fast-forward detection in PG that base only has ~500 history nodes at the top level, so I was curious why it was infeasible to do full ancestor detection; turns out that with `UNION ALL` instead of `UNION` we don't cull duplicates, meaning every time we would merge it'd crawl the entire history again 🙃.

It's unintuitive, because you'd think `UNION ALL` is correct since the root causal shouldn't ever appear in the recursive tail, but it appears that's just not how UNIONs work in a `WITH RECURSIVE` CTE 🤷🏼‍♂️

Should speed up merges since this affects `lca`, but we can also make `lca` much faster; I split that off into its own PR since it's a bigger change that I don't have time to test thoroughly right now: #4754
Implementation notes
`UNION ALL` -> `UNION`; speeds this up by infinity percent (I never waited for it to finish before, now it finishes instantly).
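To see why this one-word change matters so much, here's a minimal, self-contained sketch (Python + sqlite3, not the actual Unison schema or query): in a `WITH RECURSIVE` CTE, `UNION` deduplicates each new row against everything produced so far, so each ancestor is visited once; `UNION ALL` visits each ancestor once *per path*, which blows up exponentially in a history full of merge "diamonds". The `causal_parent` table and column names here are hypothetical.

```python
import sqlite3

def count_ancestor_rows(compound_op: str, diamonds: int) -> int:
    """Crawl all ancestors of node 0 with a recursive CTE and return how
    many rows the traversal produced (a proxy for work done)."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE causal_parent (causal_id INTEGER, parent_id INTEGER)")
    # Build a chain of "diamonds": each merge node has two parents that
    # share a single grandparent, so the number of *paths* to the root
    # doubles per diamond while the node count grows only linearly.
    node, edges = 0, []
    for _ in range(diamonds):
        a, b, grandparent = node + 1, node + 2, node + 3
        edges += [(node, a), (node, b), (a, grandparent), (b, grandparent)]
        node = grandparent
    db.executemany("INSERT INTO causal_parent VALUES (?, ?)", edges)
    (n,) = db.execute(f"""
        WITH RECURSIVE ancestor(id) AS (
            SELECT 0
            {compound_op}
            SELECT parent_id FROM causal_parent, ancestor
             WHERE causal_id = ancestor.id
        )
        SELECT count(*) FROM ancestor
    """).fetchone()
    return n

print(count_ancestor_rows("UNION", 15))      # → 46 (one row per distinct node)
print(count_ancestor_rows("UNION ALL", 15))  # → 131069 (one row per path)
```

With only 46 nodes, `UNION ALL` already does ~131k rows of work; every extra diamond doubles it, which matches "crawl the entire history again on every merge".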
Test coverage
Did some SQLite tests and it should work.
Loose ends
While digging around I noticed that in https://github.com/unisonweb/unison/blob/trunk/parser-typechecker/src/Unison/Codebase/Causal.hs there are many functions computing predecessors and 'before' checks on in-memory Haskell objects; these would be easier on memory and probably much faster if they just used SQLite directly. Probably something for @mitchellwrosen and @tstat to look at as part of the merge rewrite 😄
See #4754