Parallel pull from Share #3153

mitchellwrosen · 2022-06-24T16:30:16Z

Depends on Use Values literal in elaborateHashes #3155

Overview

This PR implements a concurrent pull procedure involving a few different threads:

One thread that inserts entities fetched from Share into sqlite. Because writing to sqlite requires a lock, we can't do better than that.
One thread that elaborates subsets of those hashes and adds the elaborated hashes to the work queue (the set of hashes we need to download).
One thread to coordinate pulling batches of work off of the work queue and spawning one-shot workers, and deciding when there's no more work left to do.

…rter/faster

ChrisPenner · 2022-06-29T19:58:20Z

unison-cli/src/Unison/Share/Sync.hs

+
+  workerCount <- newWorkerCount
+
+  Ki.scoped \scope -> do


This works really nicely 👍🏼 Good work on this lib

ChrisPenner · 2022-06-29T20:16:56Z

unison-cli/src/Unison/Share/Sync.hs

+    elaborator hashesVar uninsertedHashesVar newTempEntitiesQueue workerCount =
+      connect \conn ->
+        forever do
+            (join . atomically) do


Is there a benefit to the (join . atomically) do pattern here over something like:

forever do mayNewTempEntities <- atomically $ do ... case mayNewTempEntities of Nothing -> ... Just _ -> ...

?

I find interleaving STM and IO makes it tougher to see what's being accomplished transactionally 😅

Sure, we can switch it to not do the join . atomically thing. I think instead of a Maybe (Set Hash) (or w/e) I'll go with a one-off data type, so it's more clear what going down one branch versus another means.

also reduce number of downloaders from 10 to 5, because 5 has performed better in a few tests

tstat and others added 10 commits June 21, 2022 16:38

Report correct number of downloaded entities

3647993

begin implementing concurrent pull

7baebea

implement concurrent pull

ad60aa4

break up completeTempEntities a bit so it's easier to understand

7bc7c3d

more cleanup of concurrent pull code

47bd01b

Add Values literal to SQLite, use it to make elaborateHashes much sma…

8b4bf8e

…rter/faster

Fix bug in uninsertedHashesVar

7400238

update ki hash

261c6b3

stack.yaml.lock changes

73419dd

⅄ trunk → travis/pull-conc

6ea42db

mitchellwrosen mentioned this pull request Jun 28, 2022

Parallel push to share #3164

Merged

1 task

ChrisPenner reviewed Jun 29, 2022

View reviewed changes

ChrisPenner approved these changes Jun 29, 2022

View reviewed changes

address PR feedback and do some minor cleanup

66cfab2

also reduce number of downloaders from 10 to 5, because 5 has performed better in a few tests

mitchellwrosen marked this pull request as ready for review June 30, 2022 02:43

mitchellwrosen added the ready-to-merge Apply this to a PR and it will get merged automatically once CI passes and 1 reviewer has approved label Jun 30, 2022

mergify bot merged commit 8baee64 into trunk Jun 30, 2022

mergify bot deleted the travis/pull-conc branch June 30, 2022 03:40

mergify bot removed the ready-to-merge Apply this to a PR and it will get merged automatically once CI passes and 1 reviewer has approved label Jun 30, 2022

pchiusano mentioned this pull request Jul 8, 2022

M4 Release Notes (DRAFT) #3209

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel pull from Share #3153

Parallel pull from Share #3153

mitchellwrosen commented Jun 24, 2022 •

edited

ChrisPenner Jun 29, 2022

mitchellwrosen Jun 30, 2022

ChrisPenner Jun 29, 2022

mitchellwrosen Jun 30, 2022

Parallel pull from Share #3153

Parallel pull from Share #3153

Conversation

mitchellwrosen commented Jun 24, 2022 • edited

Overview

ChrisPenner Jun 29, 2022

Choose a reason for hiding this comment

mitchellwrosen Jun 30, 2022

Choose a reason for hiding this comment

ChrisPenner Jun 29, 2022

Choose a reason for hiding this comment

mitchellwrosen Jun 30, 2022

Choose a reason for hiding this comment

mitchellwrosen commented Jun 24, 2022 •

edited