Switch back to using one threadpool #7898

illicitonion · 2019-06-19T15:42:25Z

This reverts #7848 and fixes the performance issue found in it.

The first commit is just a revert (with a few manual merge cleanups); the second commit fixes the performance issue.

This reverts commit 0d9d214.

The previous method ended up reading lots of small byte buffers, and then joining them quadratically. This just fills a buffer.
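To illustrate the difference described above, here is a minimal sketch using only the standard library. It is not the code from this PR: `read_by_joining` and `read_by_filling` are hypothetical names, and the chunked loop is an assumed stand-in for "reading lots of small byte buffers, and then joining them".

```rust
use std::io::{self, Read};

/// Hypothetical illustration of the quadratic pattern: each small chunk is
/// joined by allocating a new Vec and copying everything read so far into it.
fn read_by_joining<R: Read>(mut reader: R, chunk_size: usize) -> io::Result<Vec<u8>> {
    let mut joined: Vec<u8> = Vec::new();
    let mut chunk = vec![0u8; chunk_size];
    loop {
        let n = reader.read(&mut chunk)?;
        if n == 0 {
            return Ok(joined);
        }
        // All previously read bytes are copied again on every iteration,
        // so the total work grows quadratically with the file size.
        let mut next = Vec::with_capacity(joined.len() + n);
        next.extend_from_slice(&joined);
        next.extend_from_slice(&chunk[..n]);
        joined = next;
    }
}

/// Hypothetical illustration of "just fills a buffer": bytes are appended
/// to one growable buffer, so each byte is copied an amortized constant
/// number of times.
fn read_by_filling<R: Read>(mut reader: R) -> io::Result<Vec<u8>> {
    let mut buffer = Vec::new();
    reader.read_to_end(&mut buffer)?;
    Ok(buffer)
}
```

Both functions return the same bytes for the same input; only the amount of copying differs.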

ity

looks great - thank you for coming back to it!

As per https://docs.rs/tokio-threadpool/0.1.15/tokio_threadpool/fn.blocking.html, when you use a blocking block in a task, it blocks the _entire_ task, not just the part you mark as blocking. This means that when one giant join_all calls read_dir on a bunch of directories, the reads are serialized and only one directory is read at a time. This change introduces a new logging::Executor (a slightly weird place for it, but it depends on the logging stuff, so...) which mirrors the tokio::Runtime and futures::sync::oneshot::spawn APIs to conveniently allow blocking Futures to be spawned in their own tasks, so that they don't hinder parallelism within a task.
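The idea of giving each blocking operation its own unit of execution can be sketched with the standard library alone. This is an assumption-laden analogy, not the Executor from this change: plain threads stand in for tokio tasks, an `mpsc` channel stands in for the oneshot handle, and `spawn_blocking` and `run_all_in_parallel` are hypothetical names.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical stand-in for spawning a blocking closure in its own task:
/// the work runs on a dedicated thread and the result arrives on a channel.
fn spawn_blocking<T, F>(work: F) -> mpsc::Receiver<T>
where
    T: Send + 'static,
    F: FnOnce() -> T + Send + 'static,
{
    let (sender, receiver) = mpsc::channel();
    thread::spawn(move || {
        // The receiver may already have been dropped; ignore that error.
        let _ = sender.send(work());
    });
    receiver
}

/// Starts every blocking job before waiting on any of them, so one slow job
/// does not hold up the others. Results are returned in input order.
fn run_all_in_parallel<T, F>(jobs: Vec<F>) -> Vec<T>
where
    T: Send + 'static,
    F: FnOnce() -> T + Send + 'static,
{
    let receivers: Vec<mpsc::Receiver<T>> =
        jobs.into_iter().map(|job| spawn_blocking(job)).collect();
    receivers
        .into_iter()
        .map(|receiver| receiver.recv().expect("worker thread panicked"))
        .collect()
}
```

By contrast, running the same closures one after another inside a single loop would mirror the serialized behaviour described above, where only one directory is read at a time.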

tokio-fs has some weird semantics whereby using blocking in a larger task (e.g. mapping stats from the result of a readdir) forces the work to not be done in parallel. I'm pretty sure we _can_ make tokio-fs do what we want efficiently, but it's taking longer than I'd hoped, so I'm re-introducing the separate io threadpool so we don't have a performance regression while I investigate. This reverts a small portion of pantsbuild#7898 manually.

* Introduce task_executor

  This is a single object we can pass around to allow all sorts of future running to happen, with logging happening properly, rather than needing to pass around different threadpools and manually sort out logging.
* Core has an Executor not a Runtime
* Scandir runs on io pool
* PosixFS uses IO pool
* ShardedLMDB uses io pool
* Calculate fingerprint on io pool
* Add TODO to move Executor some time
* Add docstring to spawn_on_io_pool
* Move Executor to its own crate
* fmt

illicitonion added 2 commits June 19, 2019 15:56

Revert "Re-instate PosixFS Threadpool (pantsbuild#7848)"

c253c42

This reverts commit 0d9d214.

Use efficient file reading

4dacc42

The previous method ended up reading lots of small byte buffers, and then joining them quadratically. This just fills a buffer.

illicitonion requested review from stuhood, ity and blorente June 19, 2019 15:42

fmt

2d98b19

ity approved these changes Jun 19, 2019


illicitonion mentioned this pull request Jun 19, 2019

Prefactor: Extract store and sharded_lmdb into their own crates #7904

Merged

illicitonion merged commit fa50e6e into pantsbuild:master Jun 19, 2019

illicitonion deleted the dwagnerhall/tokio/unrevert-clean branch June 19, 2019 23:32

illicitonion mentioned this pull request Jul 5, 2019

Switch to use IO pool instead of tokio-fs #8017

Closed

illicitonion mentioned this pull request Jul 9, 2019

Fix performance regression introduced by #7898 #8006

Merged
