
Reduce blocking operations in eventloop thread #1342

Merged
merged 19 commits into master from 4068-reduce-cpu-in-eventloop-thread
Oct 20, 2023

Conversation

itamarst
Collaborator

This is likely only a subset, but it's a start.

Fixes https://tahoe-lafs.org/trac/tahoe-lafs/ticket/4068, will probably file follow-up for round 2.

@coveralls
Collaborator

coveralls commented Oct 16, 2023

Coverage Status

Coverage: 94.627%. First build when pulling 20cfe70 on 4068-reduce-cpu-in-eventloop-thread into a08a622 on master.

@exarkun exarkun self-assigned this Oct 18, 2023
Member

@exarkun exarkun left a comment


Thanks. It looks like many of these implementation changes should improve responsiveness during bulk operations. Maybe some of them will even improve throughput for concurrent bulk operations? I didn't try running the benchmarks and comparing against master. Did you try that? Is there a responsiveness benchmark?

Overall the new global state in the implementation is a bit bothersome. I can think of extensive refactorings that would let us avoid that but I'm not sure how realistic those are. Maybe we could try to brainstorm together and see if we can come up with something feasible that avoids it?

Review thread on src/allmydata/client.py (outdated, resolved)
@@ -53,9 +53,9 @@ def encode(self, inshares, desired_share_ids=None):

for inshare in inshares:
assert len(inshare) == self.share_size, (len(inshare), self.share_size, self.data_size, self.required_shares)
shares = self.encoder.encode(inshares, desired_share_ids)
Member

As I understand the code, the Python ZFEC bindings don't release the GIL (which is a good thing, since until very recently the C library initialization was not thread-safe - and now requires an explicit single-threaded initialization step). Were you able to observe any throughput increase with these encode/decode changes?

Collaborator Author

I mostly just put stuff in threads that the blocking-detector code flagged. I am happy to go and make ZFEC release the GIL as a relevant follow-up.
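The pattern the PR applies in these spots can be sketched with the stdlib alone (the pool name and `block_hash` helper here are illustrative stand-ins, not the actual `allmydata.util.cputhreadpool` code):

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

# Stand-in for the shared CPU thread pool the PR introduces.
_pool = ThreadPoolExecutor(max_workers=4)

def block_hash(data: bytes) -> bytes:
    # CPU-bound work that would otherwise run on the event loop thread.
    return hashlib.sha256(data).digest()

# Instead of calling block_hash(data) inline on the reactor thread,
# submit it to the pool and collect the result.
future = _pool.submit(block_hash, b"salt" + b"block-data")
digest = future.result()
assert len(digest) == 32
```

Note that for pure-Python or GIL-holding C work this only improves event loop responsiveness, not throughput; throughput gains require the C extension (here, ZFEC) to actually release the GIL.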

Review thread on src/allmydata/mutable/publish.py (outdated, resolved)
@@ -767,9 +769,9 @@ def _validate_block(self, results, segnum, reader, server, started):
                 "block hash tree failure: %s" % e)

         if self._version == MDMF_VERSION:
-            blockhash = hashutil.block_hash(salt + block)
+            blockhash = await defer_to_thread(hashutil.block_hash, salt + block)
Member

Shame about that salt + block, that's presumably a lot of time spent allocating and copying and freeing in the main thread. Possibly a spot for future improvement?

Collaborator Author

Copying memory is typically quite fast, but maybe.
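A quick way to check that claim is to time the concatenation itself (a rough sketch; the 128 KiB size is an assumption about typical block sizes, and absolute numbers depend on hardware and CPython version):

```python
import timeit

# Approximate the `salt + block` allocation-and-copy cost.
salt = b"s" * 16
block = b"b" * (128 * 1024)  # assumed block size, for illustration only

n = 10_000
per_call = timeit.timeit(lambda: salt + block, number=n) / n
print(f"{per_call * 1e6:.1f} microseconds per copy")  # usually well under a millisecond
```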

Review thread on src/allmydata/storage/http_client.py (resolved)
Review thread on src/allmydata/util/cputhreadpool.py (outdated, resolved)
Review thread on src/allmydata/util/cputhreadpool.py (resolved)
@itamarst
Collaborator Author

Here is my thought on the reactor as state: it shouldn't be an attribute or parameter, it should be accessible via a function get_reactor() that internally retrieves it from a contextvars.ContextVar. And this should really be in Twisted itself...
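A minimal sketch of that idea (the names `_reactor` and `get_reactor()` are hypothetical; neither is an existing Twisted API):

```python
from contextvars import ContextVar

# Module-level ContextVar holding the reactor for the current context.
_reactor: ContextVar = ContextVar("reactor")

def get_reactor():
    """Return the reactor for the current context."""
    return _reactor.get()

class FakeReactor:
    """Stand-in for a real Twisted reactor object."""

_reactor.set(FakeReactor())
assert isinstance(get_reactor(), FakeReactor)
```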

@itamarst
Collaborator Author

I just looked, and asyncio does something similar with a thread-local. ContextVar is a bit nicer in that you can override it for the current context in a stack-based way, which is useful for tests, but it's the same basic idea.

@meejah
Contributor

meejah commented Oct 19, 2023

"In general" trio seems to have better-thought-out async ideas (vs. asyncio) -- although I don't know what it does about the reactor.

The asyncio things I have looked at unfortunately have problems similar to many Twisted-internal (and third-party) libraries when dealing with the global / implicit reactor (a "thread-local context var" still seems global-adjacent to me? but maybe I don't understand ContextVar well enough)

@itamarst
Collaborator Author

  1. Decided to punt on the reactor; just leave the API as is.
  2. For the testing flag, the worry with swapping in a synchronous version is that you don't immediately await the value, so a synchronous test doesn't catch bad parallelism. Switching the defer_to_thread() API to a coroutine makes this less likely: you get a warning if you don't await it, and it's not a thing you pass around, unlike a Deferred.
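A minimal stdlib sketch of that coroutine-shaped API (the pool and signature are illustrative assumptions, not the actual cputhreadpool implementation): because `defer_to_thread` is a coroutine, forgetting to `await` it raises a "coroutine was never awaited" RuntimeWarning rather than silently handing back an unfired Deferred.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor

# Stand-in for the shared CPU thread pool.
_POOL = ThreadPoolExecutor(max_workers=4)

async def defer_to_thread(func, *args):
    """Run func(*args) in the CPU pool and await its result."""
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(_POOL, func, *args)

async def main():
    # Must be awaited; dropping the await triggers a RuntimeWarning.
    return await defer_to_thread(sum, range(1000))

print(asyncio.run(main()))  # 499500
```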

@itamarst itamarst requested a review from exarkun October 19, 2023 18:06
Member

@exarkun exarkun left a comment


Thanks. Looks good to me.

@itamarst itamarst merged commit 4fbf31b into master Oct 20, 2023
30 checks passed
@itamarst itamarst deleted the 4068-reduce-cpu-in-eventloop-thread branch October 20, 2023 14:48