
fix(server): Adjust batching behavior to reduce network latency on MULTI blocks #1777

Merged 2 commits into main on Sep 4, 2023

Conversation

@royjacobson (Contributor) commented on Aug 31, 2023:

  1. Add a Yield() call before executing the last command in the async queue when needed.
  2. Allow the receive buffer to grow when needed.
  3. Improve debugging logs for batching behavior.

The whole 'triggering the bug' process looks like this:

  1. We process a large packet with multiple short commands inside a MULTI block.
  2. We read the commands into a relatively small buffer (size 256).
  3. The ParseRedis loop reads commands from the first part of the packet and pushes them into the dispatch queue.
  4. The dispatch fiber processes all the commands in the dispatch queue quickly and without preemption, because it's inside a MULTI block and all commands return immediately. After emptying the queue, it calls SetBatchMode(false), which causes the response buffer to be flushed after the next command is executed.
  5. The rest of the commands are parsed and executed, but the second (small) response packet is not sent until the client ACKs the first one, because of Nagle's algorithm.
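
To make the fix concrete, below is a minimal, self-contained sketch of the idea behind change (1). It is not the actual Dragonfly code: `Command`, `ExecuteCommand()`, the bare `std::deque`, and the `std::this_thread::yield()` stand-in are assumptions for illustration; the real dispatch loop runs in a helio fiber and yields with `ThisFiber::Yield()`, as in the snippet discussed in the review below.

```cpp
#include <deque>
#include <iostream>
#include <string>
#include <thread>

struct Command {
  std::string payload;
};

static std::deque<Command> dispatch_q_;

// Stand-in for real command execution; in Dragonfly the reply is appended to a
// batched response buffer while batch mode is in effect.
static void ExecuteCommand(const Command& cmd) {
  std::cout << "executing: " << cmd.payload << "\n";
}

static void DispatchLoop() {
  bool queue_was_larger_than_1 = false;
  while (!dispatch_q_.empty()) {
    queue_was_larger_than_1 |= dispatch_q_.size() > 1;

    // Before executing the *last* queued command, yield once so the producer
    // (parser) fiber gets a chance to push the commands it parsed from the rest
    // of the packet. Without this, batch mode is switched off too early, the
    // next reply goes out as a separate small TCP segment, and Nagle's algorithm
    // holds that segment back until the client ACKs the previous one.
    if (dispatch_q_.size() == 1 && queue_was_larger_than_1) {
      std::this_thread::yield();  // the real code yields the fiber instead
    }

    Command cmd = std::move(dispatch_q_.front());
    dispatch_q_.pop_front();
    ExecuteCommand(cmd);
  }
}

int main() {
  for (int i = 0; i < 4; ++i)
    dispatch_q_.push_back({"SET k" + std::to_string(i) + " v"});
  DispatchLoop();
}
```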

Close #1285

@romange (Collaborator) commented on Aug 31, 2023:

A small correction: SetBatchMode does not flush anything by itself, but the subsequent reply will indeed trigger the flush.

Can you please add here how you reproduced the scenario with increased latency?

@royjacobson (Contributor, Author) commented:

I reproduced it by running the benchmark provided in the issue (`addr=redis://10.0.101.178:6378/7 go test -count=1 -v ./pkg/meta/... -run=TestDgfAndRedis`).
The latency bug manifests after running it twice. (Probably due to some offset randomness or something?)

Comment on lines 804 to 807
// As a small optimization, if the queue was never larger than 1, skip the Yield() call.
if (dispatch_q_.size() == 1 && queue_was_larger_than_1) {
ThisFiber::Yield();
}
Contributor:

With `memtier_benchmark --pipeline=2` you'll yield a lot 🤔 But it's still more effective, right?

Contributor:

It would also be more efficient to yield only if we didn't yield on the previous iteration (i.e., the transaction was run with inline scheduling).

Collaborator:

Yes, we need helio machinery for this. Moreover, the io_uring API allows learning whether there is more data pending in the network buffer after the last Recv call (IORING_CQE_F_SOCK_NONEMPTY, available since kernel 5.19). I do not think I implemented support for this.
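
For reference (not part of this PR), here is a rough liburing sketch of how that flag can be inspected after a recv completion. `sockfd` is assumed to be an already-connected socket, the ring is assumed to be initialized elsewhere, and error handling is omitted.

```cpp
#include <liburing.h>
#include <stdio.h>

// Returns true if the kernel reports that the socket still has unread data after
// this recv completes (IORING_CQE_F_SOCK_NONEMPTY, kernel >= 5.19). That answers
// the "is there more pending after the last Recv" question without an extra syscall.
bool RecvAndCheckPending(io_uring* ring, int sockfd, char* buf, size_t len) {
  io_uring_sqe* sqe = io_uring_get_sqe(ring);
  io_uring_prep_recv(sqe, sockfd, buf, len, /*flags=*/0);
  io_uring_submit(ring);

  io_uring_cqe* cqe;
  io_uring_wait_cqe(ring, &cqe);

  int received = cqe->res;  // bytes read, or -errno on failure
  bool more_pending = (cqe->flags & IORING_CQE_F_SOCK_NONEMPTY) != 0;
  io_uring_cqe_seen(ring, cqe);

  printf("received %d bytes, socket %s\n", received,
         more_pending ? "still has data" : "is drained");
  return more_pending;
}
```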

Contributor (Author):

It does trigger quite a few more Yields; I'm not sure what the performance impact is.

Contributor (Author):

Updated to use the new helio epoch interface. I looked at the logs with memtier_benchmark and the JuiceFS benchmark, and it looks quite good: roughly 1 yield every 10 commands.
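
Roughly, the epoch-based heuristic can be pictured like the sketch below. `CurrentFiberEpoch()` and `YieldFiber()` are hypothetical stand-ins (the PR relies on helio's interface for this), stubbed out here only so the snippet compiles; the point is the "yield before the last command only if we did not preempt recently" check.

```cpp
#include <cstdint>
#include <deque>

struct Command {};
void ExecuteCommand(const Command&) {}

// Hypothetical stand-ins: a real fiber runtime bumps a per-thread counter on
// every fiber switch; yielding is one way such a switch happens.
static uint64_t g_epoch = 0;
uint64_t CurrentFiberEpoch() { return g_epoch; }
void YieldFiber() { ++g_epoch; }

void DispatchLoop(std::deque<Command>& dispatch_q) {
  uint64_t last_epoch = CurrentFiberEpoch();
  while (!dispatch_q.empty()) {
    // Yield before the last queued command only if no fiber switch happened
    // since the previous iteration (i.e. the previous command ran inline,
    // without preemption). If we already preempted, the producer fiber had its
    // chance to push more commands, so another yield would only add overhead.
    bool preempted_recently = CurrentFiberEpoch() != last_epoch;
    if (dispatch_q.size() == 1 && !preempted_recently) {
      YieldFiber();
    }
    last_epoch = CurrentFiberEpoch();

    Command cmd = dispatch_q.front();
    dispatch_q.pop_front();
    ExecuteCommand(cmd);
  }
}
```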

// command in the queue and let the producer fiber push more commands if it wants
// to.
// As a small optimization, if the queue was never larger than 1, skip the Yield() call.
if (dispatch_q_.size() == 1 && queue_was_larger_than_1) {
Collaborator:

Can it be that we wake up on a non-empty queue and it does not have size > 1? I thought we only push to dispatch_q if we have more than 1 request.

Contributor:

At least all pubsub/monitor messages go over the dispatch queue, and there we can wake up for a single element.

@royjacobson merged commit f94c4be into main on Sep 4, 2023
10 checks passed
@royjacobson deleted the batching_async_queue_fix branch on September 4, 2023, 12:29
kostasrim added a commit that referenced this pull request Sep 8, 2023
Successfully merging this pull request may close these issues.

Dragonfly is about 10x slower than Redis when used by JuiceFS