Socket write queue fixes and improvements #4202
Merged
The socket asynchronous write loop, first introduced in #1938, was broken by #2918. That PR attempted to streamline the logic, apparently on the assumption that a strand alone is enough to guarantee correct ordering of async writes; however, this assumption is incorrect. A strand only ensures that handlers do not run concurrently, not that whole async operations complete in the order they were started, so an unfortunate call sequence such as
async_write_1 > async_write_2 > async_write_1_complete > async_write_2_complete ...
is still possible. Here is a brief discussion from the Boost ASIO mailing list that mentions this problem: https://boost-users.boost.narkive.com/MrmFQST2/multiple-async-operations-on-a-socket

To correctly implement a multi-writer async write loop, both a strand and a write queue of some sort are required. Failing to do so can result in data from multiple write requests being interleaved, which is exactly the issue observed during high-load local tests. This could also explain why a drop in connected nodes is observed each time live network activity spikes.

A related improvement is that the newly introduced socket send queue is further split into multiple subqueues, one per traffic type. This allows better prioritization of inter-node communication when network throughput is limited, e.g. ensuring that bootstrap traffic won't preempt voting when the network is experiencing a period of heightened activity. This should eliminate the need for aggressive throttling of the ascending bootstrapper rate, as both client and server should now automatically throttle their rate in response to network back pressure (the socket send queue filling up). This could be extended further to prioritize live voting / voting requests / voting responses / block publishing differently, which would give the network additional resiliency, but this is a TODO.