.dataReceived() is called once per byte, probably inefficient #3
dash pointed at Line 66 in 7ae558f
The easy way is to just subclass the class returned from parsley.makeProtocol, but instead we'd need to build a protocol just like that but with .dataReceived() overridden. Or, we build yet another Protocol object (with the switch-on-state logic) to wrap the Parsley-provided one. The transport would be connected to our wrapper, which would either deliver data to the Parsley protocol (before the handshake completes) or to the application-layer one (Foolscap, in our case), depending upon the state of the Parsley object.
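The switch-on-state wrapper described above could be sketched roughly as follows. This is a hypothetical illustration, not txi2p's actual code: the names `handshake`, `feed()`, and `finished` are assumptions standing in for whatever interface the Parsley-driven handshake parser actually exposes.

```python
# Hypothetical sketch of the switch-on-state wrapper (illustrative names,
# not txi2p's or Parsley's real API).  The transport would be connected to
# this wrapper.  Before the handshake completes, bytes are fed to the
# Parsley-driven parser; afterwards, whole chunks go straight to the
# application-layer protocol's dataReceived(), avoiding per-byte delivery.

class SwitchingWrapper:
    def __init__(self, handshake, app_protocol):
        self.handshake = handshake        # Parsley-driven handshake parser
        self.app_protocol = app_protocol  # e.g. the Foolscap protocol
        self.handshake_done = False

    def dataReceived(self, data):
        if not self.handshake_done:
            # Assumed interface: feed() consumes handshake bytes and
            # returns any leftover bytes that arrived after the handshake.
            leftover = self.handshake.feed(data)
            if self.handshake.finished:
                self.handshake_done = True
                if leftover:
                    self.app_protocol.dataReceived(leftover)
        else:
            # Post-handshake: hand over entire buffers in one call.
            self.app_protocol.dataReceived(data)
```

The key property is the last branch: once the handshake is done, a 64kB kernel read reaches the application protocol as one `dataReceived()` call rather than 65536 of them.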
Fixed per @washort's suggestion. Thanks!
exarkun added a commit to exarkun/txi2p-original that referenced this issue on Aug 2, 2022: "Set up CI for tests and release automation"
While analyzing tahoe#2861, we identified a likely performance concern with the way txi2p delivers data on the server side of a connection.
This happens when you use a txi2p.sam.endpoints.SAMI2PStreamServerEndpoint, which is how you listen on an .i2p address. txi2p implements this by making an outbound connection to the local I2P daemon, writing a command that says "hey, I want to accept connections for (some .i2p address)", then waiting for a response. When some client connects, the daemon responds ("hey, someone connected, get ready to talk to them"), and then uses the same TCP connection for the subsequent tunneled data.
On the txi2p side, there is a parser/state machine (implemented with Parsley) that manages the initial command and response. Once the response is received, this state machine is moved into State_readData, which matches on arbitrary single bytes (the "anything:data" target) and delivers each one to receiver.dataReceived().
This is sound, but slow. The expected scenario is when e.g. a Tahoe client uploads several megabytes of binary data to an I2P-based server, delivered through Foolscap and into the I2P connection. On the receiving (server) side, large buffers can be delivered in a single system call, up to the size of the kernel buffers (typically 64kB). This could all be processeded in a single .dataReceived() invocation. When txi2p breaks this up into a lot of one-byte invocations instead, performance will suffer (in particular, CPU usage on the server will be higher than necessary). The worst case is probably a quadratic slowdown, if the next-higher protocol (e.g. Foolscap) does the lazy thing and appends the incoming bytes to a buffer until the expected number have been received.
To fix this, txi2p will need to swap out the Parsley parser for a direct connection to the target protocol's .dataReceived when it moves into State_readData. @washort suggested:
This might affect the inbound side of I2P client connections too (those created with SAMI2PStreamClientEndpoint); I'm not sure. In the Tahoe context, this would be a Tahoe client downloading a file from an I2P-hosted server, and the additional CPU load would occur on the client side.
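To see why the worst case is quadratic, here is a minimal hypothetical sketch (not actual txi2p or Foolscap code) of the "lazy" receiver described above, which appends each incoming chunk to a flat bytes buffer:

```python
# Hypothetical illustration, not txi2p/Foolscap code: a receiver that
# naively accumulates bytes until a complete message has arrived.
class NaiveReceiver:
    def __init__(self, expected_size):
        self.expected_size = expected_size
        self.buffer = b""
        self.messages = []

    def dataReceived(self, data):
        # bytes concatenation copies the whole existing buffer, so N
        # one-byte calls cost O(N^2) total copying, while a single
        # N-byte call costs O(N).
        self.buffer += data
        if len(self.buffer) >= self.expected_size:
            self.messages.append(self.buffer[:self.expected_size])
            self.buffer = self.buffer[self.expected_size:]
```

Delivering one 64kB kernel read as 65536 single-byte dataReceived() calls forces this receiver to re-copy its growing buffer on every call; delivering the same read as one chunk costs a single copy.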