PeerGroup.handlePeerDeath() + BlockingClient.socket.close() called twice #1765

oscarguindzberg · 2019-04-01T16:46:14Z

When using BlockingClient instances, PeerGroup.handlePeerDeath() will be called twice if there is a connectivity problem during connection setup to a peer.

The BlockingClient constructor starts a thread whose body ends in a finally block. If there is a connection problem, an exception is thrown and the finally block runs. It calls connection.connectionClosed(), i.e. Peer.connectionClosed(), which calls listener.onPeerDisconnected. One of the listeners is PeerGroup.PeerStartupListener, which calls PeerGroup.handlePeerDeath().
At the same time, Peer extends AbstractTimeoutHandler. When the peer is created, it is configured to time out if the version ack message has not been received within a set interval. That timeout calls Peer.timeoutOccurred(), which also eventually ends up calling PeerGroup.handlePeerDeath().
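
To illustrate the double invocation described above, here is a minimal sketch of one possible mitigation: guarding the death notification with a compare-and-set flag so that only the first of the two paths takes effect. GuardedPeer, notifyDeathOnce() and simulate() are hypothetical names made up for this example; this is not bitcoinj code, and it is not a claim about how the project should fix the issue.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

class DeathGuardSketch {
    /** Hypothetical stand-in for a peer connection; NOT bitcoinj's Peer class. */
    static class GuardedPeer {
        private final AtomicBoolean dead = new AtomicBoolean(false);
        private final AtomicInteger deathsHandled = new AtomicInteger();

        /** Path 1: the I/O thread's finally block reports the closed connection. */
        void connectionClosed() {
            notifyDeathOnce();
        }

        /** Path 2: the version ack timeout fires. */
        void timeoutOccurred() {
            notifyDeathOnce();
        }

        /** Only the first caller flips the flag; later callers do nothing. */
        private void notifyDeathOnce() {
            if (dead.compareAndSet(false, true)) {
                // Stand-in for the real handlePeerDeath() work.
                deathsHandled.incrementAndGet();
            }
        }

        int deathsHandled() {
            return deathsHandled.get();
        }
    }

    /** Runs both paths on separate threads and returns how often death was handled. */
    static int simulate() {
        GuardedPeer peer = new GuardedPeer();
        Thread ioThread = new Thread(peer::connectionClosed);
        Thread timeoutThread = new Thread(peer::timeoutOccurred);
        ioThread.start();
        timeoutThread.start();
        try {
            ioThread.join();
            timeoutThread.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return peer.deathsHandled();
    }

    public static void main(String[] args) {
        System.out.println("deaths handled: " + simulate());
    }
}
```

Whichever path reaches the flag first does the work, so the order in which the two timeouts fire stops mattering. Whether such a guard belongs in Peer, PeerGroup or the connection manager is left open here.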

I suspect that, in the same scenario, BlockingClient's socket might also be closed twice:

Path 1: thread created by the BlockingClient constructor -> connection problem -> finally block -> socket.close(). There is a catch that ignores the exception if socket.close() fails.
Path 2: Peer.timeoutOccurred() -> PeerSocketHandler.timeoutOccurred() -> PeerSocketHandler.close() -> writeTarget.closeConnection(), i.e. BlockingClient.closeConnection() -> socket.close(). There is a catch that throws an exception if socket.close() fails.
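
As a sanity check on how much the second socket.close() matters by itself: java.net.Socket implements java.io.Closeable, whose contract says that closing an already closed stream has no effect. The sketch below closes an unconnected socket twice to check that. DoubleCloseSketch and closeTwice() are hypothetical names for this example. It only exercises the JDK class, not BlockingClient, so it says nothing about other side effects of the two paths.

```java
import java.io.IOException;
import java.net.Socket;

class DoubleCloseSketch {
    /** Closes an unconnected socket twice; returns true if neither call throws. */
    static boolean closeTwice() {
        Socket socket = new Socket(); // unconnected, so no network access is needed
        try {
            socket.close(); // first close, as in the I/O thread's finally block
            socket.close(); // second close, as in closeConnection() on timeout
            return socket.isClosed();
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("second close tolerated: " + closeTwice());
    }
}
```

If that holds for a connected socket too, the repeated close is probably less of a concern than the repeated handlePeerDeath() call, but I have not verified this against a live connection.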

There is a race condition between BlockingClient's socket timeout (defaults to BlockingClientManager.connectTimeoutMillis, i.e. 1 second) and Peer's version ack timeout (defaults to PeerGroup.DEFAULT_CONNECT_TIMEOUT_MILLIS, i.e. 5 seconds).

oscarguindzberg · 2019-04-01T16:56:29Z

@schildbach could you confirm that my analysis is right?

schildbach · 2019-04-07T08:14:14Z

I've never looked at how the ClientConnectionManagers work. I think PeerGroup has used NioClientManager for ages. Is there a specific reason you're using BlockingClientManager?

schildbach · 2019-04-07T08:15:26Z

Ah, I guess Tor is the reason?

oscarguindzberg · 2019-04-07T22:59:55Z

Yes, Tor is the reason.

schildbach · 2019-07-22T12:07:53Z

Hmm, I'm not sure how to continue with this.

BlockingClient(Manager) was mostly written by @TheBlueMatt via 534cec9, but I don't expect him to return to look into such issues. Also, judging by his commit description he was mostly focused on NIO and blocking I/O was just a byproduct (or maybe even just a leftover from the ancient Netty code).

I guess we will need someone who takes care of the networking code. Maybe explore ways to run Tor over NIO? Or maybe switch over to okio?

oscarguindzberg · 2019-07-22T17:37:30Z

I am taking some time off so I won't follow up on this subject in the near future.

oscarguindzberg mentioned this issue Apr 2, 2019

Bisq's bitcoinj connection handling changes audit bisq-network/bitcoinj#30

Open

oscarguindzberg mentioned this issue May 8, 2019

bisq's bitcoinj status bisq-network/bitcoinj#33

Open
