
NettyResponseFuture never completes because netty provider uses closed channel #415

Closed
hvesalai opened this issue Nov 8, 2013 · 7 comments

hvesalai commented Nov 8, 2013

Using version 1.7.21, a NettyResponseFuture never gets completed because the provider uses a channel that has already been closed and handled by closeChannel(...).

If I have read the source correctly, what we have here is a classic race condition:

  1. [in client thread] doConnect(...) is called and a cached channel is selected.

    At this point, the attachment on the channel is still DiscardEvent.

    NettyAsyncHttpProvider.java#L913

    doConnect(...) checks that the connection isOpen(), which it still is.

    NettyAsyncHttpProvider.java#L923

  2. [in I/O thread] The channel is closed by the remote end and channelClosed(...) is called. Since the attachment is DiscardEvent, nothing is done.

    NettyAsyncHttpProvider.java#L1341

  3. [in client thread] doConnect(...) continues using the now closed channel by attaching a NettyResponseFuture to the channel.

    NettyAsyncHttpProvider.java#L937

  4. [in client thread] doConnect(...) calls writeRequest(...) which checks if the channel is closed, which it is.

    NettyAsyncHttpProvider.java#L447

    However, writeRequest(...) does nothing about this, relying on channelClosed(...) to handle the situation. But since channelClosed(...) already ran at step 2, neither abort(...) nor done(...) will ever be called, so the future's listeners are never notified.
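The four steps above can be replayed deterministically with a toy model. This is a hypothetical reduction (the `Channel`, `ResponseFuture`, and `DISCARD` names are illustrative stand-ins, not AHC's real classes), assuming only the check-then-attach ordering described in the report:

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical reduction of the race: a pooled "channel" whose attachment
// starts as DISCARD, a close handler that only aborts real futures, and a
// client that checks isOpen before attaching a future.
public class PooledChannelRace {

    static final Object DISCARD = new Object(); // stands in for DiscardEvent

    static class ResponseFuture {
        volatile boolean completed; // set by abort(); never set in the race
        void abort() { completed = true; }
    }

    static class Channel {
        final AtomicReference<Object> attachment = new AtomicReference<>(DISCARD);
        volatile boolean open = true;

        // Mirrors channelClosed(...): it aborts a real future but silently
        // ignores a DISCARD attachment.
        void channelClosed() {
            open = false;
            Object att = attachment.get();
            if (att instanceof ResponseFuture) {
                ((ResponseFuture) att).abort();
            }
        }
    }

    // Replays steps 1-4 on a single thread, in the losing interleaving.
    static ResponseFuture replayRace() {
        Channel ch = new Channel();
        ResponseFuture future = new ResponseFuture();

        boolean wasOpen = ch.open;     // step 1: doConnect checks isOpen() -> true

        ch.channelClosed();            // step 2: remote close; attachment is
                                       // still DISCARD, so nothing is aborted

        if (wasOpen) {                 // steps 3-4: stale check; the future is
            ch.attachment.set(future); // attached to a dead channel, and the
        }                              // later "is it closed?" check does nothing

        return future;                 // nobody will ever complete this future
    }

    public static void main(String[] args) {
        ResponseFuture f = replayRace();
        System.out.println("future completed: " + f.completed); // prints "future completed: false"
    }
}
```

Because channelClosed(...) runs while the attachment is still DISCARD, the abort is skipped, and nothing ever observes the future attached afterwards.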

@javateck

Good analysis. In my stress test the client side has a thread pool and a latch; the callback simply does latch.countDown(), and I constantly see threads stuck, meaning some callbacks are never invoked. I wonder how something this basic was missed.

@slandelle
Contributor

Once again, please share your test case, as you claim to have one where you can easily reproduce.

@hvesalai
Author

@slandelle, are you asking for a test case to reproduce a race condition? Have you read the analysis? What do you think of it?

@slandelle
Contributor

@hvesalai I read it briefly but haven't dug in yet. That's typically the kind of thing that's easier to fix with a test case to reproduce it. @javateck claims he has a very obvious one, so I'd like to get my hands on it...

@ghost assigned slandelle Jan 18, 2014
@slandelle
Contributor

@hvesalai What a mess! Thanks for your analysis. I'm afraid I'll only be able to come up with an ugly fix for AHC 1. Will try to come up with something better for AHC 2.

@slandelle
Contributor

I've just pushed an imperfect fix on 1.7.X branch.

Limitations are:

  • there's still a time window between the moment a Channel is fetched from the pool and the moment a future is attached to it, during which channelClosed has no future to abort
  • the Netty HttpRequest is built again if the retry fails and we finally have to go with a new Channel
  • it's ugly

Will first port the logic as is on AHC2, but we'll try to come up with something better (maybe use Netty custom events).
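The remaining "time window" limitation comes from the check-then-attach ordering. One general way to shrink it (a sketch of the standard attach-first/re-check technique, not the actual 1.7.X patch; class names are again illustrative) is to publish the future first and re-check the channel afterwards. Since both sides use volatile/atomic operations, the Java memory model guarantees at least one of the two threads observes the other:

```java
import java.util.concurrent.atomic.AtomicReference;

// Sketch of the attach-first / re-check-after ordering.
public class AttachThenRecheck {

    static final Object DISCARD = new Object();

    static class ResponseFuture {
        volatile boolean completed;
        void abort() { completed = true; }
    }

    static class Channel {
        final AtomicReference<Object> attachment = new AtomicReference<>(DISCARD);
        volatile boolean open = true;

        void channelClosed() {
            open = false;                      // (a) publish the close...
            Object att = attachment.get();     // (b) ...then read the attachment
            if (att instanceof ResponseFuture) {
                ((ResponseFuture) att).abort();
            }
        }
    }

    // Client side: attach first, check open afterwards. If the close races
    // in between, either (b) above sees our future and aborts it, or our
    // re-check at (2) sees open == false -- the future can no longer fall
    // through the crack.
    static boolean tryUseChannel(Channel ch, ResponseFuture future) {
        ch.attachment.set(future);  // (1) publish the future...
        if (!ch.open) {             // (2) ...then re-check the channel
            future.abort();         // we observed the close: fail fast
            return false;
        }
        return true;
    }

    public static void main(String[] args) {
        // Worst case: the close lands after the future is already attached.
        Channel ch = new Channel();
        ResponseFuture future = new ResponseFuture();
        ch.attachment.set(future);
        ch.channelClosed();         // aborts the future via the attachment
        System.out.println("completed after close: " + future.completed); // prints "completed after close: true"
    }
}
```

The window does not disappear entirely (the request may still need to be retried on a fresh Channel, as noted above), but the future is always completed one way or the other.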

@slandelle
Contributor

Still not perfect, but the odds are much, much lower now (the remaining window is just a few simple instructions).
