Race condition while reconnecting. #92

ololoru · 2016-11-09T22:11:52Z

I've discovered race condition while testing reconnection logic. Here below is setup I'm using:

Erlang (R16) is running on Macos, I start erlang shell and create new connection via eredis:start_link
and do basic put/get test. I'm using default database, and no authentication
Redis is started in docker container. (I'm using native docker for macos)
I stop redis service in docker and see race condition results either in process crash or closed socket saved in eredis_client state.

When connection dies and socket receives tcp_closed message eredis_client:reconnect_loop is called in a separate process where it:

Creates new socket
Changes controlling process to original eredis_client PID
Sends message to eredis_client PID to replace connection

There are the following issues with described approach:

Between steps 1) and 2) newly created socket(connection) dies and spawned process (where reconnect_loop is executed) receives tcp_closed message what causes eredis client process using dead socket
Between steps 2. and 3. socket(connection) dies and eredis_client receives tcp_closed prior to connection_ready message while socket is undefined what causes process termination on unhandled_message

The text was updated successfully, but these errors were encountered:

ntrepid8 · 2017-02-11T16:51:30Z

I see this issue pop up when trying to connect to Redis version 3.0.6, but do not see it when connecting to Redis version 2.8.4.

edit: this turned out to be a different networking issue on my server running Redis 3.0.6, please disregard...

knutin · 2017-02-11T18:04:53Z

Oh, thats interesting. I'll try to have a look soon. Sorry for the delay in responding!

…

On Sat, 11 Feb 2017 at 16:51, Josh Austin ***@***.***> wrote: I see this issue pop up when trying to connect to Redis version 3.0.6, but do not see it when connecting to Redis version 2.8.4. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#92 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAWPxFn1AH4YGDKZpfVbs0PmL0ctzs1kks5rbecTgaJpZM4KuC1N> .

benbro · 2017-03-14T09:22:38Z

How is this different than any other socket failure?
The socket can close in the following steps:

After calling gen_tcp:controlling_process/2 in reconnect_loop/2
in your PR, after sending the messages in the queue with [Client ! M || M <- Msgs]
At any other point in time

do_request(Req, From, State) returns an error when the socket is closed. The client can handle the error and retry. Isn't it enough?

Maybe we should handle the connection_ready message when the socket isn't undefined in the state?

benbro · 2017-03-14T16:41:06Z

Another option is to check for a live connection on every request and connect otherwise:
https://github.com/interline/epgsql_pool/blob/master/src/epgsql_pool_worker.erl

knutin · 2017-08-23T02:52:03Z

Wouldn't that incur a significant performance penalty?

…

On Tue 14. Mar 2017 at 09:41, benbro ***@***.***> wrote: Another option is to check for a live connection on every request and connect otherwise: https://github.com/interline/epgsql_pool/blob/master/src/epgsql_pool_worker.erl — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#92 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAWPxDbLjTsYdc7PuOONBRjJ7HOIcycdks5rlsMkgaJpZM4KuC1N> .

ololoru mentioned this issue Nov 9, 2016

fix reconnect loop race condition #93

Merged

ntrepid8 mentioned this issue Feb 11, 2017

unhandled_message errors with newer redis ntrepid8/ex_redis_pool#3

Closed

bwschmidt mentioned this issue Jul 18, 2017

crash in eredis #101

Open

knutin closed this as completed in #93 Aug 15, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Race condition while reconnecting. #92

Race condition while reconnecting. #92

ololoru commented Nov 9, 2016 •

edited

ntrepid8 commented Feb 11, 2017 •

edited

knutin commented Feb 11, 2017 via email

benbro commented Mar 14, 2017 •

edited

benbro commented Mar 14, 2017

knutin commented Aug 23, 2017 via email

Race condition while reconnecting. #92

Race condition while reconnecting. #92

Comments

ololoru commented Nov 9, 2016 • edited

ntrepid8 commented Feb 11, 2017 • edited

knutin commented Feb 11, 2017 via email

benbro commented Mar 14, 2017 • edited

benbro commented Mar 14, 2017

knutin commented Aug 23, 2017 via email

ololoru commented Nov 9, 2016 •

edited

ntrepid8 commented Feb 11, 2017 •

edited

benbro commented Mar 14, 2017 •

edited