
Redis connection gone from close event #247

Closed
swapnilsarwe opened this issue Jul 11, 2012 · 12 comments

@swapnilsarwe

In our redis configuration:
timeout: 7 seconds

Whenever the connection is closed from the Redis end, we are able to catch the end event because of the timeout.
But in some cases (most likely Redis closing the connection without notifying the client) we see the command queue pile up and requests taking far too long to get a response [until the node-redis client is able to sense the close event]. In all such cases the command callback is eventually returned with the error "Redis connection gone from close event.", even after all that waiting.

The issue seems similar to this one: http://code.google.com/p/redis/issues/detail?id=368

Is there a way to specify that execution of a command [sending it and receiving the reply] should not exceed a threshold, and to reply with an error in that case instead of letting the client stall? When we run node-redis in debug mode we can clearly see the client stalling while requests pile up in the command queue. We logged the why and the queue length inside the flush_on_error function. We have kept offline_queuing disabled. Or is there any other way of triggering the close event in such cases, e.g. a socket_timeout?

Sample Log
Redis connection is gone from close event.
offline queue 0
command queue 8

Response time of failed request
[2012-07-11 08:06:48.306] [INFO] Production - {"debug":[{"time":"2012-07-11T08:06:17.918Z","data":"xxxx"},{"time":"2012-07-11T08:06:17.918Z","data":"xxxredis"},{"time":"2012-07-11T08:06:48.306Z","data":{"xxxxrediserror":"Redis connection gone from close event."}},{"time":"2012-07-11T08:06:48.306Z","data":{"YYY"}}],"responsetime":"30388 ms"}

Usual response time

{"debug":[{"time":"2012-07-11T08:21:21.241Z","data":"xxxx"},{"time":"2012-07-11T08:21:21.241Z","data":"xxxxredis"},{"time":"2012-07-11T08:21:21.242Z","data":{"xxxxredisreply":"hai","xxxxrediserror":null}},{"time":"2012-07-11T08:21:21.242Z","data":"yyy"},{"time":"2012-07-11T08:21:21.242Z","data":{"xxxxredisreply":"YYY","xxxxrediserror":null}},{"time":"2012-07-11T08:21:21.242Z","data":{"YYY"}}],"responsetime":"1 ms"}

@DTrejo
Contributor

DTrejo commented Jul 26, 2012

I don't have much experience with this, apologies.

@sberryman

I've noticed that I get timeouts when using Redis on Nodejitsu if no data has been sent on the connection for a while (an idle connection). Redis has a PING command, but I believe it doesn't work on publish/subscribe connections, so I need another way to keep those connections alive. I was thinking of a simple one-minute timer that publishes messages to a channel I know nobody else will be listening on. While this is ugly, I think it will be a simple fix until there is some other way to keep the connection alive, or better detection of disconnected connections. I agree that it takes far too long for a command to time out, and having an auto-retry would be ideal. I was going to try to find you (@DTrejo) on the Nodejitsu channel to talk about it.

EDIT: I should mention that I also have the problem of idle MongoDB connections getting "stuck". If I don't keep the connection active, I run into really strange issues on Nodejitsu where connections stay "active" for 140+ minutes; eventually Mongoose tries each connection until it times out (1 minute per connection, with 5 connections), at which point it finally emits a disconnected event and then attempts to reconnect.

@DTrejo
Contributor

DTrejo commented Feb 24, 2013

This sounds like a reasonable option to include — is this issue still at large? If so, would one of you like to submit a PR for this?

Cheers,
D

@sberryman

I've implemented the ugly hack of publishing to a channel every 60 seconds and it has been running great ever since I went down that route. Honestly, I haven't even thought about it since I submitted the ticket and implemented the "hack".
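
For reference, a minimal sketch of what that hack can look like with a node_redis client; the channel name and the 60-second interval below are illustrative, not taken from the original setup.

var redis = require('redis');
var publisher = redis.createClient();

// Publish a throwaway message on a channel nobody subscribes to, so the
// connection never sits idle long enough to be dropped.
var keepAlive = setInterval(function () {
  publisher.publish('keepalive:noop', String(Date.now()));
}, 60 * 1000);

// Don't let the keep-alive timer keep the process running on its own.
keepAlive.unref();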

@brycebaril
Contributor

I actually run into this same issue as well and have done the same "hack" -- whenever my monitoring check runs, it does a couple of lookups, with the dual purpose of logging stats and refreshing the Redis connection.

The real issue seems to be the several minutes it takes the client to realize it has timed out and to force a reconnection to Redis. Given how fast Redis is, this could be addressed by pessimistic PING checks, by forcing a new connection after a certain amount of time, or by smart connection pooling.
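
As a rough sketch of the pessimistic-PING idea (the 30-second interval, the 2-second deadline, and the recreate-the-client approach are assumptions, not node_redis behaviour):

var redis = require('redis');
var client = redis.createClient();

setInterval(function () {
  var answered = false;

  // If no PONG arrives within 2 seconds, assume the connection is dead,
  // close it, and start a fresh one.
  var deadline = setTimeout(function () {
    if (answered) return;
    client.end();
    client = redis.createClient();
  }, 2000);

  client.ping(function () {
    answered = true;
    clearTimeout(deadline);
  });
}, 30 * 1000);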

@Fishrock123

No status on this in over 11 months?

@HR

HR commented Apr 26, 2014

What's the code for the "hack"?

@munimkazia

This is a simple workaround to prevent the timeout: Just call ping at regular intervals.

setInterval(function () { client.ping(); }, 1000 * 60 * 30);

@benfleis

benfleis commented Jul 3, 2014

Two thoughts. The first is minor; the second reflects a possibly real issue in either the code or the documentation.

  1. setInterval(function () { client.ping(); }, 1000 * 60 * 30).unref(); would be a slight improvement on @munimkazia's suggestion above in many circumstances, since the timer then won't keep the process alive on its own.
  2. I saw this exact error within mocha tests that spawn a Redis server on a random port and connect a client to it. The tests worked fine, but would eventually fail with a randomly placed (race) error:
    Uncaught Error: Redis connection to localhost:65353 failed - connect ECONNREFUSED
    After adding an 'error' handler, I found the same error as above:
    Error: Error: Redis connection gone from close event.
    My sequence was to call client.quit(), wait for the 'end' event, and then carry on, assuming the client was cleanly closed. It turns out (not in any way I could see from the docs) that I still need to call client.end() from within the 'end' handler, and then the race goes away (see the sketch after this list). The race didn't occur in live code, presumably because the Redis server stays alive; the fact that I'm shutting down both server and client nearly simultaneously is the tickling condition here, I think. The question then becomes: should the client have to call .end() after receiving 'end'? If so, it ought to be documented as such; if not, there is a bug worth filing.
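
For concreteness, a sketch of the teardown sequence described in point 2, assuming a node_redis client created elsewhere in the mocha setup:

after(function (done) {
  client.on('error', function (err) {
    // Without a handler, the stray ECONNREFUSED is thrown as an uncaught error.
    console.error('redis error during teardown:', err);
  });
  client.once('end', function () {
    client.end();   // the extra call that makes the race go away
    done();
  });
  client.quit();
});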

@benfleis

Any thoughts on question 2 above, @mranney?

@brycebaril
Contributor

Hi @benfleis -- hopefully the main issue here is fixed with 0.11.0; your second issue looks like something else and should probably be opened as its own issue.

@gabegorelick

@brycebaril Does that mean that this should no longer be an issue with v0.11, because of socket_keepalive?
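
For anyone reading later, enabling that option looks roughly like the following; the explicit flag is shown only for clarity, and whether it fully covers the original report is exactly the question above.

var redis = require('redis');

var client = redis.createClient(6379, '127.0.0.1', {
  socket_keepalive: true
});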
