This repository has been archived by the owner. It is now read-only.
handle memcached connection events #83
Closed
Comments
|
Thanks @jrgm, I'll take a look |
|
This has not re-ocurred, but we should keep the bug open and follow up on the suggestion to handle connection events from the memcached driver. We can at least log them for debugging purposes. |
|
We're going to try to move away from memcached for this service, so I'm closing memcached-related bugs in the backlog |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
In production, the customs server on a single instance of fxa-auth-server got into a state where it was failing every request with this error:
There is no hint in the logs as to what triggered this server to enter this state. The log was completely normal until all connection would fail. However, the AWS memcached service was up and healthy and could be connected to over TCP from the same box that was reporting this error. A look at the customs server process with
lsoflooked normal, except there were no TCP sockets in any state for the memcached port number 11211. Anstraceshowed that upon receiving a/checkHTTP request from the fxa-auth-server, no attempt would be made to initiate a TCP connection to the memcached service. Quite strange.I was able to get a customs server to stage to produce the same stack trace by doing
sudo iptables -A OUTPUT -d <memcached IP> -j DROPand sending a steady stream of/auth/loginrequests to the auth-server. But when I would remove the iptables block, the server would eventually return to normal, in my tests.However, I wonder if we need to directly handle some of the connection events in https://github.com/3rd-Eden/memcached#events, for post-mortem analysis, if nothing else. Or maybe add an internal self-check that forces a restart if we can't connect to memcached.
The text was updated successfully, but these errors were encountered: