New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
namerd - "session expired" #518
Comments
Hey @JonathanBennett, I'm having trouble reproducing this. As I understand it, namerd's zk client sends a heartbeat to keep the zk session alive. Do you have any idea what could cause the session to expire? By default, the client requests a 10 second session timeout. Is the error message you pasted from linkerd or namerd? If it's from linkerd, is there anything interesting in the namerd logs? |
Hey, In such a case how should namerd recover? Should I be restarting namerd or should it be more resilient than this in failing from a zk timeout? From my logs I do have (about 4 days of failures over a weekend) I just see all lookups/sets failing with an error similar to the following:
|
namerd should definitely recover by retrying on a new session. I have a change that I suspect will fix it, but without being able to reproduce the error it's hard to be sure. |
…d#535) In some cases, we would adjust an existing Host header, or add one. And in all cases when an HTTP/1 request was received with an absolute-form target, it was not passed on. Now, the Host header is never changed. And if the Uri was in absolute-form, it is sent in the same format. Closes linkerd#518
Error message gets thrown over and over in the logs and gets raised as a HTTP 502 to the client.
E 0704 16:09:06.818 THREAD22: service failure Failure(session expired, flags=0x100000000) with NoSources at com.twitter.finagle.NoStacktrace(Unknown Source)
From @olix0r :
fwiw we it would _seem_ like a zk exception:
:; git grep -lF session\ expired finagle-serversets/src/main/scala/com/twitter/finagle/serverset2/ZkSession.scala
but we probably should try to prevent that from bubbling up into the request flow if we can
The text was updated successfully, but these errors were encountered: