New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while dialing dial tcp 127.0.0.1:2399: connect: connection refused #4728
Comments
Is this error preventing the node from starting up, or is it just filling the logs on an otherwise functional cluster member? Note that 2 servers isn't a supported configuration when using etcd - you must have have an odd number in order to meet quorum requirements: https://etcd.io/docs/v3.5/faq/#why-an-odd-number-of-cluster-members Can you confirm that you still run into this when using a supported number of servers? cc @briandowns it looks like we might be leaking an etcd client somewhere? Assuming the cluster is working despite this message, it suggests that there's still a grpc client around trying to keepalive to the temporary etcd instance that gets set up to extract the bootstrap data. |
Hi @brandond - thanks for your response! The cluster is otherwise functional! Now I have a 3 node cluster and the error is still regularly (roughly every two minutes) occurring on the node I bootstrapped first. If you want me to run any diagnostics on that node, I'd be happy to - just tell me which ones ;-) Regards, |
Thanks! Can you include the output of |
I can confirm that I see this message repeating on only the first node of a two-node etcd cluster after stopping and restarting both nodes.
|
Steps to reproduce:
|
Environmental Info:
K3s Version:
k3s version v1.21.7+k3s1 (ac70570)
go version go1.16.10
Node(s) CPU architecture, OS, and Version:
Linux rancher-02 4.19.0-18-amd64 #1 SMP Debian 4.19.208-1 (2021-09-29) x86_64 GNU/Linux
Cluster Configuration:
2 servers, embedded etcd
Describe the bug:
Regularily seeing
{"level":"warn","ts":"2021-12-13T13:33:11.054+0100","caller":"grpclog/grpclog.go:60","msg":"grpc: addrConn.createTransport failed to connect to {http://127.0.0.1:2399 <nil> 0 <nil>}. Err :connection error: desc = \"transport: Error while dialing dial tcp 127.0.0.1:2399: connect: connection refused\". Reconnecting..."}
in the k3s log.The text was updated successfully, but these errors were encountered: