Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Received ping response with no matching id #8256

Closed
nik9000 opened this issue Oct 28, 2014 · 4 comments
Closed

Received ping response with no matching id #8256

nik9000 opened this issue Oct 28, 2014 · 4 comments
Assignees

Comments

@nik9000
Copy link
Member

nik9000 commented Oct 28, 2014

I'm seeing a bunch of these on startup:

[2014-10-28 14:39:57,956][WARN ][discovery.zen.ping.multicast] [elastic1020] received ping response ping_response{target [[elastic1004][eq_H7EFGQ9SNTJuA6Wj9zw][elastic1004][inet[/10.64.0.111:9300]]{rack=A3, row=A, master=false}], master [[elastic1002][pEwiGcyESvSXFtUUwlh8qw][elastic1002][inet[/10.64.0.109:9300]]{rack=A3, row=A, master=true}], cluster_name[production-search-eqiad]} with no matching id [1]

and then everything works just fine. Is this OK?

@bleskes
Copy link
Contributor

bleskes commented Oct 28, 2014

@nik9000 this can happen when a node takes more then 1.5 seconds to respond to a multicast ping - i.e., after the ping has been timed out locally. Nothing much to worry about if you had nodes under load (and you are indeed using multicast). It can also happen if you have restarted a node in the middle of pinging and it's very quick to come back. The node will receive answers to ping it's predecessor sent but of course, the id is no longer in memory.

Let me know if this explains things so we can close it (or not..).

@nik9000
Copy link
Member Author

nik9000 commented Oct 28, 2014

@bleskes thanks. That's certainly explains it. In this case the nodes aren't really under load.... Would a flood of nodes joining cause this? Like, if I added 10 nodes to my cluster really fast. Because I did.

@bleskes
Copy link
Contributor

bleskes commented Oct 28, 2014

@nik9000 if you also took the nodes offline while other nodes were already pinging (for example, you had min master nodes breach) it does - the nodes got responses to the pings they sent before being restarted.

@nik9000
Copy link
Member Author

nik9000 commented Oct 28, 2014

I don't think that is what happened but I've suffered no ill effects from it. I'll just assume it was a blip of some sort.

@nik9000 nik9000 closed this as completed Oct 28, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants