-
Notifications
You must be signed in to change notification settings - Fork 232
Be more tolerant with unresponsive brokers #347
Comments
Just collecting more thoughts here, because I cannot completely oversee the extent of the problem yet: I wonder if it would make sense to move away from storing (This is also generally a safer setup, and would have avoided a bug @emmett9001 discovered the other week, where a |
All this is also related, but not exactly identical, to the issues in #338. |
#358 is related. I'm not super excited about moving away from storing |
This issue is old enough and similar enough and unclear enough that I'm going to close it. Please reopen if you think that's not appropriate. |
Yep, agree that we don't need to move on this right now: I don't think I've seen any related issues opened lately, and the proposed changes would, like you said, be a bit of a risk. Quick recap in case we revisit this in future (and so that I can remove the "hazy" label ;)):
|
This is a half-formed thought, but it appears @rduplain in #189 has been battling with https://issues.apache.org/jira/browse/KAFKA-1387 - the odd problem there is that a broker may be listed in metadata, but will refuse to speak to us upon connecting.
Currently, I think
Cluster._update_brokers()
stumbles when that happens. What should happen instead is that we just continue, leaving the broker in a disconnected state, because we only really have a problem once it turns out that the broker in question is aPartition.leader
- which it probably won't be, so we wouldn't care that it remains disconnected.The text was updated successfully, but these errors were encountered: