Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consumers unable to recover upon kafka cluster failure #264

Closed
vineetgoel opened this issue Dec 4, 2017 · 2 comments
Closed

Consumers unable to recover upon kafka cluster failure #264

vineetgoel opened this issue Dec 4, 2017 · 2 comments

Comments

@vineetgoel
Copy link
Contributor

vineetgoel commented Dec 4, 2017

I noticed that my application which uses both producers and consumers was unable to recover upon some boxes failing in my kafka cluster. The errors I see in the logs are as follows (with errors closest to the point until which my application was running fine, on top):

  • kafka.errors.GroupCoordinatorNotAvailableError: [Error 15] GroupCoordinatorNotAvailableError
  • [2017-12-04 10:28:30,621: INFO] OffsetCommit failed for group foo due to a coordinator error (RequestTimedOutError), will find new coordinator and retry
  • kafka.errors.RequestTimedOutError: [Error 7] RequestTimedOutError

Not sure why the consumers were unable to recover at this point. What is the ideal way of handling such exceptions within the consumer. It also seems like the consumers don't really raise this exception and only logs them.

@vineetgoel
Copy link
Contributor Author

Verified that the consumer is able to recover on in most cases.

However there are times when the following error is raised and the consumer crashes:

kafka.errors.GroupCoordinatorNotAvailableError: [Error 15] GroupCoordinatorNotAvailableError

@tvoinarovskyi
Copy link
Member

Should be fixed by #286.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants