Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Operator freezes while exiting after an error in the watching/queueing cycle #25

Closed
kopf-archiver bot opened this issue Aug 18, 2020 · 0 comments
Closed
Labels
archive bug Something isn't working

Comments

@kopf-archiver
Copy link

kopf-archiver bot commented Aug 18, 2020

An issue by nolar at 2019-04-08 12:13:15+00:00
Original URL: zalando-incubator/kopf#25
 

Expected Behavior

When an error happens in the watching/queueing coroutines, the process exits, and the pod is restarted by Kubernetes (or it just exists if executed locally).

Actual Behavior

In some cases, the process freezes after the exception, and no new events are handled, nothing is logged.

Steps to Reproduce the Problem

Uncertain, but:

  1. Simulate an error in the watching cycle, e.g. such as Kopf-based operator fails with KeyError ['uid'] #10

Commented by nolar at 2019-04-16 14:59:09+00:00
 

The issue is presumably fixed in #27.

"Presumably" means that it was one certain way of freezing the operator forever with no reaction, and the simulated symptoms match with the observed symptoms. This way is now fixed.

But it is unclear how this way could be triggered and activated: the kubernetes.watch.Watch().stream() call never ends normally, since it has while True inside.

Which, in turn, means that there could be other reasons and ways of freezing. We need to catch them first, and investigate if it happens again.

@kopf-archiver kopf-archiver bot closed this as completed Aug 18, 2020
@kopf-archiver kopf-archiver bot changed the title [archival placeholder] Operator freezes while exiting after an error in the watching/queueing cycle Aug 19, 2020
@kopf-archiver kopf-archiver bot added the bug Something isn't working label Aug 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
archive bug Something isn't working
Projects
None yet
Development

No branches or pull requests

0 participants