Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Webhook Can Enter Infinite Crashloop #4165

Closed
ian-mi opened this issue Sep 29, 2020 · 1 comment · Fixed by #4168
Closed

Webhook Can Enter Infinite Crashloop #4165

ian-mi opened this issue Sep 29, 2020 · 1 comment · Fixed by #4168
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@ian-mi
Copy link
Contributor

ian-mi commented Sep 29, 2020

Describe the bug
The eventing-webhook will continually crash with errors such as

{"level":"error","ts":"2020-09-29T21:44:49.120Z","logger":"eventing-webhook.sinkbindings.webhook.sources.knative.dev","caller":"controller/controller.go:528","msg":"Reconcile error","commit":"74b9bed","knative.dev/pod":"eventing-webhook-847f8cdc6d-cbftm","error":"secret "eventing-webhook-certs" is missing "ca-cert.pem" key","stacktrace":"knative.dev/pkg/controller.(*Impl).handleErr\n\tknative.dev/pkg@v0.0.0-20200922164940-4bf40ad82aab/controller/controller.go:528\nknative.dev/pkg/controller.(*Impl).processNextWorkItem\n\tknative.dev/pkg@v0.0.0-20200922164940-4bf40ad82aab/controller/controller.go:514\nknative.dev/pkg/controller.(*Impl).RunContext.func3\n\tknative.dev/pkg@v0.0.0-20200922164940-4bf40ad82aab/controller/controller.go:451"}

http: TLS handshake error from 10.24.2.1:36304: server key missing

The eventing-webhook-certs secret remains unpopulated and does not appear to get reconciled by the webhook.

Expected behavior
The webhook certificate reconciler should populate the secret on startup.

To Reproduce
This occurs some small percentage of the time when applying the release yaml.

Knative release version
0.17.2

@ian-mi ian-mi added the kind/bug Categorizes issue or PR as related to a bug. label Sep 29, 2020
@ian-mi
Copy link
Contributor Author

ian-mi commented Sep 29, 2020

This scenario appears to occur when the webhook must obtain a previously held leader lease but the eventing-webhook-certs has not yet been reconciled. In this case the webhook will shutdown before the leader lease can be acquired and the cert can be reconciled.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
None yet
1 participant