Enhance resource-manager health controller #6770
Start additional watches for all GVKs encountered in `ManagedResource.status.resources`. Objects are mapped to the owning ManagedResource to trigger the health controller as soon as the health status of any managed object changes.
Fetching events is split from health.CheckService so that we can call CheckHealth without fetching events in the healthStatusChanged predicate. In the predicate, we are not interested in the details but only in changes to the health status. Events are now fetched separately after a failed health check in the reconciler.
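To illustrate the idea, here is a minimal, stdlib-only sketch of the two pieces described above: mapping a changed object back to the ManagedResource that lists it, and a predicate that only reacts when the health status flips. It is not the code from this PR and does not use controller-runtime. `ObjectRef`, `ManagedResource`, `buildIndex`, `deployment`, and `checkDeployment` are hypothetical stand-ins, and this `healthStatusChanged` only borrows the name of the predicate mentioned above.

```go
package main

import "fmt"

// ObjectRef identifies a managed object, similar in spirit to an entry in
// ManagedResource.status.resources (hypothetical, simplified type).
type ObjectRef struct {
	APIVersion, Kind, Namespace, Name string
}

// ManagedResource is a stripped-down stand-in for the real API type.
type ManagedResource struct {
	Name      string
	Resources []ObjectRef
}

// buildIndex maps every referenced object to the names of the
// ManagedResources that list it, so a watch event for the object can be
// translated into reconcile requests for its owners.
func buildIndex(mrs []ManagedResource) map[ObjectRef][]string {
	index := make(map[ObjectRef][]string)
	for _, mr := range mrs {
		for _, ref := range mr.Resources {
			index[ref] = append(index[ref], mr.Name)
		}
	}
	return index
}

// healthStatusChanged reports whether an update flipped the health status.
// It only compares healthy vs. unhealthy and ignores the error details,
// so no events need to be fetched here.
func healthStatusChanged(oldObj, newObj any, checkHealth func(any) error) bool {
	return (checkHealth(oldObj) == nil) != (checkHealth(newObj) == nil)
}

// deployment is a toy object used only for this example.
type deployment struct{ desired, ready int }

// checkDeployment is a toy health check: all desired replicas must be ready.
func checkDeployment(obj any) error {
	d, ok := obj.(deployment)
	if !ok {
		return fmt.Errorf("unexpected type %T", obj)
	}
	if d.ready < d.desired {
		return fmt.Errorf("only %d/%d replicas ready", d.ready, d.desired)
	}
	return nil
}

func main() {
	ref := ObjectRef{APIVersion: "apps/v1", Kind: "Deployment", Namespace: "kube-system", Name: "coredns"}
	index := buildIndex([]ManagedResource{{Name: "shoot-core", Resources: []ObjectRef{ref}}})

	oldObj := deployment{desired: 2, ready: 2}
	newObj := deployment{desired: 2, ready: 1}
	if healthStatusChanged(oldObj, newObj, checkDeployment) {
		fmt.Println("enqueue:", index[ref])
	}
}
```

In this sketch, an update that keeps the object healthy (or keeps it unhealthy) is filtered out, so the owning ManagedResource is only enqueued on an actual status flip.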
We now either use typed objects or metadata-only objects (for GVKs not registered in target scheme). With this, we can fully leverage the cache of the target cluster (if enabled), even for objects we don't know. For them, we only care about their presence, so metadata-only is enough.
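The choice between typed and metadata-only objects can be sketched roughly as follows. This is a simplified, stdlib-only illustration under assumptions, not the PR's implementation: `GVK`, `Scheme`, `WatchMode`, `watchModeFor`, `Object`, and this `checkHealth` are hypothetical names, and the `Healthy` field stands in for a real kind-specific status check.

```go
package main

import (
	"errors"
	"fmt"
)

// GVK is a simplified group/version/kind triple.
type GVK struct{ Group, Version, Kind string }

// Scheme is a toy stand-in for the target scheme: the set of registered GVKs.
type Scheme map[GVK]bool

// WatchMode describes how objects of a GVK are read from the cache.
type WatchMode string

const (
	Typed        WatchMode = "typed"
	MetadataOnly WatchMode = "metadata-only"
)

// watchModeFor picks typed objects for GVKs registered in the scheme and
// metadata-only objects for everything else.
func watchModeFor(scheme Scheme, gvk GVK) WatchMode {
	if scheme[gvk] {
		return Typed
	}
	return MetadataOnly
}

// Object is a toy cached object. Healthy is only meaningful for typed objects.
type Object struct {
	Mode    WatchMode
	Healthy bool
}

// checkHealth treats a missing object as unhealthy. For metadata-only
// objects, presence is enough; typed objects get a (toy) status check.
func checkHealth(obj *Object) error {
	if obj == nil {
		return errors.New("object not found")
	}
	if obj.Mode == MetadataOnly {
		return nil
	}
	if !obj.Healthy {
		return errors.New("object is unhealthy")
	}
	return nil
}

func main() {
	scheme := Scheme{{Group: "apps", Version: "v1", Kind: "Deployment"}: true}

	known := GVK{Group: "apps", Version: "v1", Kind: "Deployment"}
	unknown := GVK{Group: "example.org", Version: "v1", Kind: "Widget"}

	fmt.Println(known.Kind, "->", watchModeFor(scheme, known))
	fmt.Println(unknown.Kind, "->", watchModeFor(scheme, unknown))
	fmt.Println("metadata-only present:", checkHealth(&Object{Mode: MetadataOnly}) == nil)
}
```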
/assign
CheckHealth can now properly handle metadata-only objects, no need to treat them specifically in the predicate.
@rfranzke thanks for your quick review! PTAL :)
Thanks, no further comments!
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: rfranzke
How to categorize this PR?
/area ops-productivity scalability
/kind enhancement
What this PR does / why we need it:
Enhance resource-manager health controller in several ways:
Which issue(s) this PR fixes:
Special notes for your reviewer:
The best part is that resource-manager reconciles `ManagedResources` as soon as their health status changes to false. Hence, if you delete a managed resource or break its health, it now gets reconciled back immediately 🚀
Release note: