Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dependency health checking (e.g. Gardener) #138

Open
ebensom opened this issue Feb 14, 2024 · 4 comments
Open

Dependency health checking (e.g. Gardener) #138

ebensom opened this issue Feb 14, 2024 · 4 comments
Labels
area/control-plane Related to all activities around Kyma Control Plane

Comments

@ebensom
Copy link
Member

ebensom commented Feb 14, 2024

Description

Implement periodic health checking of Gardener cluster API dependency by periodically querying of the version or health non-resource endpoint via gardener kubeclient in a separate goroutine and keep the latest check result up-to-date. Expose the current (up-to-date) healthcheck result on the Prometheus metrics endpoint via series like:

{app}_{subsys}_gardener_health{url="..", status="healthy"} 1
{app}_{subsys}_gardener_health{url="..", status="error"} 0
{app}_{subsys}_gardener_health{url="..", status="unknown"} 0

Reasons

Ability to cross-correlate infrastructure-manager errors with Gardener API (dependency) errors.

Attachments

@ebensom ebensom added area/control-plane Related to all activities around Kyma Control Plane kind/feature Categorizes issue or PR as related to a new feature. labels Feb 14, 2024
@tobiscr
Copy link
Contributor

tobiscr commented Feb 19, 2024

@ebensom : can we please quickly sync about it? We have open questions

@tobiscr
Copy link
Contributor

tobiscr commented Apr 29, 2024

@ebensom : I will setup a call for it to clarify the purpose. We want avoid to increase load on Gardener caused by redundant health-checks from us + additional monitoring etc.

@tobiscr
Copy link
Contributor

tobiscr commented May 3, 2024

Istio offers HTTP request metrics but those metrics are only available if traffic is used via plain HTTP but for Gardener HTTP connections are not possible as it enforces HTTPS communication.

Option to implement a check via Prometheus client in KCP would be possible, but this won't reflect whether the KIM is really able to talk to Gardener (it's not reflecting the truth).

@tobiscr
Copy link
Contributor

tobiscr commented May 7, 2024

@ebensom : we will only implement it if you send to each of us a "Thank you " award ;)

@tobiscr tobiscr changed the title Dependency health checking Dependency health checking (e.g. Gardener) Jun 20, 2024
@tobiscr tobiscr removed the kind/feature Categorizes issue or PR as related to a new feature. label Jun 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/control-plane Related to all activities around Kyma Control Plane
Projects
None yet
Development

No branches or pull requests

2 participants