Metrics associated with a deleted node should not be reported #28382
/test |
…er reported. When a node is deleted from a cluster, metrics associated with that node are still being exported to Prometheus. Short of restarting the agent, we want to dynamically delete these metrics when a node is removed from the cluster. This PR ensures node_connectivity_status and node_connectivity_latency no longer report metrics for nodes that are no longer present in the cluster. Signed-off-by: Fernand Galiana <fernand.galiana@isovalent.com>
681c5d5 to ce9b145
/test |
Just a minor nit on the release note.
Typically we frame release notes by describing the impact, so something like …
AFAIU, there's already a differentiation for metrics that can be deleted, so this PR is just following the pattern. IMO, we can merge this now to fix the ongoing problems with the node connectivity metrics, and then follow up with a refactor.
Checkpatch is complaining that the commit subject line is too long. Overridden.
Did this make it into the newest release?
Please ensure your pull request adheres to the following guidelines:
… description and a Fixes: #XXX line if the commit addresses a particular GitHub issue.
… Fixes: <commit-id> tag, then please add the commit author[s] as reviewer[s] to this issue.
When a node is deleted from a cluster, metrics associated with that node are still being exported to Prometheus.
Short of restarting the agent, we want to dynamically delete these metrics when a node is removed from the cluster.
This PR ensures node_connectivity_status and node_connectivity_latency no longer report metrics for nodes that are no longer present in the cluster.
Fixes: #issue-number
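
For readers unfamiliar with the idea, here is a minimal sketch of what "dynamically deleting" per-node series can look like. It is not the code from this PR. The nodeGaugeVec type, its methods, and the target_node_name and type label names are hypothetical stand-ins, written with the standard library only so the example runs on its own. With the Prometheus Go client, metric vectors such as GaugeVec offer a DeletePartialMatch method that serves the same purpose.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
	"sync"
)

// series is one labeled time series with its current value.
type series struct {
	labels map[string]string
	value  float64
}

// nodeGaugeVec is a hypothetical stand-in for a labeled gauge vector.
// It is NOT the Prometheus client type; it only illustrates the idea.
type nodeGaugeVec struct {
	mu     sync.Mutex
	series map[string]*series
}

func newNodeGaugeVec() *nodeGaugeVec {
	return &nodeGaugeVec{series: make(map[string]*series)}
}

// key builds a stable identifier for a label set by sorting label names.
func key(labels map[string]string) string {
	names := make([]string, 0, len(labels))
	for n := range labels {
		names = append(names, n)
	}
	sort.Strings(names)
	var b strings.Builder
	for _, n := range names {
		fmt.Fprintf(&b, "%s=%q,", n, labels[n])
	}
	return b.String()
}

// Set creates or updates the series identified by the full label set.
func (v *nodeGaugeVec) Set(labels map[string]string, value float64) {
	v.mu.Lock()
	defer v.mu.Unlock()
	cp := make(map[string]string, len(labels))
	for n, val := range labels {
		cp[n] = val
	}
	v.series[key(labels)] = &series{labels: cp, value: value}
}

// DeletePartialMatch removes every series whose labels contain all of the
// given name/value pairs and returns how many series were removed.
func (v *nodeGaugeVec) DeletePartialMatch(match map[string]string) int {
	v.mu.Lock()
	defer v.mu.Unlock()
	removed := 0
	for k, s := range v.series {
		if containsAll(s.labels, match) {
			delete(v.series, k) // deleting during range is safe in Go
			removed++
		}
	}
	return removed
}

// containsAll reports whether labels includes every pair in match.
func containsAll(labels, match map[string]string) bool {
	for n, want := range match {
		if got, ok := labels[n]; !ok || got != want {
			return false
		}
	}
	return true
}

// Len returns the number of series currently exported.
func (v *nodeGaugeVec) Len() int {
	v.mu.Lock()
	defer v.mu.Unlock()
	return len(v.series)
}

func main() {
	// Label names and values below are assumptions for illustration.
	status := newNodeGaugeVec()
	status.Set(map[string]string{"target_node_name": "node-a", "type": "endpoint"}, 1)
	status.Set(map[string]string{"target_node_name": "node-a", "type": "node"}, 1)
	status.Set(map[string]string{"target_node_name": "node-b", "type": "node"}, 1)

	// node-a leaves the cluster: drop all of its series, keep node-b.
	removed := status.DeletePartialMatch(map[string]string{"target_node_name": "node-a"})
	fmt.Println("removed:", removed, "remaining:", status.Len())
}
```

The point of matching on a subset of labels is that the node-deletion handler only knows the node name, not every label combination that was ever exported for it, so it cannot delete series one by one by their full label set.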