New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1881082: Expose etcd raft term as a metric #444
Bug 1881082: Expose etcd raft term as a metric #444
Conversation
Here's what it looks like in a cluster I've got which has experienced massive churn:
|
Although I'm not seeing them being scraped by Prometheus in my cluster, so I might be missing something in terms of registration, etc. /hold |
2cb09a2
to
505f28f
Compare
Already spent way too much time trying to figure out why modules are screwy in the CI build — switched to the deprecated |
/retest |
/hold cancel |
505f28f
to
c000388
Compare
/retest |
2 similar comments
/retest |
/retest |
I think in general this is a net gain in the context of historical reference. Raft terms are persisted to state thus the counter is immutable so to speak vs the current metric. As a followup, I would like to see the term added to events where reasonable as raft terms are included in all API responses thus very available.. Another benefit of adding the term to events is we can track before prom starts. /lgtm |
/hold |
Piggyback on the member health checking facility to expose the current raft term per member as a Prometheus metric to enable more granular performance analysis.
c000388
to
9d5cd75
Compare
/lgtm |
/retest Please review the full test history for this PR and help us cut down flakes. |
19 similar comments
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@ironcladlou: This pull request references Bugzilla bug 1881082, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@ironcladlou: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@ironcladlou: All pull requests linked via external trackers have merged: Bugzilla bug 1881082 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Piggyback on the member health checking facility to expose the current
raft term per member as a Prometheus metric to enable more granular
performance analysis.