Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] List down all the metrics necessary for maintaining multinode #340

Closed
Tracked by #107
shreyas-s-rao opened this issue Apr 29, 2022 · 0 comments · Fixed by #414
Closed
Tracked by #107

[Feature] List down all the metrics necessary for maintaining multinode #340

shreyas-s-rao opened this issue Apr 29, 2022 · 0 comments · Fixed by #414
Labels
area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension release/ga Planned for GA(General Availability) release of the Feature status/closed Issue is closed (either delivered or triaged)
Milestone

Comments

@shreyas-s-rao
Copy link
Contributor

shreyas-s-rao commented Apr 29, 2022

Feature (What you would like to be added):
Create a collective list of all the metrics that is needed to maintain multi-node ETCD. Update the file with the list. This is a running document that will capture all the metrics that will be exposed through prometheus for mutinode ETCD.

Motivation (Why is this needed?):
Once etcd-druid starts managing multi-node etcd clusters, it would perform operations such as cluster scale-up, scale-down, recovery quorum losses, forced restorations, etc. Currently druid does not expose any metrics about its operations, and these metrics will become imperative for the multi-node story, especially for understanding and debugging druid behaviour during etcd cluster failures.

Approach/Hint to the implement solution (optional):

@shreyas-s-rao shreyas-s-rao added area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension labels Apr 29, 2022
@abdasgupta abdasgupta changed the title [Feature] Expose new metrics for druid [Feature] List down all the metrics necessary for maintaining multinode Jun 8, 2022
@ashwani2k ashwani2k added the release/ga Planned for GA(General Availability) release of the Feature label Jul 6, 2022
@abdasgupta abdasgupta added this to the v0.13.0 milestone Aug 22, 2022
@gardener-robot gardener-robot added the status/closed Issue is closed (either delivered or triaged) label Sep 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension release/ga Planned for GA(General Availability) release of the Feature status/closed Issue is closed (either delivered or triaged)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants