[Feature] List down all the metrics necessary for maintaining multinode #340

shreyas-s-rao · 2022-04-29T06:48:55Z

Feature (What you would like to be added):
Create a consolidated list of all the metrics that are needed to maintain multi-node etcd. Update the file with the list. This is a running document that will capture all the metrics that will be exposed through Prometheus for multi-node etcd.

Motivation (Why is this needed?):
Once etcd-druid starts managing multi-node etcd clusters, it will perform operations such as cluster scale-up, scale-down, recovery from quorum loss, forced restorations, etc. Currently, druid does not expose any metrics about its operations, and these metrics will become essential for the multi-node story, especially for understanding and debugging druid behaviour during etcd cluster failures.

Approach/Hint to implement the solution (optional):

shreyas-s-rao added the area/monitoring (Monitoring, including availability monitoring and alerting, related) and kind/enhancement (Enhancement, improvement, extension) labels Apr 29, 2022

shreyas-s-rao mentioned this issue Apr 29, 2022

Multi-Node/Clustered ETCD #107

Closed

34 tasks

abdasgupta mentioned this issue Jun 2, 2022

[Feature] Enhance gardener grafana dashboards for multi-node #221

Closed

abdasgupta changed the title ~~[Feature] Expose new metrics for druid~~ [Feature] List down all the metrics necessary for maintaining multinode Jun 8, 2022

ashwani2k added the release/ga (Planned for GA (General Availability) release of the Feature) label Jul 6, 2022

abdasgupta added this to the v0.13.0 milestone Aug 22, 2022

ishan16696 mentioned this issue Aug 22, 2022

Added the metrics needed to capture for multi-node etcd. #414

Merged

ishan16696 closed this as completed in #414 Sep 2, 2022

gardener-robot added the status/closed (Issue is closed, either delivered or triaged) label Sep 2, 2022


Comments

shreyas-s-rao commented Apr 29, 2022 • edited by abdasgupta
