charts/*: drop wireService label, use app= instead, add servicemonitor support #2413

flokli · 2022-05-18T14:16:30Z

This aligns labels a bit more with how they look like in other
deployments. In some cases, we were already setting the app label,
too.

There's one possible regression:
The wire-server-metrics helm chart previously configured kube-prometheus-stack to automatically scrape everything with a wireService label at port http,
path /i/metrics.

This custom configuration has been removed, and instead each chart provides the option to create ServiceMonitor resources, which add wire services to metric scraping that way.

Checklist

jschaul

looks fine to me; if CI passes.

I'm not sure if the wire-server-metrics chart truly used anywhere by anyone (not if it's installed, but if anyone is looking at any of those metrics)

flokli · 2022-05-18T15:41:27Z

Then I'd probably even propose going forward to entirely removing the wire-server-metrics charts, and updating the docs accordingly. Once there are ServiceProbes, a default kube-prometheus stack, or grafana-agent should be able to scrape these metrics.

akshaymankar · 2022-05-19T07:33:59Z

The wire-server-metrics chart is used by the federation environments. So, if you do change this flow, please don't forget to update the federation environments. Hopefully, this change doesn't blow up there 😄

flokli · 2022-05-19T10:25:04Z

The wire-server-metrics chart is used by the federation environments. So, if you do change this flow, please don't forget to update the federation environments.

Are we fine with just stopping to instantiate the wire-server-metrics chart there?

akshaymankar · 2022-05-23T09:46:37Z

Are we fine with just stopping to instantiate the wire-server-metrics chart there?

We added metrics there because it was hard for us to figure out what was wrong when things didn't work. So, I would say maybe this is still needed, but best ask people working on federation these days. /cc @smatting @pcapriotti @mdimjasevic @stephen-smith

smatting · 2022-05-23T10:30:07Z

So, I would say maybe this is still needed, but best ask people working on federation these days. /cc @smatting @pcapriotti @mdimjasevic @stephen-smith

I don't think we have used these metrics for any debugging yet. @supersven have you made use of them while debugging calling? If not I think it's fine to remove it, wdyt?

Once there are ServiceProbes, a default kube-prometheus stack, or grafana-agent should be able to scrape these metrics

Would this also be eventually available for the federation environments?

jschaul · 2022-05-23T10:35:14Z

The wire-server-metrics chart works fine as it is right now, and has no dependencies on other things, it even has up-to-date docs in https://docs.wire.com/how-to/install/monitoring.html

So I don't see any reason to break a so-far working chart without providing alternatives. If things are temporarily broken for less than 1 week and you intend to restore the behaviour of wire-server-metrics chart working as before; then I have no issue with this PR. If the intention is to make a whole new monitoring capability at some point in the far future and in the meantime things will be broken on all environments that use wire-server-metrics; then I'm not a fan. I'm not familiar enough with service probes and your overall intention to understand what you'd like to do; please explain.

supersven · 2022-05-23T12:23:10Z

I don't think we have used these metrics for any debugging yet. @supersven have you made use of them while debugging calling? If not I think it's fine to remove it, wdyt?

Nope. Debugging calling is currently more looking at logs and reasoning about Helm charts.

flokli · 2022-05-23T15:46:26Z

I guess we can just keep this PR open until it also adds ServiceProbes, which provides an alternative.

flokli · 2022-06-09T14:06:44Z

I went through all services in /services, and checked if they expose metrics at /i/metrics.

federator and nginz don't expose metrics at /i/metrics.

For those that did, I created the necessary helm file to create a ServiceMonitor resource, which marks the endpoint for scraping to a prometheus-operator (or something else looking for these CRs, like grafana-agent-operator).

As the CRDs don't come shipped with Kubernetes out of the box, but they're usually installed while installing a monitoring operator, it's opt-in and disabled by default.

This aligns labels a bit more with how they look like in other deployments. In some cases, we were already setting the `app` label, too. There's one possible regression: The wire-server-metrics helm chart configured kube-prometheus-stack to automatically scrape everything with a wireService label at port http, path /i/metrics. This will be fixed in a followup, by adding ServiceProbe resources to each workload that exposes metrics.

flokli · 2022-06-10T11:28:55Z

Hmm, it seems it's not possible to modify the spec.selector.matchLabels field of a Deployment or StatefulSet resource, so this might require deleting and recreating these resources.

flokli · 2022-06-14T09:16:33Z

For posterity: release notes to document the upgrade were added in #2472.

flokli requested review from akshaymankar and jschaul May 18, 2022 14:16

flokli temporarily deployed to cachix May 18, 2022 14:16 Inactive

jschaul approved these changes May 18, 2022

View reviewed changes

flokli marked this pull request as ready for review May 18, 2022 15:41

flokli force-pushed the charts-wireService-app branch from d73bbd7 to e582cb0 Compare June 8, 2022 15:21

flokli temporarily deployed to cachix June 8, 2022 15:21 Inactive

flokli temporarily deployed to cachix June 9, 2022 13:59 Inactive

flokli changed the title ~~charts/*: drop wireService label, use app= instead~~ charts/*: drop wireService label, use app= instead, add servicemonitor support Jun 9, 2022

flokli temporarily deployed to cachix June 9, 2022 14:20 Inactive

flokli requested review from smatting, supersven and stephen-smith June 9, 2022 14:20

flokli added 8 commits June 9, 2022 19:16

charts/brig: add servicemonitor support

5d8fb59

charts/cannon: add servicemonitor support

a7d8cd0

chart/cargohold: add servicemonitor support

ad83ce2

charts/galley: add servicemonitor support

9103b8b

charts/gundeck: add servicemonitor support

64529a9

charts/proxy: add servicemonitor support

a8b2c94

charts/spar: add servicemonitor support

b563ce9

changelog.d: add wireService label removal to changelog

bacf9c9

flokli force-pushed the charts-wireService-app branch from 44e3299 to bacf9c9 Compare June 9, 2022 17:16

flokli temporarily deployed to cachix June 9, 2022 17:16 Inactive

stephen-smith approved these changes Jun 9, 2022

View reviewed changes

flokli merged commit 46d5edb into develop Jun 10, 2022

flokli deleted the charts-wireService-app branch June 10, 2022 07:53

This was referenced Jun 14, 2022

Release 2022-06-14 - (expected chart version 4.14.0) #2478

Closed

Release 2022-06-14 - (expected chart version 4.14.0) #2482

Merged

This was referenced Jul 5, 2022

Release 2022-07-05 - (expected chart version 4.18.0) #2534

Closed

Release 2022-07-05 - (expected chart version 4.18.0) #2537

Merged

Release 2022-07-05 - (expected chart version 4.18.0) #2540

Closed

sysvinit mentioned this pull request Sep 6, 2022

coturn: refactor resource labels, expose ServiceMonitor for metrics endpoint #2677

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

charts/*: drop wireService label, use app= instead, add servicemonitor support #2413

charts/*: drop wireService label, use app= instead, add servicemonitor support #2413

flokli commented May 18, 2022 •

edited

jschaul left a comment

flokli commented May 18, 2022 •

edited

akshaymankar commented May 19, 2022

flokli commented May 19, 2022

akshaymankar commented May 23, 2022

smatting commented May 23, 2022 •

edited

jschaul commented May 23, 2022

supersven commented May 23, 2022

flokli commented May 23, 2022

flokli commented Jun 9, 2022 •

edited

flokli commented Jun 10, 2022

flokli commented Jun 14, 2022

charts/*: drop wireService label, use app= instead, add servicemonitor support #2413

charts/*: drop wireService label, use app= instead, add servicemonitor support #2413

Conversation

flokli commented May 18, 2022 • edited

Checklist

jschaul left a comment

Choose a reason for hiding this comment

flokli commented May 18, 2022 • edited

akshaymankar commented May 19, 2022

flokli commented May 19, 2022

akshaymankar commented May 23, 2022

smatting commented May 23, 2022 • edited

jschaul commented May 23, 2022

supersven commented May 23, 2022

flokli commented May 23, 2022

flokli commented Jun 9, 2022 • edited

flokli commented Jun 10, 2022

flokli commented Jun 14, 2022

flokli commented May 18, 2022 •

edited

flokli commented May 18, 2022 •

edited

smatting commented May 23, 2022 •

edited

flokli commented Jun 9, 2022 •

edited