metrics support #715

pohly · 2020-08-26T11:31:54Z

The Prometheus integration uses the approach from
helm/charts#22899:
- HTTP for metrics endpoints
- container ports tell Prometheus which containers to scrape
and how

CSI call counts are the same as in the sidecars. This enables
correlating statistics and ensures that also node-local operations are
captured; kubelet doesn't seem to be instrumented.

Internal communication is instrumented the same way.

PMEM usage statistics are recording by querying the device manager
each time the metrics data gets scraped.

The metrics support is enabled unconditionally in the operator and all
pre-generated deployment files and use plain HTTP for the sake of
simplicity. This is based on the rationale that the data itself is
not sensitive and should always be readily available if desired.

TODO:

tests for metrics data
get metrics: support usage inside CSI driver kubernetes-csi/csi-lib-utils#49 merged and use a new csi-lib-utils release

pohly · 2020-08-26T17:39:37Z

Force-pushed to trigger CI testing.

Those date back to the very beginning of the project and were used as a reminder for content that might be missing. We have added everything over time, so they no longer serve any real purpose.

pohly · 2020-09-11T13:47:27Z

@avalluri this PR is ready, I finished all pending TODOs.

pohly · 2020-09-16T10:27:14Z

docs/install.md

-<!-- FILL TEMPLATE:
+### Metrics support
+
+:bangbang: | Metric support is an alpha feature. What data is provided may change.


This works on GitHub (not surpising, I got this from github/markup#887) but not in Sphinx (just shows :bangbang:):

https://cloudnative-k8sci.southcentralus.cloudapp.azure.com/job/pmem-csi/job/PR-715/Doc_20Site/docs/install.html#metrics-support

@intelkevinputnam : do you have a suggestion how we can mark up warnings in our documentation?

I'll merge without this warning and instead moved it into an issue: #736

avalluri

Looks good.

The Prometheus integration uses the approach from helm/charts#22899: - HTTP for metrics endpoints - container ports tell Prometheus which containers to scrape and how CSI call counts are the same as in the sidecars. This enables correlating statistics and ensures that also node-local operations are captured; kubelet doesn't seem to be instrumented. Internal communication is instrumented the same way. PMEM usage statistics are recorded by querying the active device manager each time the metrics data gets scraped. The metrics support is enabled unconditionally in the operator and all pre-generated deployment files and use plain HTTP for the sake of simplicity. This is based on the rationale that the data itself is not sensitive and should always be readily available if desired.

pohly force-pushed the metrics-enhancements branch from aa94b63 to 833440c Compare August 26, 2020 17:39

pohly force-pushed the metrics-enhancements branch 2 times, most recently from bc7a4e5 to f545758 Compare August 27, 2020 06:23

pohly requested a review from avalluri August 27, 2020 06:41

pohly force-pushed the metrics-enhancements branch from f545758 to 9028e12 Compare August 27, 2020 13:44

pohly force-pushed the metrics-enhancements branch from 9028e12 to ed78869 Compare September 10, 2020 09:03

docs: remove FILL TEMPLATE comments

2d7facd

Those date back to the very beginning of the project and were used as a reminder for content that might be missing. We have added everything over time, so they no longer serve any real purpose.

pohly force-pushed the metrics-enhancements branch 2 times, most recently from 8c34c9f to 8cf3d52 Compare September 11, 2020 13:45

pohly changed the title ~~WIP: metrics support~~ metrics support Sep 11, 2020

pohly assigned avalluri Sep 11, 2020

pohly commented Sep 16, 2020

View reviewed changes

avalluri approved these changes Sep 18, 2020

View reviewed changes

pohly force-pushed the metrics-enhancements branch from 8cf3d52 to ef274da Compare September 18, 2020 08:48

pohly mentioned this pull request Sep 18, 2020

markup for warnings #736

Closed

pohly merged commit bd0feaa into intel:devel Sep 18, 2020

pohly mentioned this pull request Sep 29, 2020

extend, document and test metrics support #666

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metrics support #715

metrics support #715

pohly commented Aug 26, 2020 •

edited

Loading

pohly commented Aug 26, 2020

pohly commented Sep 11, 2020

pohly Sep 16, 2020 •

edited

Loading

pohly Sep 18, 2020

avalluri left a comment

metrics support #715

metrics support #715

Conversation

pohly commented Aug 26, 2020 • edited Loading

pohly commented Aug 26, 2020

pohly commented Sep 11, 2020

pohly Sep 16, 2020 • edited Loading

Choose a reason for hiding this comment

pohly Sep 18, 2020

Choose a reason for hiding this comment

avalluri left a comment

Choose a reason for hiding this comment

pohly commented Aug 26, 2020 •

edited

Loading

pohly Sep 16, 2020 •

edited

Loading