-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
metrics support #715
metrics support #715
Conversation
aa94b63
to
833440c
Compare
Force-pushed to trigger CI testing. |
bc7a4e5
to
f545758
Compare
f545758
to
9028e12
Compare
9028e12
to
ed78869
Compare
Those date back to the very beginning of the project and were used as a reminder for content that might be missing. We have added everything over time, so they no longer serve any real purpose.
8c34c9f
to
8cf3d52
Compare
@avalluri this PR is ready, I finished all pending TODOs. |
docs/install.md
Outdated
<!-- FILL TEMPLATE: | ||
### Metrics support | ||
|
||
:bangbang: | Metric support is an alpha feature. What data is provided may change. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This works on GitHub (not surpising, I got this from github/markup#887) but not in Sphinx (just shows :bangbang:
):
@intelkevinputnam : do you have a suggestion how we can mark up warnings in our documentation?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll merge without this warning and instead moved it into an issue: #736
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good.
The Prometheus integration uses the approach from helm/charts#22899: - HTTP for metrics endpoints - container ports tell Prometheus which containers to scrape and how CSI call counts are the same as in the sidecars. This enables correlating statistics and ensures that also node-local operations are captured; kubelet doesn't seem to be instrumented. Internal communication is instrumented the same way. PMEM usage statistics are recorded by querying the active device manager each time the metrics data gets scraped. The metrics support is enabled unconditionally in the operator and all pre-generated deployment files and use plain HTTP for the sake of simplicity. This is based on the rationale that the data itself is not sensitive and should always be readily available if desired.
8cf3d52
to
ef274da
Compare
The Prometheus integration uses the approach from
helm/charts#22899:
- HTTP for metrics endpoints
- container ports tell Prometheus which containers to scrape
and how
CSI call counts are the same as in the sidecars. This enables
correlating statistics and ensures that also node-local operations are
captured; kubelet doesn't seem to be instrumented.
Internal communication is instrumented the same way.
PMEM usage statistics are recording by querying the device manager
each time the metrics data gets scraped.
The metrics support is enabled unconditionally in the operator and all
pre-generated deployment files and use plain HTTP for the sake of
simplicity. This is based on the rationale that the data itself is
not sensitive and should always be readily available if desired.
TODO: