Update metric instrumentation guide #195

fabxc · 2016-12-19T16:21:40Z

This extends the simple quick start guide with links to reference documentation and reiterates the most important points, in the hope of unifying the instrumentation approach across components over time.

The contents were discussed in the last @kubernetes/sig-instrumentation-misc meeting, where it was decided to add them here.

piosz

Looks reasonable to me. Can someone who attended the meeting and participated in the discussion take a look?

piosz · 2017-01-03T13:36:49Z

contributors/devel/instrumentation.md

 first calling WithLabelValues if your metric has any labels

   https://github.com/kubernetes/kubernetes/blob/3ce7fe8310ff081dbbd3d95490193e1d5250d2c9/pkg/kubelet/kubelet.go#L1384
   https://github.com/kubernetes/kubernetes/blob/cd3299307d44665564e1a5c77d0daa0286603ff5/pkg/apiserver/apiserver.go#L87


How about including short code snippets here instead of linking to the source code?

piosz · 2017-01-03T13:39:09Z

contributors/devel/instrumentation.md

+behavior but abstract system state, such as desired replicas for a deployment.
+They are not directly instrumented but collected from otherwise exposed data.
+
+In Kubernetes they are generally capture in the kube-state-metrics repository,


Maybe component instead of repository?

piosz · 2017-01-03T13:48:23Z

contributors/devel/instrumentation.md

+for a label at instrumentation time.
+
+Notable exceptions are exporters like kube-state-metrics, which expose per-pod
+or per-deployment metrics, which are theoretically unbound. However, they have


They are pretty well bounded: 30 pods per node, 2000 (5000) nodes.

At any point in time, yes. But not over time. Will make that clearer.
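
The distinction in this exchange can be made concrete with a little arithmetic. The sketch below is only an illustration: the 30 pods per node and the 2000 and 5000 node counts come from the comment above, while the function names and the turnover count of 10 are hypothetical and assume that every replacement pod gets a new name.

```go
package main

import "fmt"

// liveSeriesBound returns the maximum number of per-pod series that can
// exist at one instant for a single metric.
func liveSeriesBound(podsPerNode, nodes int) int {
	return podsPerNode * nodes
}

// cumulativeSeries estimates how many distinct per-pod series have been seen
// after the given number of full pod turnovers. It assumes (hypothetically)
// that every replacement pod gets a new name and therefore a new series.
func cumulativeSeries(live, turnovers int) int {
	return live * (1 + turnovers)
}

func main() {
	live := liveSeriesBound(30, 2000)
	fmt.Println(live)                       // 60000
	fmt.Println(liveSeriesBound(30, 5000))  // 150000
	fmt.Println(cumulativeSeries(live, 10)) // 660000
}
```

The instantaneous bound stays fixed for a given cluster size, whereas the cumulative count keeps growing with churn, which is the sense in which such metrics are unbounded over time.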

DirectXMan12

some minor nits and missing links

DirectXMan12 · 2017-01-03T18:29:02Z

contributors/devel/instrumentation.md

+
+In general, “external” labels like pod or node name do not belong into the
+instrumentation itself. They are to be attached to metrics by the collecting
+system that has the external knowledge. (blog post)


did you mean to put a link here?

Added, thanks.

DirectXMan12 · 2017-01-03T18:30:07Z

contributors/devel/instrumentation.md

+## Normalization
+
+Metrics should be normalized with respect to their dimensions. They should
+expose the minimal set of labels everyone of which provides additional information.


`set of labels everyone of which` should be `expose the minimal set of labels, every one of which` or `expose the minimal set of labels, each of which`

DirectXMan12 · 2017-01-03T18:30:46Z

contributors/devel/instrumentation.md

+```
+
+This however only caters to one specific query use case. There are many more
+meta infos that could be added effectively blowing up the instrumentation.


meta infos --> pieces of metadata

that could be added effectively --> that could be added, effectively

DirectXMan12 · 2017-01-03T18:31:26Z

contributors/devel/instrumentation.md

+They are also not guaranteed to be stable over time. What if pods at some
+point can be live migrated?
+Those pieces of information should be normalized into an info-level metric
+(blog post), which is always set to 1. For example:


fabxc

All comments addressed. PTAL.

fabxc · 2017-01-09T08:24:59Z

contributors/devel/instrumentation.md

+for a label at instrumentation time.
+
+Notable exceptions are exporters like kube-state-metrics, which expose per-pod
+or per-deployment metrics, which are theoretically unbound. However, they have


At any point in time, yes. But not over time. Will make that clearer.

fabxc · 2017-01-09T08:27:21Z

contributors/devel/instrumentation.md

+
+In general, “external” labels like pod or node name do not belong into the
+instrumentation itself. They are to be attached to metrics by the collecting
+system that has the external knowledge. (blog post)


Added, thanks.

DirectXMan12

one small nit, but in general LGTM 👍

DirectXMan12 · 2017-01-09T15:15:08Z

contributors/devel/instrumentation.md

+behavior but abstract system state, such as desired replicas for a deployment.
+They are not directly instrumented but collected from otherwise exposed data.
+
+In Kubernetes they are generally capture in the [kube-state-metrics](https://github.com/kubernetes/kube-state-metrics)


they are generally captured in the...

brancz · 2017-01-09T15:58:25Z

lgtm 👍

piosz · 2017-01-11T19:37:54Z

It seems there is no merge bot in this repo, so merging manually. Thanks for writing it down!

fgrzadkowski · 2017-05-31T13:39:34Z

contributors/devel/instrumentation.md

+   ```go
+    requestCounter = prometheus.NewCounterVec(
+      prometheus.CounterOpts{
+        Name: "apiserver_request_count",


Shouldn't this be "_total" according to https://prometheus.io/docs/practices/naming/#metric-names?
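
The convention referred to here, that accumulating counters carry a `_total` suffix, can be checked mechanically. The following is a minimal sketch using only the standard library; `hasCounterSuffix` is a hypothetical helper, not part of any Prometheus package, and `apiserver_request_total` is an illustrative name rather than one proposed in this PR.

```go
package main

import (
	"fmt"
	"strings"
)

// hasCounterSuffix reports whether a counter name ends in "_total", the
// suffix the Prometheus naming guidelines suggest for accumulating counts.
// This helper is hypothetical and exists only for illustration.
func hasCounterSuffix(name string) bool {
	return strings.HasSuffix(name, "_total")
}

func main() {
	fmt.Println(hasCounterSuffix("apiserver_request_count")) // false
	fmt.Println(hasCounterSuffix("apiserver_request_total")) // true
}
```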

k8s-ci-robot added the `cncf-cla: yes` label (indicates the PR's author has signed the CNCF CLA) Dec 19, 2016

piosz self-assigned this Jan 3, 2017

piosz reviewed Jan 3, 2017

DirectXMan12 suggested changes Jan 3, 2017

fabxc force-pushed the master branch from 0f3d19c to ae32a46 on January 9, 2017 08:30

fabxc commented Jan 9, 2017

DirectXMan12 approved these changes Jan 9, 2017

fabxc force-pushed the master branch from ae32a46 to 50d3859 on January 10, 2017 09:57

Update metric instrumentation guide

50d3859

piosz assigned DirectXMan12 Jan 11, 2017

piosz added the `lgtm` label ("Looks good to me", indicates that a PR is ready to be merged) Jan 11, 2017

piosz merged commit 2728a1e into kubernetes:master Jan 11, 2017

fgrzadkowski reviewed May 31, 2017

shyamjvs pushed a commit to shyamjvs/community that referenced this pull request Sep 22, 2017

Merge pull request kubernetes#195 from calebamiles/flake-report-eow-9

582587c

Create Flake Report for 2017-03-03


6 participants
