Add Prometheus metrics to count ANP and ACNP Status updates #1801

antoninbas · 2021-01-29T22:26:21Z

Status updates that are too frequent could generate too many versions of
the custom resource (ANP or ACNP object), which would need to be stored in
etcd until the next compaction by kube-apiserver. Too many updates could
also cause fragmentation of the database. It is useful to have access to
the number of updates over time in production clusters.

jianjuns

I am OK with the change. I feel people might not understand what these metrics are for, but there is probably no harm in adding them either.

abhiraut · 2021-01-30T00:45:01Z

Does an update to the Status sub-resource generate a new version? I thought the generation count remained the same.
But maybe it still updates the resourceVersion even if it does not generate a new resource.

antoninbas · 2021-01-30T01:03:56Z

> does update to the Status sub-resource generate a new version? i thought the generation count remained same.
> but maybe it still updates the resourceVersion even if it does not generate a new resource.

I believe ResourceVersion is incremented and there is definitely a write to etcd, but we can wait for @tnqn to confirm and comment on the usefulness of this PR

tnqn · 2021-02-03T10:14:40Z

> > does update to the Status sub-resource generate a new version? i thought the generation count remained same.
> > but maybe it still updates the resourceVersion even if it does not generate a new resource.
>
> I believe ResourceVersion is incremented and there is definitely a write to etcd, but we can wait for @tnqn to confirm and comment on the usefulness of this PR

Yes, generation is different from resourceVersion. Any update to any field leads to a new version and a write to etcd. Although the original issue was not caused by status updates, these metrics can help rule that out quickly and, more generally, help us understand the overhead of the status manager, so I'm good with adding them.
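The distinction can be illustrated with a toy model. This is only a sketch: `object` and `store` are hypothetical types, not Kubernetes APIs, and the real resourceVersion is an opaque string rather than an integer. The model assumes a resource with the status subresource enabled, where a spec change bumps generation while a status change does not, and where every write is assigned a new revision by the store.

```go
package main

import "fmt"

// object is a toy stand-in for a custom resource with the status
// subresource enabled. It is not a real Kubernetes type.
type object struct {
	Spec            string
	Status          string
	Generation      int64
	ResourceVersion uint64
}

// store models the one property discussed here: every write gets a new
// store-wide revision, which becomes the object's resourceVersion.
type store struct {
	revision uint64
	writes   int
}

// commit records one write and stamps the object with the new revision.
func (s *store) commit(o *object) {
	s.revision++
	s.writes++
	o.ResourceVersion = s.revision
}

// UpdateSpec changes the desired state, so generation is incremented.
func (s *store) UpdateSpec(o *object, spec string) {
	o.Spec = spec
	o.Generation++
	s.commit(o)
}

// UpdateStatus changes only the observed state: generation is untouched,
// but the write still produces a new revision.
func (s *store) UpdateStatus(o *object, status string) {
	o.Status = status
	s.commit(o)
}

func main() {
	s := &store{}
	o := &object{Generation: 1}
	s.commit(o) // initial create
	s.UpdateSpec(o, "allow-dns")
	s.UpdateStatus(o, "Realizing")
	s.UpdateStatus(o, "Realized")
	fmt.Printf("generation=%d resourceVersion=%d writes=%d\n",
		o.Generation, o.ResourceVersion, s.writes)
	// prints: generation=2 resourceVersion=4 writes=4
}
```

In this model, two status updates leave generation at 2 but account for half of the four writes, which is the kind of overhead the counters are meant to make visible.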

tnqn

LGTM

antoninbas · 2021-02-03T21:06:14Z

Thanks for the feedback @tnqn

antoninbas · 2021-02-03T21:06:22Z

/test-all

codecov-io · 2021-02-03T21:52:49Z

Codecov Report

❗ No coverage uploaded for pull request base (main@57dcaec).
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main    #1801   +/-   ##
=======================================
  Coverage        ?   26.49%           
=======================================
  Files           ?      179           
  Lines           ?    15169           
  Branches        ?        0           
=======================================
  Hits            ?     4019           
  Misses          ?    10625           
  Partials        ?      525

| Flag | Coverage Δ |
|---|---|
| e2e-tests | `26.49% <0.00%> (?)` |

Flags with carried forward coverage won't be shown.

vmwclabot added the cla-not-required label Jan 29, 2021

antoninbas requested review from tnqn and jianjuns January 29, 2021 22:26

jianjuns approved these changes Jan 29, 2021

abhiraut approved these changes Jan 30, 2021

tnqn approved these changes Feb 3, 2021

antoninbas merged commit 9a8939a into antrea-io:main Feb 3, 2021

antoninbas deleted the add-prometheus-metrics-to-count-anp-and-acnp-status-updates branch February 3, 2021 22:29
