how to turn off timestamps in exported metrics #2526

Open · muzammil360 opened this issue Apr 29, 2020 · 22 comments

@muzammil360

Hi. I am using cAdvisor with Pushgateway. cAdvisor exports timestamps along with the metrics, which is a problem because Pushgateway doesn't accept metrics with timestamps. The following error is returned if I push the metrics using curl:

pushed metrics are invalid or inconsistent with existing metrics: pushed metrics must not have timestamps

Pushgateway doesn't seem to accept timestamps at all. Is there any way I can turn them off in cAdvisor? Alternatively, an option that makes Pushgateway ignore the timestamps would also be acceptable.

NOTE: I know that pushing cAdvisor metrics to Pushgateway is an anti-pattern, but for the time being I don't have much of an option.
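
For anyone hitting the same error, a rough workaround (not an official cAdvisor or Pushgateway feature) is to strip the trailing timestamps before pushing. A minimal sketch, assuming cAdvisor listens on localhost:8080 and Pushgateway on localhost:9091; hosts, ports, and the job name are placeholders:

```sh
# Scrape cAdvisor, crudely drop the trailing epoch-millisecond timestamp from each
# sample line (assumes a trailing 13-digit field is always the timestamp),
# then push the result to Pushgateway under an arbitrary job name.
curl -s http://localhost:8080/metrics \
  | sed -E 's/ [0-9]{13}$//' \
  | curl --data-binary @- http://localhost:9091/metrics/job/cadvisor
```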

@dashpole
Collaborator

Removing timestamps makes many metrics unusable, since we collect metrics out-of-band. I don't think we should support turning off timestamps.

@muzammil360
Author

muzammil360 commented Apr 29, 2020 via email

@dashpole
Collaborator

I'm not aware of any ways to make that work.

@SuperQ

SuperQ commented Oct 1, 2020

Timestamps in Prometheus metrics cause more problems than just the Pushgateway one. For example, exposing timestamps breaks staleness handling, which causes containers that have been removed to remain visible in the data for 5 minutes.

I don't know why you think the metrics are "unusable" without timestamps. For Prometheus monitoring, we expect metrics without timestamps in almost every use case. A scraped sample is meant to represent "when Prometheus last saw this data".

Exposing timestamps is causing problems for Kubernetes users.

CC @paulfantom @brian-brazil @roidelapluie @brancz

@dashpole
Collaborator

dashpole commented Oct 1, 2020

At scrape time, the metrics returned may be up to 15 seconds old. Rates (e.g. CPU usage) didn't really work without timestamps.

We might be able to drop timestamps if the metrics are collected "on demand", which was added a few releases back: #1989.

@SuperQ

SuperQ commented Oct 1, 2020

On-demand collection would be much preferred, but exposing metrics without timestamps, even if they are a few minutes stale, is just fine. I'm not sure what you did to determine that this "didn't work", but what cAdvisor is doing right now is much worse.

Exposing timestamps is a violation of Prometheus metrics best practices and should not be done. The linked Kubernetes operator issue describes this well.

@dashpole
Collaborator

dashpole commented Oct 1, 2020

See #2059. CPU rates can be inaccurate by ±50%, which most users consider "unusable".

Can you link to the best practices documentation which states that timestamps should never be done?

@SuperQ

SuperQ commented Oct 1, 2020

Pre-computed rates shouldn't be exposed on a Prometheus endpoint. They're not useful for Prometheus users. cAdvisor should only expose the raw container CPU counters.

I will try and find the documentation on timestamps.

@dashpole
Collaborator

dashpole commented Oct 1, 2020

These aren't precomputed. These are the raw CPU counters. That is exactly the problem. If the "real" time at which a counter is collected differs significantly from the scrape time, the rate won't be correct. For example, if we are collecting (in the background) and caching:

t0: 0
t10: 10
t25: 25

If we then scrape at t0, t9, t11, t24, t26 (I know Prometheus scrapes at regular intervals, but this problem still occurs, just less dramatically), we get rates:

t0-t9: 0/9 = 0
t9-t11: 10/2 = 5
t11-t24: 0/13 = 0
t24-t26: 15/2 = 7.5

The correct rate is 1 for the entire interval, but Prometheus would graph numbers that are dramatically incorrect.
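
To make the skew concrete, here is a small sketch (using the assumed collection times t=0, 10, 25 and scrape times t=0, 9, 11, 24, 26 from the example above) that reproduces the apparent rates a consumer would compute without timestamps:

```sh
awk 'BEGIN {
  ct[1] = 0;  cv[1] = 0        # background collections: time -> counter value
  ct[2] = 10; cv[2] = 10
  ct[3] = 25; cv[3] = 25
  split("0 9 11 24 26", s)     # scrape times
  for (i = 1; i <= 5; i++) {   # value served at each scrape = latest cached sample
    for (j = 1; j <= 3; j++) {
      if (ct[j] <= s[i]) v[i] = cv[j]
    }
  }
  for (i = 2; i <= 5; i++) {   # rate computed over the scrape interval, ignoring timestamps
    printf "t%d-t%d: apparent rate = %.1f (true rate 1.0)\n", s[i-1], s[i], (v[i] - v[i-1]) / (s[i] - s[i-1])
  }
}'
```

With the collection timestamps attached, the stored samples are simply (t0, 0), (t10, 10), (t25, 25), and the computed rate is 1 throughout.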

From my understanding, it is a best practice to perform collection at scrape time, and thus not expose timestamps. However, given that we do not perform collection at scrape time, it seems like we must attach timestamps so that rate computations are correct. Collecting all metrics at once causes problems when running a non-trivial number of containers (e.g. 100), which is why we don't do that by default. However, we did recently add the ability to trigger collection at scrape time. For users that are running low pod density, this could be a good option, and we could remove timestamps in that case.

Keep in mind that the Prometheus server isn't the only consumer of cAdvisor metrics. Not attaching timestamps to cached metrics would break rate calculations for all backends, so doing that across the board doesn't seem like a viable option.

@SuperQ

SuperQ commented Oct 1, 2020

I will have to look at the other issue more closely, but what you're describing is not how Prometheus does calculations.

I will have to go over the linked issue, but the conclusions in the linked PR are incorrect. There is not enough information in that PR to show what's really going on. They have one graph with 6 hours of data and one with 1 hour of data, which means the default view has a step of 14 seconds in the 1-hour view and 86 seconds in the 6-hour view.

My first guess with #2059, based on what they're showing, is that they've configured a scrape interval of 30 seconds. That will lead to the weird ±50% artifacts when the collection interval is mismatched with the scrape interval. Then, when you combine 14-second steps with Prometheus rate extrapolation, you're going to see this ±50% problem.

Basically, they've got a self-induced problem, and neither cAdvisor nor Prometheus is the cause.

@dashpole
Collaborator

dashpole commented Oct 1, 2020

Another thing that is relevant here is that cAdvisor jitters the interval to spread out load. Collection occurs every 10-15 seconds. I'm not entirely sure if that matters for this problem.

@SuperQ

SuperQ commented Oct 1, 2020

> From my understanding, it is a best practice to perform collection at scrape time, and thus not expose timestamps.

Yes, you're very right about this. The best practice is to collect at scrape time.

> Not attaching timestamps to cached metrics would break rate calculations for all backends

No, this is not likely a problem.

> Collection occurs every 10-15 seconds. I'm not entirely sure if that matters for this problem.

No, this shouldn't be a problem.

tigrannajaryan pushed a commit to open-telemetry/opentelemetry-collector that referenced this issue Oct 21, 2020
Adds a `send_timestamps` option to the prometheus exporter to allow it to send scrape timestamps.

By default, when scraping, Prometheus records data assuming that the presented sample is from the instant of the scrape. For data sources that cache the underlying information and do not refresh on scrape, this can lead to metric samples being recorded at the wrong timestamp. For example, cAdvisor caches for many seconds (4-20 in our experience), so a sample taken "now" may actually be a sample from 20s ago.

To handle this situation, the exposition format allows an exporter to advise the Prometheus server of the timestamp of the underlying scrape. OpenTelemetry is aware of the timestamp of the scrape.

This change adds an option to have OpenTelemetry send the timestamps of underlying samples out with the Prometheus exporter.

Visually, the image shows the existing behavior prior to 9.45am and the behavior with `send_timestamps: true` set from 9.45am onwards. These are metrics for a job using a single CPU.
![image](https://user-images.githubusercontent.com/3196528/95892425-62fe5a00-0d3b-11eb-9023-af8e59652157.png)

**Related issues:**
google/cadvisor#2526
orijtech/prometheus-go-metrics-exporter#11

**Testing:**
Test cases have been added. In addition, for e2e test, see screenshot from our environment above.

**Documentation:**
The `prometheusexporter` README has been updated.
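
For reference, enabling this in an OpenTelemetry Collector configuration looks roughly like the sketch below; the endpoint value is a placeholder, and the `prometheusexporter` README documents the full option set:

```yaml
exporters:
  prometheus:
    endpoint: "0.0.0.0:8889"   # placeholder listen address for the exported metrics
    send_timestamps: true      # forward the original sample timestamps to scrapers
```
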
@Wayde2014

I ran into the same problem. Is there a solution for it now?

Does cAdvisor support turning off timestamps in exported metrics?

@muzammil360

@muzammil360
Author

@Wayde2014 No, I didn't find anything else. But that was about two years ago; cAdvisor might have improved considerably since then. I ended up running a regexp and dropping the time field (as far as I can remember).

@muzammil360
Author

@Wayde2014 Actually, if you look just above your comment, it seems jasonk000 attempted to fix it here:

[screenshot of the commit referenced above]

@Wayde2014

Wayde2014 commented Feb 15, 2022

Thanks for the reply.

Do you still remember how the regular expression was written? Thank you very much.

Below is the command and its execution result:

$ curl -k -m 5 --header "Authorization: Bearer ${TOKEN}" "https://localhost:10250/metrics/cadvisor"
......
container_cpu_load_average_10s{container="",id="/",image="",name="",namespace="",pod=""} 0 1644893651118
container_cpu_load_average_10s{container="",id="/kubepods",image="",name="",namespace="",pod=""} 0 1644893653633
container_cpu_load_average_10s{container="",id="/kubepods/besteffort",image="",name="",namespace="",pod=""} 0 1644893651525
container_cpu_load_average_10s{container="",id="/kubepods/besteffort/pod00b3e4fe-cebd-47a5-8d8c-a2ffbccc146c",image="",name="",namespace="baetyl-edge",pod="idp-web-546d968bb4-4k7fj"} 0 1644893644069
......

@Wayde2014

> @Wayde2014 Actually, if you look just above your comment, it seems jasonk000 attempted to fix it here:

I have seen these, and I have searched online for nearly 2 days, but there is still no good solution.

Thank you very much for your help.

@muzammil360
Author

> Thank you very much for your help.

@Wayde2014 I don't exactly remember. I recall working out the pattern of the timestamps I wanted to drop: my regexp looked for a long run of digits, e.g. 1644893653633. You might want to match only timestamps between today and, say, 100 years from now, which limits the possible leading digits.

I should add that I didn't care much about compute cost for my application; consider it for yours.
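
A minimal sketch of the kind of pattern described, assuming the timestamp is always the last field on a sample line and is a millisecond epoch (host and port are placeholders):

```sh
# " [1-4][0-9]{12}$" matches a trailing 13-digit epoch-millisecond timestamp
# roughly between the years 2001 and 2128, so ordinary metric values are
# unlikely to be stripped by accident.
curl -s http://localhost:8080/metrics | sed -E 's/ [1-4][0-9]{12}$//'
```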

@kiuber

kiuber commented Jul 26, 2022

@muzammil360 Hi, would you share your regexp rule? Thank you.

@kiuber

kiuber commented Jul 26, 2022

`curl ${cadvisor_host}:${cadvisor_port}/metrics | sed -r 's/(} .*) ([0-9]*)/\1/'` works for me.

> @muzammil360 Hi, would you share your regexp rule? Thank you.

@Gypsying

> `curl ${cadvisor_host}:${cadvisor_port}/metrics | sed -r 's/(} .*) ([0-9]*)/\1/'` works for me.
>
> @muzammil360 Hi, would you share your regexp rule? Thank you.

Thanks, that works for me too.

@brancz
Contributor

brancz commented Jul 11, 2023

I actually think this can be closed on the cAdvisor side, since Prometheus supports `honor_timestamps`. That way users can choose for themselves whether to honor cAdvisor's timestamps or use the scrape time instead.
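
For reference, `honor_timestamps` is a per-job setting in the Prometheus scrape configuration; a minimal sketch, with the job name and target as placeholders:

```yaml
scrape_configs:
  - job_name: cadvisor
    honor_timestamps: false    # ignore the timestamps cAdvisor attaches; record samples at scrape time
    static_configs:
      - targets: ['cadvisor:8080']
```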
