Add staleness markers support #1526

hagen1778 · 2021-08-09T09:30:53Z

Is your feature request related to a problem? Please describe.
VictoriaMetrics and Prometheus detect staleness differently.

VictoriaMetrics calculates the staleness threshold from the interval between data point timestamps (or scrape intervals).
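
As a rough illustration of the interval-based idea (not VictoriaMetrics' actual algorithm), the sketch below estimates a series' typical interval as the median gap between consecutive timestamps. It treats the series as stale once the time since its last sample exceeds a multiple of that interval. The function names and the `factor` parameter are hypothetical and chosen only for this example.

```python
from statistics import median

def staleness_threshold(timestamps, factor=2.0):
    """Hypothetical helper: threshold = factor * median gap between samples (seconds)."""
    gaps = [b - a for a, b in zip(timestamps, timestamps[1:])]
    if not gaps:
        return None  # fewer than two samples: no interval can be inferred
    return factor * median(gaps)

def is_stale(timestamps, eval_time, factor=2.0):
    """Return True if the last sample is older than the inferred threshold."""
    threshold = staleness_threshold(timestamps, factor)
    if threshold is None:
        return False
    return eval_time - timestamps[-1] > threshold
```

With samples every 10 minutes, such a threshold adapts to the series, whereas a fixed 5-minute window would already consider the series stale between two regular samples.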

Prometheus staleness logic is the following:

if a scrape fails, then all time series from the previous scrape are marked as stale;
if there are no samples within the 5 minutes before the evaluation time, the time series is marked as stale.
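
The two rules above can be sketched as a single lookup. This is a simplified model, not Prometheus code: the staleness marker is represented by the NaN bit pattern `0x7ff0000000000002` that Prometheus uses for it, while the function names and the exact window boundary handling are assumptions made for the example.

```python
import struct

# Bit pattern of the Prometheus staleness marker (a NaN with a specific payload).
STALE_NAN_BITS = 0x7FF0000000000002
STALE_NAN = struct.unpack("<d", struct.pack("<Q", STALE_NAN_BITS))[0]
LOOKBACK_SECONDS = 300  # the 5-minute window described above

def is_stale_nan(value):
    """A staleness marker is identified by its exact bits, not by math.isnan()."""
    return struct.unpack("<Q", struct.pack("<d", value))[0] == STALE_NAN_BITS

def value_at(samples, eval_time, lookback=LOOKBACK_SECONDS):
    """samples: list of (timestamp, value) sorted by timestamp.
    Returns the value visible at eval_time, or None if the series is stale."""
    candidates = [(ts, v) for ts, v in samples
                  if eval_time - lookback < ts <= eval_time]
    if not candidates:
        return None  # no sample within the lookback window
    _, value = candidates[-1]
    if is_stale_nan(value):
        return None  # the latest sample is an explicit staleness marker
    return value
```

A series that stops without a marker stays visible for the whole lookback window, while a marker hides it immediately.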

VictoriaMetrics staleness detection differs from the Prometheus implementation for the following reasons:

staleness should work for all import protocols, and the Prometheus remote write protocol is only one of them;
data may be ingested or backfilled into VictoriaMetrics, so "scrape" logic won't work;
staleness must also be handled for series whose intervals between data points exceed 5 minutes.

However, because of these differences, query results for stale series may differ between Prometheus and VictoriaMetrics, which leads to discrepancies when VictoriaMetrics is used as remote storage.

Describe the solution you'd like
To improve compatibility with the Prometheus ecosystem, the VictoriaMetrics TSDB and vmagent should support staleness markers.

Describe alternatives you've considered
Manually modifying queries to account for staleness and resets.

waldoweng · 2021-08-09T09:35:43Z

so this feature is in development now?

hagen1778 · 2021-08-09T11:30:58Z

Not yet, but it has high priority.

Updates #1526 Updates #748 Updates #1509 Updates #1530 Updates #845

valyala · 2021-08-13T09:28:30Z

Support for Prometheus staleness markers has been added to VictoriaMetrics in the following commits:

Single-node: 4401464
Cluster version: c1f81f0

Please give it a try before it is included in the next release of VictoriaMetrics. See the build instructions for single-node and the build instructions for the cluster version.

If the data is ingested into VictoriaMetrics by vmagent instead of Prometheus, then vmagent must also be built from commit 4401464 according to the build instructions, since previous versions of vmagent do not generate Prometheus staleness marks when scrape targets disappear.
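
To illustrate what generating staleness marks means here, the following minimal sketch compares the series seen in the previous scrape with those seen in the current one and emits a marker for each series that vanished. It is not vmagent code; `staleness_markers` and its arguments are hypothetical names.

```python
import struct

# Bit pattern of the Prometheus staleness marker.
STALE_NAN = struct.unpack("<d", struct.pack("<Q", 0x7FF0000000000002))[0]

def staleness_markers(previous_series, current_series, timestamp):
    """Return (series, timestamp, STALE_NAN) for each series that was present in
    the previous scrape but is missing now. Pass an empty current_series for a
    failed scrape or a removed target, so that every previous series is marked."""
    missing = sorted(set(previous_series) - set(current_series))
    return [(name, timestamp, STALE_NAN) for name in missing]
```

The markers are then sent to storage like ordinary samples, which lets the query side stop returning the series right away instead of waiting for a time-based threshold.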

…moveCounterResets functions Prometheus staleness marks shouldn't be changed in removeCounterResets. Otherwise they will be converted to ordinary NaN values, which cannot be removed later in the dropStaleNaNs() function. This may result in incorrect calculations for rollup functions. Updates #1526
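
The ordering problem described in this commit message can be sketched as follows. The two functions are simplified stand-ins named after `removeCounterResets` and `dropStaleNaNs`, not the VictoriaMetrics implementations. The point is that a staleness marker is recognizable only by its exact bit pattern, so the reset-removal pass leaves it untouched for the later drop pass to find.

```python
import struct

STALE_NAN_BITS = 0x7FF0000000000002
STALE_NAN = struct.unpack("<d", struct.pack("<Q", STALE_NAN_BITS))[0]

def is_stale_nan(value):
    """Compare raw bits: an ordinary NaN has a different payload."""
    return struct.unpack("<Q", struct.pack("<d", value))[0] == STALE_NAN_BITS

def remove_counter_resets(values):
    """Make a counter monotonic by adding back the value lost at each reset.
    Staleness markers are passed through unchanged."""
    out, correction, prev = [], 0.0, None
    for v in values:
        if is_stale_nan(v):
            out.append(v)  # no arithmetic on the marker, so its bits survive
            continue
        if prev is not None and v < prev:
            correction += prev  # counter reset detected
        prev = v
        out.append(v + correction)
    return out

def drop_stale_nans(values):
    """Remove staleness markers before rollup calculations."""
    return [v for v in values if not is_stale_nan(v)]
```

For example, `drop_stale_nans(remove_counter_resets([1.0, 2.0, STALE_NAN, 1.0, 3.0]))` yields `[1.0, 2.0, 3.0, 5.0]`.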

This allows dropping staleness marks only once and then calculating multiple rollup functions on the result. Updates #1526

valyala · 2021-08-15T21:57:43Z

VictoriaMetrics and vmagent gained support for Prometheus staleness markers starting from v1.64.0. Closing the feature request as done.

… scrapers for the added targets This should prevent possible time series overlap when an old target is replaced by a new target (for example, during Kubernetes deployments). Updates #1526 Updates #1530 Updates #748 Updates #1509

… tracking is enabled for metrics from deleted / disappeared scrape targets Store the scraped response body instead of storing the parsed and relabeled metrics. This should reduce memory usage, since the response body takes less memory than the parsed and relabeled metrics. This is especially true for Kubernetes service discovery, which adds many long labels to all the scraped metrics. This should also reduce CPU usage, since marshaling of the parsed and relabeled metrics has been replaced by copying the response body. Updates #1526
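
The trade-off in this commit message can be pictured with a small sketch: keep only the raw body of the last scrape per target, and parse it only when the target disappears. The `TargetState` class is hypothetical, the parsing assumes simple `series value` lines without timestamps, and relabeling (which real code would have to reapply at that point) is omitted.

```python
import struct

# Bit pattern of the Prometheus staleness marker.
STALE_NAN = struct.unpack("<d", struct.pack("<Q", 0x7FF0000000000002))[0]

class TargetState:
    """Hypothetical per-target state: only the raw body of the last scrape is kept."""

    def __init__(self):
        self.last_body = ""

    def on_scrape(self, body):
        # Storing the body is cheap; no parsed or relabeled metrics are retained.
        self.last_body = body

    def on_target_removed(self, timestamp):
        """Parse the stored body lazily and emit one staleness marker per series."""
        markers = []
        for line in self.last_body.splitlines():
            line = line.strip()
            if not line or line.startswith("#"):
                continue  # skip blank lines and HELP/TYPE comments
            series = line.rsplit(" ", 1)[0]  # assumes "series value", no timestamp
            markers.append((series, timestamp, STALE_NAN))
        return markers
```

Parsing happens once per disappeared target rather than on every scrape, which is where the memory and CPU savings mentioned above would come from.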

…ics shutdown if -selfScrapeInterval > 0 Updates #943 Updates #1526

hagen1778 added the enhancement New feature or request label Aug 9, 2021

This was referenced Aug 9, 2021

query results may incorrectly overlap time series #748

Closed

Query_range wrong last value #1509

Closed

valyala mentioned this issue Aug 11, 2021

Last value in graph is wrong (query_range/maxStepForPointsAdjustment) #1442

Closed

hagen1778 mentioned this issue Aug 13, 2021

Target down but count function returns value #1530

Closed

valyala added a commit that referenced this issue Aug 13, 2021

all: add support for Prometheus staleness markers

4401464

Updates #1526 Updates #748 Updates #1509 Updates #1530 Updates #845

valyala added a commit that referenced this issue Aug 13, 2021

all: add support for Prometheus staleness markers

c1f81f0

Updates #1526 Updates #748 Updates #1509 Updates #1530 Updates #845

This was referenced Aug 14, 2021

vmselect query and query_range still return samples till 5min after target had been remove from vmagent #1025

Closed

count() is inflated for large step durations #876

Open

VM count differs from prometheus #365

Closed

valyala added a commit that referenced this issue Aug 15, 2021

app/vmselect/promql: drop staleness marks before calling rollupConfig.Do

5420c3d

This allows dropping staleness marks only once and then calculating multiple rollup functions on the result. Updates #1526

valyala added a commit that referenced this issue Aug 15, 2021

app/vmselect/promql: drop staleness marks before calling rollupConfig.Do

113f0a8

This allows dropping staleness marks only once and then calculating multiple rollup functions on the result. Updates #1526

This was referenced Aug 15, 2021

vmselect unreasonable duplicate time series error #1501

Closed

Some metrics are late on Victoria query endpoint #1484

Closed

valyala closed this as completed Aug 15, 2021

hagen1778 mentioned this issue Aug 30, 2021

vmalert: plotting a recorded metric along with the original in grafana requires a offset for them to match #1232

Closed

valyala mentioned this issue Sep 20, 2021

Upgrading v1.63.0 -> v1.65.0 leads to unexpeced huge datapoint #1628

Closed

changshun-shi mentioned this issue Dec 20, 2023

why two function for default_roll_up and last_over_time #5499

Closed


valyala added a commit that referenced this issue Feb 8, 2024

app/victoria-metrics: properly send staleness markers on victoriametr…

a354924

…ics shutdown if -selfScrapeInterval > 0 Updates #943 Updates #1526

valyala added a commit that referenced this issue Feb 8, 2024

app/victoria-metrics: properly send staleness markers on victoriametr…

582d431

…ics shutdown if -selfScrapeInterval > 0 Updates #943 Updates #1526

valyala added a commit that referenced this issue Feb 8, 2024

app/victoria-metrics: properly send staleness markers on victoriametr…

7fef8ba

…ics shutdown if -selfScrapeInterval > 0 Updates #943 Updates #1526

valyala added a commit that referenced this issue Feb 8, 2024

app/victoria-metrics: properly send staleness markers on victoriametr…

a2a218e

…ics shutdown if -selfScrapeInterval > 0 Updates #943 Updates #1526
