
compactor: adds downsample duration histogram #4552

Merged (7 commits, Aug 23, 2021)

Conversation


@vanugrah (Contributor) commented Aug 11, 2021

Signed-off-by: Anugrah Vijay anugrah.vijay@gmail.com

  • I added CHANGELOG entry for this change.
  • Change is not relevant to the end user.

Changes

  • Registers and implements the downsampleDuration histogram.
  • Introduces a variable for the downsample duration so time.Since is computed once and reused by both the log line and the metric.

Verification

  • Validated the compactor locally against a local MinIO bucket.

Signed-off-by: Anugrah Vijay <anugrah.vijay@gmail.com>
@vanugrah commented Aug 11, 2021

Based on @yeya24's feedback on the previous PR, I will be:

  • Adding the downsample resolution as a label to the histogram.
  • Increasing the starting bucket size.

Signed-off-by: Anugrah Vijay <anugrah.vijay@gmail.com>
@vanugrah

With respect to the histogram buckets, we could drop the initial few buckets and use 64, 128, 256, 512, 1024, 2048, 4096, 8192, or we could define more user-friendly intervals such as 1m, 5m, 15m, 30m, 1h, 2h, 4h. WDYT?

cmd/thanos/downsample.go: review comment (outdated, resolved)
Signed-off-by: Anugrah Vijay <anugrah.vijay@gmail.com>

@yeya24 (Contributor) commented Aug 11, 2021

Anything you still want to update, @vanugrah? If not, please mark this PR ready for review. 😄

@vanugrah

Bucket ranges? Currently they mimic the compactor's geometric series: 64, 128, 256, 512, 1024, 2048, 4096, 8192.
But changing that to recognizable intervals may be more user-friendly: 1m, 5m, 15m, 30m, 60m, 120m, 240m.


@yeya24 (Contributor) commented Aug 11, 2021

> Bucket ranges? Currently they mimic the compactor's geometric series: 64, 128, 256, 512, 1024, 2048, 4096, 8192.
> But changing that to recognizable intervals may be more user-friendly: 1m, 5m, 15m, 30m, 60m, 120m, 240m.

The latter sounds better.

Signed-off-by: Anugrah Vijay <anugrah.vijay@gmail.com>
@vanugrah vanugrah marked this pull request as ready for review August 11, 2021 23:58
@yeya24 (Contributor) previously approved these changes Aug 12, 2021 and left a comment:


LGTM. Thanks for the contribution.

@vanugrah

😄 Thanks for the speedy feedback!

@vanugrah

So for flaky e2e tests, do we just rerun?


@yeya24 (Contributor) commented Aug 12, 2021

> So for flaky e2e tests, do we just rerun?

I restarted it, and it is passing now.

@vanugrah

Woot 🎉 Thanks for your help, @yeya24. More PRs to come soon.

@GiedriusS (Member) left a comment:


With --wait, doesn't this mean that we will have an unbounded number of groups? 🤔 Perhaps a histogram over all durations would be more suitable, i.e. drop the group label?

@vanugrah commented Aug 12, 2021

Hello! 👋

Based on the DefaultGroupKey, it looks like the two parts of the group label are the resolution and a hash of the meta labels. So, if I'm not mistaken, the cardinality would increase by num_buckets x 3 (one per resolution) for each unique set of meta labels, i.e. each unique source of blocks. Theoretically, yes, there is no upper bound on the number of groups, since it grows with the number of things uploading blocks. But in practice I haven't heard of more than a few thousand block sources for a single bucket, even on the larger side, though this is definitely just a heuristic.

Another point to consider is that the other downsample metrics already use the group key as a label: https://github.com/thanos-io/thanos/blob/main/cmd/thanos/downsample.go#L46-L53

And we already initialize the metric for all groups:
https://github.com/thanos-io/thanos/blob/main/cmd/thanos/downsample.go#L117-L119

So we're not necessarily introducing any new risk of unbounded cardinality, and we'd be able to align labels with the existing metrics.

Still wrapping my head around the compactor so let me know if I've misunderstood anything 😄
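The back-of-the-envelope cardinality estimate above can be made concrete. This is an illustrative sketch with assumed numbers (1000 block sources is hypothetical; the bucket and resolution counts follow the discussion), not a measurement of any real deployment.

```go
package main

import "fmt"

// estimateSeries gives a rough upper bound on the time series a
// per-group duration histogram adds: each unique set of meta labels
// (block source) yields one group per downsample resolution, and each
// histogram group stores one series per bucket (including +Inf) plus
// the _sum and _count series.
func estimateSeries(sources, resolutions, explicitBuckets int) int {
	perGroup := (explicitBuckets + 1) + 2 // +Inf bucket, _sum, _count
	return sources * resolutions * perGroup
}

func main() {
	// Hypothetical: 1000 block sources, 3 resolutions, and the 7
	// explicit buckets discussed earlier in the thread.
	fmt.Println(estimateSeries(1000, 3, 7)) // 30000
}
```

Even with a generous 1000 sources this stays in the tens of thousands of series, which supports the argument that the group label is bounded in practice.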

@vanugrah

@GiedriusS Any thoughts?

@vanugrah

@yeya24 Your review was marked stale because I had to resolve a merge conflict in the CHANGELOG. Care to re-review?

@yeya24 (Contributor) left a comment:


LGTM. Let's merge it.

@yeya24 yeya24 enabled auto-merge (squash) August 23, 2021 21:15
@yeya24 yeya24 disabled auto-merge August 23, 2021 21:15
@yeya24 yeya24 merged commit ce1c4fe into thanos-io:main Aug 23, 2021