
Add factories for time series aggregation #107803

Merged · 4 commits · May 9, 2024
Conversation

@dnhatn (Member) commented Apr 24, 2024

This change introduces operator factories for time-series aggregations. A time-series aggregation executes in three stages, deviating from the typical two-stage aggregation.

For example: `sum(rate(write_requests)), avg(cpu) BY cluster, time-bucket`

**1. Initial Stage:**
A standard hash aggregation is executed, grouped by tsid and time-bucket. `values` aggregations are added to collect the values of the grouping keys (excluding the time-bucket), which are then used for the final result grouping.

```
rate[INITIAL](write_requests), avg[INITIAL](cpu), values[SINGLE](cluster) BY tsid, time-bucket
```

**2. Intermediate Stage:**
Equivalent to the final mode of a standard hash aggregation. This stage merges and reduces the results of the rate aggregations, but merges without reducing the results of non-rate aggregations: certain aggregations, such as count_distinct, cannot have their final results combined, so their intermediate states must be kept until the final grouping.

```
rate[FINAL](write_requests), avg[INTERMEDIATE](cpu), values[SINGLE](cluster) BY tsid, time-bucket
```

**3. Final Stage:**
This extra stage performs outer aggregations over the rate results and combines the intermediate results of non-rate aggregations using the user-specified grouping keys.

```
sum[SINGLE](rate_result), avg[FINAL](cpu) BY cluster, bucket
```
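The three stages above can be sketched on toy data with plain Java collections. This is an illustrative model only — `Sample`, `Partial`, and `run` are hypothetical names, not the Elasticsearch compute-engine API — and it assumes samples arrive sorted by time with at least two samples per (tsid, bucket):

```java
import java.util.*;

// Toy model of the three-stage plan; not the real operator/factory API.
public class ThreeStageSketch {

    public record Sample(String tsid, String cluster, long bucket, long timeMs, double counter, double cpu) {}

    /** Per (tsid, bucket) state: rate inputs plus avg's intermediate (sum, count). */
    static final class Partial {
        String cluster;                        // collected, as values[SINGLE](cluster) would
        Double firstCounter;                   // null until the first sample is seen
        double lastCounter;
        long firstMs, lastMs;
        double cpuSum;                         // avg[INITIAL] keeps sum and count
        long cpuCount;
    }

    /** Stages 1+2: hash-aggregate by (tsid, bucket); stage 3: regroup by (cluster, bucket). */
    public static Map<List<Object>, double[]> run(List<Sample> samples) {
        Map<List<Object>, Partial> byTsidBucket = new LinkedHashMap<>();
        for (Sample s : samples) {
            Partial p = byTsidBucket.computeIfAbsent(List.of(s.tsid(), s.bucket()), k -> new Partial());
            p.cluster = s.cluster();
            if (p.firstCounter == null) { p.firstCounter = s.counter(); p.firstMs = s.timeMs(); }
            p.lastCounter = s.counter();
            p.lastMs = s.timeMs();
            p.cpuSum += s.cpu();
            p.cpuCount++;
        }
        // Stage 3: finalize each rate, sum rates per (cluster, bucket), and merge avg partials.
        Map<List<Object>, double[]> acc = new LinkedHashMap<>();   // {rateSum, cpuSum, cpuCount}
        for (Map.Entry<List<Object>, Partial> e : byTsidBucket.entrySet()) {
            Partial p = e.getValue();
            double rate = (p.lastCounter - p.firstCounter) / ((p.lastMs - p.firstMs) / 1000.0);
            double[] a = acc.computeIfAbsent(List.of(p.cluster, e.getKey().get(1)), k -> new double[3]);
            a[0] += rate;
            a[1] += p.cpuSum;
            a[2] += p.cpuCount;
        }
        Map<List<Object>, double[]> out = new LinkedHashMap<>();   // {sum(rate), avg(cpu)}
        acc.forEach((k, v) -> out.put(k, new double[] { v[0], v[1] / v[2] }));
        return out;
    }
}
```

The real operators work on pages of blocks rather than row objects, but the data flow is the same: rates are finalized before the outer `sum`, while `avg` stays in intermediate form until the final grouping.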

@dnhatn dnhatn force-pushed the metrics-operator branch 3 times, most recently from b94607c to ca154b7 Compare April 24, 2024 03:53
@dnhatn (Member, Author) commented Apr 24, 2024

Before this change, I took a different approach, creating a MetricsAggregationOperator that extends the HashAggregationOperator and overrides the output of the final mode. However, I believe this current approach is cleaner, despite the alternative potentially being more efficient.

* Equivalent to the final mode of a standard hash aggregation.
* This stage merges and reduces the result of the rate aggregations,
* but merges (without reducing) the results of non-rate aggregations.
* Certain aggregations, such as count_distinct, cannot have their final results combined.
@kkrik-es (Contributor) commented Apr 24, 2024
Why is this the case? Don't we assume that the final grouping fields are included in the tsid? Or, we don't have the mapping between tsid values and the final groups?

In this particular case, avg(cpu) is tracked as sum and count, which can be reduced by [tsid, time-bucket] for each [tsid, cluster] combo?
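On the avg point above: in this model, avg's intermediate state is a (sum, count) pair, so per-[tsid, time-bucket] partials can indeed be merged by plain addition for each [cluster, bucket] combo and finished once at the end. A minimal sketch (`AvgPartial` is a hypothetical name, not a class in this PR):

```java
// Hypothetical stand-in for avg's intermediate state.
public class AvgPartial {
    public final double sum;
    public final long count;

    public AvgPartial(double sum, long count) {
        this.sum = sum;
        this.count = count;
    }

    // Merging two partials is plain addition, so it is safe across tsids.
    public AvgPartial merge(AvgPartial other) {
        return new AvgPartial(sum + other.sum, count + other.count);
    }

    // Finishing divides once, after all partials for the group are merged.
    public double finish() {
        return sum / count;
    }
}
```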

A member replied:

Is this because the distinct counts per tsid and time bucket aren't a subset of the dimension groups?

@dnhatn (Member, Author) replied:

Ah, I think my wording is confusing. Just to clarify, this isn't a limitation; it's the reason why we still need to keep the intermediate states of non-rate aggregation after the second stage. I will reword this.

@martijnvg (Member) left a comment:

Thanks Nhat, the direction here looks good to me.

```java
List<GroupingAggregator.Factory> aggregators = new ArrayList<>(outerRates.size() + nonRates.size());
for (AggregatorFunctionSupplier f : outerRates) {
    aggregators.add(f.groupingAggregatorFactory(AggregatorMode.SINGLE));
}
```
A member commented:

It feels a little weird that you'd run these "outer" aggs on rates here and not as part of "something else" - like, they are just regular stats at that point. But you have to "split" the stream, right? The "outer" aggs get the incoming pages and the non-rate aggs get the incoming pages too. Those can't be separate Operators because they both want to consume the page.

I guess we could make an Operator that makes a shallowCopy of the page, passes it into a pipeline breaker, and then passes the result onwards. That'd split the stream. But the way I'd model the planning for this is pretty similar to what you've written here anyway, so maybe it doesn't make a difference.

Am I understanding the problem?

@dnhatn (Member, Author) replied:

After the second stage, the output page contains [tsid, time-bucket, final-rate, intermediate result of non-rates, grouping keys]. This page can be split into two separate pages: the first containing [grouping keys, time-bucket, final-rate], and the second containing [grouping keys, time-bucket, intermediate result of non-rates]. Two independent hash aggregations can then be run over these pages, although they will have to hash the grouping keys twice; this should not significantly impact performance.

My primary concern is the order of the grouping keys emitted by BlockHashes: the two hash operators must emit their grouping keys in the same order so the results can be merged back positionally. Currently, our BlockHash implementations guarantee this, although we have never documented that guarantee.

I will apply your suggestion to implement a SplitOperator, which will execute a series of operators sequentially (for now) and then merge the output pages into a single page. Thanks Nik!
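The split-and-merge idea in this comment can be sketched with ordered maps standing in for the ordered-grouping-keys behavior of the BlockHash implementations. This is purely illustrative - `Row`, `SplitAndZip`, and `run` are hypothetical names - and it uses `TreeMap` so both aggregations emit keys in the same order and can be zipped positionally:

```java
import java.util.*;

// Illustrative sketch: the same rows feed two independent aggregations
// (sum over one column, avg over another), each keyed by the grouping key.
public class SplitAndZip {

    public record Row(String key, double rate, double cpu) {}

    public static Map<String, double[]> run(List<Row> rows) {
        // "First page": sum(rate) per key. "Second page": avg(cpu) as (sum, count) per key.
        TreeMap<String, Double> sumRate = new TreeMap<>();
        TreeMap<String, double[]> avgCpu = new TreeMap<>();
        for (Row r : rows) {
            sumRate.merge(r.key(), r.rate(), Double::sum);
            double[] a = avgCpu.computeIfAbsent(r.key(), k -> new double[2]);
            a[0] += r.cpu();
            a[1]++;
        }
        // Zip: both maps emit keys in the same (sorted) order, so rows line up.
        Map<String, double[]> out = new LinkedHashMap<>();
        Iterator<Map.Entry<String, double[]>> it = avgCpu.entrySet().iterator();
        for (Map.Entry<String, Double> e : sumRate.entrySet()) {
            double[] a = it.next().getValue();
            out.put(e.getKey(), new double[] { e.getValue(), a[0] / a[1] });
        }
        return out;
    }
}
```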

A member replied:

I'm not sure that SplitOperator is the right thing. Maybe it is! I was mostly thinking of it as a building block for my understanding. But if you think it'll help, go for it!

@dnhatn (Member, Author) commented May 8, 2024

test this please

@dnhatn dnhatn added :StorageEngine/TSDB You know, for Metrics >non-issue labels May 9, 2024
@dnhatn dnhatn marked this pull request as ready for review May 9, 2024 01:45
@elasticsearchmachine (Collaborator) commented:

Pinging @elastic/es-storage-engine (Team:StorageEngine)

@dnhatn (Member, Author) commented May 9, 2024

@kkrik-es @martijnvg @nik9000 Thank you for your review and feedback. I am going to merge this PR as is. I think we will need some adjustments when integrating with the plans.

@dnhatn dnhatn merged commit 155e7c5 into elastic:main May 9, 2024
15 checks passed
@dnhatn dnhatn deleted the metrics-operator branch May 9, 2024 04:34
markjhoy pushed a commit to markjhoy/elasticsearch that referenced this pull request May 9, 2024
5 participants