Conversation

@dnhatn (Member) commented Nov 23, 2025

This change adds support for window functions for additional time-series aggregations, including `min_over_time`, `max_over_time`, `first_over_time`, `count_over_time`, and `sum_over_time`. These changes are straightforward. The main update in this PR is how the window is expanded before sliding over the partial results.

For example, given these data points:

```
|_tsid| cluster| host | timestamp            | metric |
| t1  | prod   | h1   | 2025-04-15T01:12:00Z | 100    |
| t2  | prod   | h2   | 2025-04-15T01:14:00Z | 200    |
```

With `bucket=5s` and no window:

```
TS ...
| WHERE TRANGE('2025-04-15T01:10:00Z', '2025-04-15T01:15:00Z')
| STATS sum(sum_over_time(metric)) BY host, TBUCKET(5s)
```

Yields:

```
cluster | bucket                 | SUM  |
prod    | 2025-04-15T01:10:00Z   | 300  |
```

With `window=5s`:

```
TS ...
| WHERE TRANGE('2025-04-15T01:10:00Z', '2025-04-15T01:15:00Z')
| STATS sum(sum_over_time(metric, 5s)) BY host, TBUCKET(1s)
```

Yields:

```
cluster | bucket                 | SUM  |
prod    | 2025-04-15T01:12:00Z   | 100  |
prod    | 2025-04-15T01:14:00Z   | 200  |
```

Ideally, all buckets from `2025-04-15T01:10:00Z` to `2025-04-15T01:14:00Z` should be generated:

```
cluster | bucket                 | SUM  |
prod    | 2025-04-15T01:10:00Z   | 300  |
prod    | 2025-04-15T01:11:00Z   | 300  |
prod    | 2025-04-15T01:12:00Z   | 300  |
prod    | 2025-04-15T01:13:00Z   | 200  |
prod    | 2025-04-15T01:14:00Z   | 200  |
```

With this change, buckets are expanded as if sliding over the raw input before combining for the final results.
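The expected behavior can be sketched as a self-contained example (hypothetical helper, not the actual ES|QL implementation; it assumes the window at bucket `b` covers `[b, b + window)`, which is what the expected table implies):

```java
import java.util.LinkedHashMap;
import java.util.Map;

class SlidingWindowSketch {
    // Sum each raw point into every bucket whose window [bucket, bucket + window) contains it,
    // i.e. expand over buckets before combining, instead of bucketing each point only once.
    static Map<Long, Long> sumOverTime(long[] timestamps, long[] values, long bucketSize, long window) {
        Map<Long, Long> sums = new LinkedHashMap<>();
        for (int i = 0; i < timestamps.length; i++) {
            // A point at time t lands in buckets b with t - window < b <= t (aligned to bucketSize).
            long first = Math.floorDiv(timestamps[i] - window, bucketSize) * bucketSize + bucketSize;
            long last = Math.floorDiv(timestamps[i], bucketSize) * bucketSize;
            for (long b = first; b <= last; b += bucketSize) {
                sums.merge(b, values[i], Long::sum);
            }
        }
        return sums;
    }

    public static void main(String[] args) {
        // Minutes past 01:00: points at 01:12 (value 100) and 01:14 (value 200), bucket=1, window=5.
        Map<Long, Long> sums = sumOverTime(new long[] {12, 14}, new long[] {100, 200}, 1, 5);
        sums.keySet().removeIf(b -> b < 10 || b >= 15); // keep the TRANGE [01:10, 01:15)
        System.out.println(sums); // prints {10=300, 11=300, 12=300, 13=200, 14=200}
    }
}
```

Run on the example data, this reproduces the expected table: buckets 01:10 through 01:12 see both points (300), while 01:13 and 01:14 only see the 01:14 point (200).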

@dnhatn dnhatn added >non-issue :StorageEngine/ES|QL Timeseries / metrics / logsdb capabilities in ES|QL v9.3.0 labels Nov 23, 2025
@github-actions (Contributor)
ℹ️ Important: Docs version tagging

👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version.

We use applies_to tags to mark version-specific features and changes.


When to use applies_to tags:

✅ At the page level to indicate which products/deployments the content applies to (mandatory)
✅ When features change state (e.g. preview, ga) in a specific version
✅ When availability differs across deployments and environments

What NOT to do:

❌ Don't remove or replace information that applies to an older version
❌ Don't add new information that applies to a specific version without an applies_to tag
❌ Don't forget that applies_to tags can be used at the page, section, and inline level


@dnhatn dnhatn force-pushed the more-window-function-1 branch from b6f0988 to b07352b Compare November 23, 2025 06:02
@dnhatn dnhatn force-pushed the more-window-function-1 branch from b07352b to 1c43338 Compare November 23, 2025 06:09
@dnhatn dnhatn requested a review from kkrik-es November 24, 2025 06:03
@dnhatn dnhatn marked this pull request as ready for review November 24, 2025 06:03
@elasticsearchmachine (Collaborator)
Pinging @elastic/es-storage-engine (Team:StorageEngine)

```java
 * 02:00 -> [12:00, 13:59) // bucket=2m
 * 00:04 -> [04:00, 05:59), [06:00, 07:59), [08:00, 09:59), [10:00, 11:59), [12:00, 13:59) // window=10m
 */
private void expandWindowBuckets() {
```
Member Author:

@kkrik-es This is the main change

```java
 * For example, if the window is 10 minutes and the bucket size is 2 minutes:
 * If we see a bucket at 12:00, we need to ensure buckets exist for the previous 10 minutes.
 * 02:00 -> [12:00, 13:59) // bucket=2m
 * 00:04 -> [04:00, 05:59), [06:00, 07:59), [08:00, 09:59), [10:00, 11:59), [12:00, 13:59) // window=10m
```
Contributor:

I'm somewhat confused by these examples - while the text above is clear. Shouldn't we have an example where, at 12:00, we need to include data from 11:50 and on?

Member Author:

I will update the comment; the PR description may provide more clarity.

Contributor:

Right, that example was nice.

```java
long endTimestamp = tsBlockHash.getLongKeyFromGroup(groupId);
long bucket = timeBucket.nextRoundingValue(endTimestamp - timeResolution.convert(largestWindowMillis()));
bucket = Math.max(bucket, tsBlockHash.getMinLongKey());
while (bucket < endTimestamp) {
```
Contributor:

Nit: consider adding a comment here, we're filling in missing buckets due to the window expanding outside each bucket.
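In isolation, the expansion in that excerpt can be sketched like this (hypothetical standalone method; the real code reads group keys from the time-series block hash and uses `Rounding.nextRoundingValue` rather than the manual rounding shown here):

```java
import java.util.ArrayList;
import java.util.List;

class ExpandBucketsSketch {
    // For a group ending at endTimestamp, emit every bucket the window reaches back over,
    // clamped to the smallest bucket actually present in the input (minBucket).
    static List<Long> expandWindowBuckets(long endTimestamp, long window, long bucketSize, long minBucket) {
        List<Long> buckets = new ArrayList<>();
        // First bucket boundary strictly after (endTimestamp - window), i.e. the next rounding value.
        long bucket = Math.floorDiv(endTimestamp - window, bucketSize) * bucketSize + bucketSize;
        bucket = Math.max(bucket, minBucket);
        while (bucket < endTimestamp) { // the bucket at endTimestamp already exists as a group
            buckets.add(bucket);
            bucket += bucketSize;
        }
        return buckets;
    }
}
```

With `window=10m`, `bucketSize=2m`, and a group ending at 12:00 (720s in mm:ss terms), this fills in the 04:00, 06:00, 08:00, and 10:00 buckets before the final combine, matching the code comment's example.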

@kkrik-es left a comment (Contributor)

Nice, makes sense.

@dnhatn (Member Author) commented Nov 24, 2025

Thanks Kostas!

@dnhatn dnhatn merged commit ff33b6c into elastic:main Nov 24, 2025
34 checks passed
@dnhatn dnhatn deleted the more-window-function-1 branch November 24, 2025 20:38
afoucret pushed a commit to afoucret/elasticsearch that referenced this pull request Nov 26, 2025
ncordon pushed a commit to ncordon/elasticsearch that referenced this pull request Nov 26, 2025