ESQL: Implement first/last_over_time for exponential histograms #138639

JonasKunz · 2025-11-26T08:54:04Z

Implements first_over_time and last_over_time for exponential_histograms.
I decided to handroll the state for (long, ExponentialHistogram) pairs and the aggregators for the functions above,
as otherwise I think the templates would get more messy with special cases.

If we eventually encounter too much copied code, we can revisit that decision.

elasticsearchmachine · 2025-11-27T08:57:36Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

dnhatn

I've left some comments, but this looks good. Thanks Jonas!

.../compute/src/main/java/org/elasticsearch/compute/aggregation/ExponentialHistogramStates.java

dnhatn · 2025-11-30T06:54:07Z

.../compute/src/main/java/org/elasticsearch/compute/aggregation/ExponentialHistogramStates.java

+            assert histogramValue != null;
+            ensureCapacity(groupId);
+            Releasables.close(histogramValues.get(groupId));
+            histogramValues.set(groupId, ExponentialHistogram.builder(histogramValue, breaker).build());


Here, we copy every candidate we see. This is fine for last_over_time with tsdb, but for first_over_time we may copy and discard many values. Is it possible to make ExponentialHistogram ref-counted and delay copying until the end? If so, we can improve this in a follow-up.

This is fine for last_over_time with tsdb, but for first_over_time we may copy and discard many values

Just for me understanding, isn't it the other way around? In TSDB, we iterate over the values sorted by time.
So for last, we actually see the desired value last and therefore keep overriding the state all the time?

Is it possible to make ExponentialHistogram ref-counted and delay copying until the end? If so, we can improve this in a follow-up.

The exponential histograms we operate on here directly work on the byte[] owned by the block. So to keep a reference to the histogram, we'd need to keep the reference to the entire block. I assume that we want to avoid this, as it could hog a lot of memory?

If we want to avoid the above, I think we can't get away without copying. But we can at least avoid the allocations and the decoding/encoding of the histogram.

Similar to BreakingBytesRefBuilder, we could add a corresponding histogram builder, which directly copies the encoded histogram bytes and can be reused. WDYT?

Created an issue:
#138809

Thanks, Jonas! That works. It is quite optional since it only affects first_over_time, which I think is not commonly used.

.../compute/src/main/java/org/elasticsearch/compute/aggregation/ExponentialHistogramStates.java

# Conflicts: # x-pack/plugin/esql/qa/testFixtures/src/main/resources/exponential_histogram.csv-spec

elasticsearchmachine added external-contributor Pull request authored by a developer outside the Elasticsearch team v9.3.0 labels Nov 26, 2025

JonasKunz force-pushed the exp-histo-overtime-aggs branch from 84b7227 to 6ed7422 Compare November 26, 2025 08:55

JonasKunz changed the title ~~Exp histo overtime aggs~~ ESQL: Implement first/last_over_time for exponential histograms Nov 26, 2025

JonasKunz mentioned this pull request Nov 26, 2025

ESQL: Support exponential histograms in TS queries #138671

Closed

10 tasks

JonasKunz added 4 commits November 27, 2025 09:39

Add LastOverTime implementation for exponential histograms

6e05581

Added CSV tests

eeecdbf

Update capability

eda1864

Add first_over_time

9c323ab

JonasKunz force-pushed the exp-histo-overtime-aggs branch from 594f6c9 to 9c323ab Compare November 27, 2025 08:40

Remove generated import comments

4d1665b

JonasKunz added :Analytics/ES|QL AKA ESQL >non-issue labels Nov 27, 2025

JonasKunz marked this pull request as ready for review November 27, 2025 08:57

JonasKunz requested a review from dnhatn November 27, 2025 08:57

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Nov 27, 2025

dnhatn approved these changes Nov 30, 2025

View reviewed changes

dnhatn reviewed Nov 30, 2025

View reviewed changes

.../compute/src/main/java/org/elasticsearch/compute/aggregation/ExponentialHistogramStates.java Outdated Show resolved Hide resolved

JonasKunz added 3 commits December 1, 2025 09:53

Merge remote-tracking branch 'elastic/main' into exp-histo-overtime-aggs

2eeead3

# Conflicts: # x-pack/plugin/esql/qa/testFixtures/src/main/resources/exponential_histogram.csv-spec

Fix circuit breaker leaks

22cadef

increment capability

9961590

JonasKunz force-pushed the exp-histo-overtime-aggs branch from a36e755 to 9961590 Compare December 1, 2025 09:24

JonasKunz added 2 commits December 1, 2025 11:04

Merge branch 'main' into exp-histo-overtime-aggs

c3b471f

Merge branch 'main' into exp-histo-overtime-aggs

626a2c2

JonasKunz mentioned this pull request Dec 1, 2025

Optimization: Avoid excessive decoding and allocation in exponential histogram last/first_over_time #138809

Open

JonasKunz merged commit f6ffd56 into elastic:main Dec 1, 2025
34 checks passed

JonasKunz deleted the exp-histo-overtime-aggs branch December 1, 2025 12:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ESQL: Implement first/last_over_time for exponential histograms #138639

ESQL: Implement first/last_over_time for exponential histograms #138639

Uh oh!

JonasKunz commented Nov 26, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Nov 27, 2025

Uh oh!

dnhatn left a comment

Uh oh!

Uh oh!

Uh oh!

dnhatn Nov 30, 2025

Uh oh!

JonasKunz Dec 1, 2025 •

edited

Loading

Uh oh!

JonasKunz Dec 1, 2025

Uh oh!

dnhatn Dec 1, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ESQL: Implement first/last_over_time for exponential histograms #138639

ESQL: Implement first/last_over_time for exponential histograms #138639

Uh oh!

Conversation

JonasKunz commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Nov 27, 2025

Uh oh!

dnhatn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dnhatn Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

JonasKunz Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JonasKunz Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

dnhatn Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

JonasKunz commented Nov 26, 2025 •

edited

Loading

JonasKunz Dec 1, 2025 •

edited

Loading