Add support for defining custom window frame bounds for window functions #14249

yashmayya · 2024-10-17T10:12:34Z

Fixes [Multi Stage] Add ROWS support for aggregation window functions #11406.
Background reading - https://www.postgresql.org/docs/current/tutorial-window.html, https://www.postgresql.org/docs/current/functions-window.html, https://www.postgresql.org/docs/current/sql-expressions.html#SYNTAX-WINDOW-FUNCTIONS (this one is most relevant to this PR).
Currently, Pinot's window function implementations have limited, and in some cases incorrect, support for window frame bounds. For instance, FIRST_VALUE / LAST_VALUE assume that the window frame is always ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING, even though the default window frame as per standard SQL is RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW. Furthermore, there is no support for explicitly defining the lower bound as UNBOUNDED PRECEDING / CURRENT ROW / n FOLLOWING / n PRECEDING or the upper bound as UNBOUNDED FOLLOWING / CURRENT ROW / n FOLLOWING / n PRECEDING.
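To make the FIRST_VALUE / LAST_VALUE discrepancy concrete, here is a minimal standalone sketch (not Pinot code) of LAST_VALUE under the two frames. It assumes a single partition whose rows are already sorted by an integer ORDER BY key; the class and method names are hypothetical.

```java
import java.util.Arrays;

// Illustrative only: not the Pinot implementation.
final class LastValueFrameSketch {
  // ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING:
  // every row sees the last row of the partition.
  static int[] lastValueFullPartition(int[] values) {
    int[] result = new int[values.length];
    if (values.length > 0) {
      Arrays.fill(result, values[values.length - 1]);
    }
    return result;
  }

  // RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW:
  // the frame ends at the last peer of the current row, i.e. the last
  // row sharing the current row's ORDER BY key.
  static int[] lastValueDefaultFrame(int[] orderKeys, int[] values) {
    int n = values.length;
    int[] result = new int[n];
    int i = 0;
    while (i < n) {
      int end = i;
      while (end + 1 < n && orderKeys[end + 1] == orderKeys[i]) {
        end++;
      }
      // All peers in [i, end] share the same frame end.
      for (int j = i; j <= end; j++) {
        result[j] = values[end];
      }
      i = end + 1;
    }
    return result;
  }
}
```

With order keys {1, 2, 2, 3} and values {10, 20, 30, 40}, the full-partition frame yields 40 for every row, while the default frame yields 10, 30, 30, 40.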
This patch adds support for any custom bounds (offset based or otherwise) for ROWS window frames, and also adds support for UNBOUNDED PRECEDING / CURRENT ROW / UNBOUNDED FOLLOWING bounds for RANGE window frames. There are many edge cases to handle here; this patch adds test cases intended to cover most of them.
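As a rough illustration of the edge cases involved in offset-based ROWS frames, the sketch below computes SUM over a frame given as signed row offsets (negative for n PRECEDING, 0 for CURRENT ROW, positive for n FOLLOWING). The sentinel constants, class name, and method name are assumptions made for this example and do not mirror the classes in this patch. Frames are clamped to the partition, and an empty frame produces null.

```java
// Illustrative only: a naive per-row evaluation of a ROWS frame.
final class RowsFrameSketch {
  // Hypothetical sentinels for unbounded frame ends.
  static final int UNBOUNDED_PRECEDING = Integer.MIN_VALUE;
  static final int UNBOUNDED_FOLLOWING = Integer.MAX_VALUE;

  static Long[] sumOverRowsFrame(long[] values, int lowerOffset, int upperOffset) {
    int n = values.length;
    Long[] result = new Long[n];
    for (int i = 0; i < n; i++) {
      // Use long arithmetic so i + offset cannot overflow, then clamp to the partition.
      long start = lowerOffset == UNBOUNDED_PRECEDING ? 0 : Math.max(0L, (long) i + lowerOffset);
      long end = upperOffset == UNBOUNDED_FOLLOWING ? n - 1 : Math.min(n - 1L, (long) i + upperOffset);
      if (start > end) {
        continue; // Empty frame: SUM is null.
      }
      long sum = 0;
      for (long j = start; j <= end; j++) {
        sum += values[(int) j];
      }
      result[i] = sum;
    }
    return result;
  }
}
```

For values {1, 2, 3, 4, 5}, ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING gives 3, 6, 9, 12, 9, while ROWS BETWEEN 2 FOLLOWING AND 3 FOLLOWING gives 7, 9, 5, null, null because the frame runs off the end of the partition for the last two rows.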
Note that Calcite (and hence Pinot) only supports ROWS and RANGE based window frame bounds, whereas Postgres also supports GROUPS.
The planner side changes (mainly literal extraction) are built over [WIP] Support preceding and following in WINDOW #14233.
Apart from the need to add support for offset bounds for RANGE based window frames, another important future enhancement is to optimize the performance of ROWS based window frames for aggregate window functions where both the lower and upper bounds are offset based / current row. Since the changes in this patch are built over the existing framework for window functions where a "merger" is used to merge values for aggregate window functions, it isn't possible to use a sliding window based algorithm to efficiently compute aggregates for windows. This will require more significant changes to the framework but is critical to ensure performant computations especially for larger windows. Optimizations have been added in this patch to ensure that aggregation window functions over window frames with UNBOUNDED PRECEDING lower bound or UNBOUNDED FOLLOWING upper bound are computed efficiently.
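For reference, a sliding window computation of SUM over an offset-bounded ROWS frame could look roughly like the sketch below. This is not part of this patch and is not how the existing merger-based framework works; it only illustrates the kind of algorithm meant above, with hypothetical names, and assumes both offsets are finite with lowerOffset <= upperOffset.

```java
// Illustrative only: O(n) sliding SUM for ROWS frames with finite offset bounds.
final class SlidingSumSketch {
  static Long[] slidingSum(long[] values, int lowerOffset, int upperOffset) {
    int n = values.length;
    Long[] result = new Long[n];
    long sum = 0;
    int start = 0; // First index currently included in the running sum.
    int end = -1;  // Last index currently included in the running sum.
    for (int i = 0; i < n; i++) {
      long targetStart = Math.max(0L, (long) i + lowerOffset);
      long targetEnd = Math.min(n - 1L, (long) i + upperOffset);
      // Grow the window on the right, then shrink it on the left.
      while (end < targetEnd) {
        sum += values[++end];
      }
      while (start < targetStart && start <= end) {
        sum -= values[start++];
      }
      if (targetStart <= targetEnd) {
        result[i] = sum;
      } // Otherwise the frame is empty and the result stays null.
    }
    return result;
  }
}
```

Each row is added and removed at most once, so the cost is linear in the partition size regardless of frame width. The subtraction trick only works for invertible aggregates such as SUM and COUNT; MIN and MAX would need a different structure (for example a monotonic deque).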
Note that all the changes here only affect the aggregate window functions (SUM, COUNT, MIN, MAX etc.) and FIRST_VALUE / LAST_VALUE. The other window functions currently supported by Pinot (LAG, LEAD, RANK, DENSE_RANK, ROW_NUMBER) don't support custom window frame bounds, and Calcite enforces this during query planning.
Calcite also does some other validation for window frame bounds like making sure lower bound isn't UNBOUNDED FOLLOWING / upper bound isn't UNBOUNDED PRECEDING, lower bound isn't UNBOUNDED FOLLOWING if upper bound is UNBOUNDED PRECEDING and vice versa etc.

codecov-commenter · 2024-10-17T11:13:48Z

Codecov Report

Attention: Patch coverage is 88.61210% with 32 lines in your changes missing coverage. Please review.

Project coverage is 63.78%. Comparing base (59551e4) to head (5346878).
Report is 1214 commits behind head on master.

Files with missing lines	Patch %	Lines
.../query/planner/logical/PlanNodeToRelConverter.java	0.00%	11 Missing ⚠️
...ator/window/aggregate/AggregateWindowFunction.java	93.00%	1 Missing and 6 partials ⚠️
...e/rel/rules/PinotWindowExchangeNodeInsertRule.java	88.88%	2 Missing and 4 partials ⚠️
.../query/planner/logical/RelToPlanNodeConverter.java	75.00%	1 Missing and 3 partials ⚠️
...not/query/runtime/operator/window/WindowFrame.java	84.61%	1 Missing and 1 partial ⚠️
...uery/runtime/operator/WindowAggregateOperator.java	80.00%	0 Missing and 1 partial ⚠️
...operator/window/value/LastValueWindowFunction.java	97.14%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##             master   #14249      +/-   ##
============================================
+ Coverage     61.75%   63.78%   +2.03%     
- Complexity      207     1536    +1329     
============================================
  Files          2436     2627     +191     
  Lines        133233   144844   +11611     
  Branches      20636    22187    +1551     
============================================
+ Hits          82274    92385   +10111     
- Misses        44911    45647     +736     
- Partials       6048     6812     +764

Flag	Coverage Δ
custom-integration1	`100.00% <ø> (+99.99%)`	⬆️
integration	`100.00% <ø> (+99.99%)`	⬆️
integration1	`100.00% <ø> (+99.99%)`	⬆️
integration2	`0.00% <ø> (ø)`
java-11	`63.75% <88.61%> (+2.04%)`	⬆️
java-21	`63.67% <88.61%> (+2.04%)`	⬆️
skip-bytebuffers-false	`63.76% <88.61%> (+2.01%)`	⬆️
skip-bytebuffers-true	`63.64% <88.61%> (+35.92%)`	⬆️
temurin	`63.78% <88.61%> (+2.03%)`	⬆️
unittests	`63.77% <88.61%> (+2.03%)`	⬆️
unittests1	`55.52% <88.61%> (+8.63%)`	⬆️
unittests2	`34.27% <1.42%> (+6.54%)`	⬆️


yashmayya · 2024-10-17T14:38:34Z

...n/java/org/apache/pinot/query/runtime/operator/window/aggregate/AggregateWindowFunction.java

-    if (_partitionByOnly) {
-      return processPartitionOnlyRows(rows);


I'd initially removed this optimization to reduce clutter since there are a lot of different cases being handled in the new function implementation. However, on second thoughts, the optimization to avoid key computation (among other things) for each row might be significant enough to be worth retaining?

Also, the optimization is still applied to windows without ORDER BY, since Calcite forces the window frame to be RANGE BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING for such windows (and we do avoid the per row key computation for RANGE/ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING). So the only other case is where the partition keys and order by keys are identical.

vrajat · 2024-10-22T16:54:35Z

pinot-query-planner/src/test/java/org/apache/pinot/query/QueryCompilationTest.java

+  @Test
+  public void testWindowFunctionsWithCustomWindowFrame() {
+    String queryWithDefaultWindow = "SELECT col1, col2, RANK() OVER (PARTITION BY col1 ORDER BY col2) FROM a";
+    _queryEnvironment.planQuery(queryWithDefaultWindow);


Is this a complete test for a query? Is the expectation that planning won't throw an exception?

Also, does the same test contain queries that will throw a parse exception?

> Is this a complete test for a query?

The queries aren't actually executed, just validated, compiled and optimized.

> Is the expectation that planning won't throw an exception?

Yes. I can change it to an assertion on QueryEnvironment.canCompileQuery to make that clearer, perhaps.

> Also, does the same test contain queries that will throw a parse exception?

Yes.

vrajat · 2024-10-22T16:57:14Z

...-tests/src/test/java/org/apache/pinot/integration/tests/MultiStageEngineIntegrationTest.java

  }

+  @Test
+  public void testWindowFunction()


Is this test added to check whether queries with these window functions execute?

Yes, basically to verify that the end-to-end flow (query planning + execution with runtime operators) works without errors.

yashmayya · 2024-10-23T04:47:43Z

Superseded by #14273.

yashmayya added the feature, release-notes, and multi-stage labels Oct 17, 2024

yashmayya force-pushed the window-function-custom-window-frames branch from 1018bcf to 640ad27 Compare October 17, 2024 10:37

yashmayya marked this pull request as ready for review October 17, 2024 11:39

yashmayya force-pushed the window-function-custom-window-frames branch from 640ad27 to a445cb0 Compare October 17, 2024 14:42

yashmayya commented Oct 17, 2024

yashmayya mentioned this pull request Oct 18, 2024

[WIP] Support preceding and following in WINDOW #14233

Closed

yashmayya force-pushed the window-function-custom-window-frames branch 4 times, most recently from f1425c0 to 377fecd Compare October 18, 2024 13:05

ankitsultana mentioned this pull request Oct 18, 2024

Fix race condition in IdealStateGroupCommit #14237

Merged

yashmayya added 2 commits October 21, 2024 12:26

Add support for defining custom window frame bounds for window functions

9273cfd

Refactor logic for ROWS type window frame processing to avoid using l…

6e95ca0

…ongs

yashmayya force-pushed the window-function-custom-window-frames branch from 377fecd to 6e95ca0 Compare October 21, 2024 06:56

Add a couple more test cases

363abef

This was referenced Oct 21, 2024

Add IGNORE NULLS option to FIRST_VALUE and LAST_VALUE window functions yashmayya/pinot#1

Closed

Add IGNORE NULLS option to FIRST_VALUE and LAST_VALUE window functions #14264

Merged

yashmayya added 2 commits October 22, 2024 10:50

Minor refactor in AggregateWindowFunction

24836ff

Move WindowFrame class from org.apache.pinot.query.runtime.operator.w…

5346878

…indow.aggregate to org.apache.pinot.query.runtime.operator.window

yashmayya mentioned this pull request Oct 22, 2024

Add support for defining custom window frame bounds for window functions #14273

Merged

vrajat reviewed Oct 22, 2024

yashmayya closed this Oct 23, 2024
