"Partition boost" the group by queries in MSQ for better splits by LakshSingla · Pull Request #15474 · apache/druid

LakshSingla · 2023-12-03T18:22:54Z

Description

In MSQ, each partition corresponds to a single segment (if it is the end stage) or the unit of work that is assigned to a worker. If the clustering key has low cardinality, then the returned partitions can be too large. For certain stages, we can break up the resulting partitions by introducing a synthetic clustering column "__boost" (which should contain a unique value - incrementally increasing longs for MSQ's implementation) at the end of the clustering keys, so that the keys become unique, and the partition boundaries are respectable. This is done in ScanQueryKit, and can be done in the GroupByPostShuffleFrameProcessor.

This PR introduces this partition boosting to the group by queries as well.

For example, the queries of the form

INSERT INTO table
SELECT dim1, dim2, agg(metric1) as aggregated FROM EXTERN(...)
GROUP BY dim1, dim2
PARTITIONED BY YEAR
CLUSTERED BY aggregated

can run into large segments if the "aggregated" is a very low cardinality column, and the (dim1, dim2) is a high cardinality pair. The solution (before the patch) would be to add more dims into the CLUSTERED BY clause to make it more unique. Post this patch, that won't be required.
ControllerImpl already handles partition-boosted columns specially.

NOTE: Group by queries can fail during the cluster upgrade, if some of the workers are on an older version, while the others are on a newer version.

Release note

Key changed/added classes in this PR

MyFoo
OurBar
TheirBaz

This PR has:

cryptoe · 2024-01-02T09:59:37Z

...ry/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByPostShuffleFrameProcessor.java

      final GroupByQuery query
  )
  {
+    List<VirtualColumn> virtualColumns = new ArrayList<>();


We need a UT for this.

I removed the conditionals all together, therefore the code is now straightforward.

cryptoe

Lets add some test cases. We do not want the boosted column as part of the segment.

cryptoe · 2024-01-02T10:14:11Z

...ry/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByPostShuffleFrameProcessor.java


    if (frameWriter.addSelection()) {
+      if (partitionBoostVirtualColumn != null) {
+        partitionBoostVirtualColumn.setValue(partitionBoostVirtualColumn.getValue() + 1);


Will this cause the compare function on L161 to behave weirdly ?

The compare function won't be behaving weirdly because:

The boosted column is only a part of the final written frame. The outpuRow doesn't contain the boosted column

Also, the compare function is computed based on the group by query, which will compare the dims and the __time column (if needed). Since __boost doesn't come there, it will choose to ignore the column altogether.

This can be confirmed in the COUNT(*) tests testInsertOnExternalDataSource which aggregates the rows with the correct result.

...ry/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByPostShuffleFrameProcessor.java

...e/multi-stage-query/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByQueryKit.java

cryptoe · 2024-01-02T10:26:40Z

...e/multi-stage-query/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByQueryKit.java

+    resultClusterByWithPartitionBoostColumns.add(new KeyColumn(QueryKitUtils.PARTITION_BOOST_COLUMN, KeyOrder.ASCENDING));
+    ClusterBy resultClusterByWithPartitionBoost = new ClusterBy(
+        resultClusterByWithPartitionBoostColumns,
+        resultClusterByWithoutPartitionBoost.getBucketByCount()


I think line 153 : 167 can be another method called createRowSignature.
Also the if on line 159 should be the first switch in the control flow. Something like

SigX=null; if(boosted) { sigX=foo }else{ sigX=bar }

Refactored to different function, and cleaned up the main code flow.

LakshSingla · 2024-01-09T05:34:32Z

We do not want the boosted column as part of the segment.

This is the job of the controller to remove the boosted column. Also, we have pre-existing MSQInsertTests, which assert on the expected row and the row signature of the segment. The boosted column doesn't show up there.

LakshSingla · 2024-01-09T05:36:14Z

I have also removed the conditional in the GroupByFrameProcessor and its factory because depending on the signature passed by the query kit, it will choose to add or ignore the partition boosted column. If the signature has __boost column in it, it will pick the column from the selector factory. Else, it will ignore the virtual column altogether. There's no need for additional conditional in the processor or the factory.

LakshSingla · 2024-01-09T11:12:14Z

Tested that there's a __boost column at the end of the post-processor factory. Also, I went into the debug mode into the GroupByPostShuffleFrameProcessor and the SegmentGeneratorFrameProcessor and the boost column is present in both places with an incrementing value.

cryptoe

Nit comments.
LGTM.

...e/multi-stage-query/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByQueryKit.java

LakshSingla · 2024-01-11T07:16:14Z

Thanks for the review @cryptoe

init commit

2cfba74

github-actions bot added Area - Batch Ingestion Area - MSQ For multi stage queries - https://github.com/apache/druid/issues/12262 labels Dec 3, 2023

cryptoe reviewed Jan 2, 2024

View reviewed changes

cleanup

8992555

cryptoe approved these changes Jan 9, 2024

View reviewed changes

...e/multi-stage-query/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByQueryKit.java Outdated Show resolved Hide resolved

...e/multi-stage-query/src/main/java/org/apache/druid/msq/querykit/groupby/GroupByQueryKit.java Outdated Show resolved Hide resolved

review

360bde0

LakshSingla merged commit 87fbe42 into apache:master Jan 11, 2024

LakshSingla deleted the groupby-boost branch January 11, 2024 07:16

LakshSingla added this to the 29.0.0 milestone Jan 29, 2024

Conversation

LakshSingla commented Dec 3, 2023

Description

Release note

Key changed/added classes in this PR

Uh oh!

cryptoe Jan 2, 2024

Choose a reason for hiding this comment

Uh oh!

LakshSingla Jan 9, 2024

Choose a reason for hiding this comment

Uh oh!

cryptoe left a comment

Choose a reason for hiding this comment

Uh oh!

cryptoe Jan 2, 2024

Choose a reason for hiding this comment

Uh oh!

LakshSingla Jan 9, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cryptoe Jan 2, 2024

Choose a reason for hiding this comment

Uh oh!

LakshSingla Jan 9, 2024

Choose a reason for hiding this comment

Uh oh!

LakshSingla commented Jan 9, 2024

Uh oh!

LakshSingla commented Jan 9, 2024

Uh oh!

LakshSingla commented Jan 9, 2024

Uh oh!

cryptoe left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

LakshSingla commented Jan 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants