feat: added parallelization on key partitioned data #18919

gene-bordegaray · 2025-11-24T21:44:47Z

Full Report

Issue 18777 Parallelize Key Partitioned Data.pdf

Which issue does this PR close?

Closes Enable Parallel Aggregation for Non-Overlapping Partitioned Data #18777.

Rationale for this change

Optimize aggregations on Hive-partitioned tables by eliminating unnecessary repartitioning/coalescing when grouping by partition columns. This enables parallel computation of complete results without a merge bottleneck.

What changes are included in this PR?

Introduce new partitioning type KeyPartitioned
Save and propagate file partition metadata through query plan
Change aggregation mode selection in physical planner
Update enforce distribution rules to eliminate unnecessary repartitioning

Are these changes tested?

Unit and integration tests added for all new logic

Benchmarking

For tpch it was unaffected as expected (not partitioned):

I create my own benchmark and saw these results:

Benchmarking hive_partitioned_agg/with_key_partitioned: Collecting 100 samples in estimated 6
hive_partitioned_agg/with_key_partitioned
                        time:   [12.356 ms 12.428 ms 12.505 ms]
                        change: [−1.6022% −0.8538% −0.0780%] (p = 0.03 < 0.05)
                        Change within noise threshold.
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe
Benchmarking hive_partitioned_agg/without_key_partitioned: Collecting 100 samples in estimate
hive_partitioned_agg/without_key_partitioned
                        time:   [13.179 ms 13.278 ms 13.382 ms]
                        change: [−0.8465% +0.2090% +1.2419%] (p = 0.70 > 0.05)
                        No change in performance detected.
Found 4 outliers among 100 measurements (4.00%)

These are not huge improvements as in memory hashing is pretty efficient but these are consistent gain (ran many times).

Are there any user-facing changes?

Yes, new configuration option: listing_table_preserve_partition_values
Changes query plans when activated

…= partition groups

gene-bordegaray · 2025-11-26T05:54:37Z

datafusion/datasource/src/file_scan_config.rs

+    pub preserve_partition_values: bool,
+    /// Cached result of key_partition_exprs computation to avoid repeated work
+    #[allow(clippy::type_complexity)]
+    key_partition_exprs_cache: OnceLock<Option<Vec<Arc<dyn PhysicalExpr>>>>,


Caches results of compute_key_partition_exprs() which is expensive:

loops through file groups and does hash set operations

called multiple times (output_partitioning() and eq_properties())

gene-bordegaray · 2025-11-26T05:58:13Z

datafusion/physical-optimizer/src/enforce_distribution.rs

                }
+                Distribution::KeyPartitioned(_) => {
+                    // Nothing to do: treated as satisfied upstream
+                }


No-op because we can guarantee that our data is correctly distributed

gene-bordegaray · 2025-11-26T06:04:34Z

datafusion/sqllogictest/test_files/agg_func_substitute.slt

-02)--AggregateExec: mode=FinalPartitioned, gby=[a@0 as a], aggr=[nth_value(multiple_ordered_table.c,Int64(1)) ORDER BY [multiple_ordered_table.c ASC NULLS LAST]], ordering_mode=Sorted
-03)----SortExec: expr=[a@0 ASC NULLS LAST], preserve_partitioning=[true]
-04)------CoalesceBatchesExec: target_batch_size=8192
-05)--------RepartitionExec: partitioning=Hash([a@0], 4), input_partitions=4


Eliminates this hash because it would break ordering guarantees

gene-bordegaray · 2025-11-26T06:10:34Z

cc: @NGA-TRAN @alamb this is updated solution with report on why I chose what I did

added parallelization on key partitioned data (opt in only)

5af5522

gene-bordegaray mentioned this pull request Nov 24, 2025

Enable Parallel Aggregation for Non-Overlapping Partitioned Data #18826

Closed

ran clippy, fixed typo, and added unit tests

00bb260

github-actions bot added the documentation Improvements or additions to documentation label Nov 25, 2025

gene-bordegaray added 2 commits November 24, 2025 18:12

ran clippy, fixed typo, and added unit tests

c97404e

simplified logic, enforced repartition rules when target partitions !…

8398852

…= partition groups

gene-bordegaray commented Nov 26, 2025

View reviewed changes

gene-bordegaray marked this pull request as ready for review November 26, 2025 06:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: added parallelization on key partitioned data #18919

feat: added parallelization on key partitioned data #18919

gene-bordegaray commented Nov 24, 2025 •

edited

Loading

Uh oh!

gene-bordegaray Nov 26, 2025 •

edited

Loading

Uh oh!

gene-bordegaray Nov 26, 2025

Uh oh!

gene-bordegaray Nov 26, 2025 •

edited

Loading

Uh oh!

gene-bordegaray commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: added parallelization on key partitioned data #18919

Are you sure you want to change the base?

feat: added parallelization on key partitioned data #18919

Conversation

gene-bordegaray commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Full Report

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Benchmarking

Are there any user-facing changes?

Uh oh!

gene-bordegaray Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gene-bordegaray Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

gene-bordegaray Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gene-bordegaray commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

gene-bordegaray commented Nov 24, 2025 •

edited

Loading

gene-bordegaray Nov 26, 2025 •

edited

Loading

gene-bordegaray Nov 26, 2025 •

edited

Loading