[PERF] Adaptive Query Execution #2176

samster25 · 2024-04-24T07:09:54Z

Implements AQE at query stage boundaries such as HashJoin, SMJ, Repartition, GroupedAgg, etc
In the case of binary ops, we rank the two paths and choose the one with lower cost
Also implements a better Approximate Statistics to rank the partial plans at forks
Implements AQE for both PyRunner and RayRunner
Fix bug in RayRunner build_partitions where it didn't forward partial metadata
Flag to enable AQE via DaftExecutionConfig or env variable DAFT_ENABLE_AQE=1
Follow on:

Turn on AQE testing in CI
Implement AQE Rules such as dynamic coalesce or DPP

codecov · 2024-04-26T00:17:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.08%. Comparing base (252721e) to head (0628788).
Report is 2 commits behind head on main.

❗ Current head 0628788 differs from pull request most recent head 0efd84b. Consider uploading reports for the commit 0efd84b to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2176      +/-   ##
==========================================
- Coverage   85.64%   85.08%   -0.56%     
==========================================
  Files          71       68       -3     
  Lines        7661     7367     -294     
==========================================
- Hits         6561     6268     -293     
+ Misses       1100     1099       -1

see 22 files with indirect coverage changes

clarkzinzow

LGTM! 🚀 🚀 🚀

clarkzinzow · 2024-05-07T20:47:05Z

daft/runners/partitioning.py

@@ -302,6 +302,9 @@ def num_partitions(self) -> int | None:
    def size_bytes(self) -> int | None:
        return self.value.size_bytes() if self.value is not None else None

+    def num_rows(self) -> int | None:
+        return len(self.value) if self.value is not None else None


We have an assertion in the AdaptivePhysicalPlanScheduler that cache_entry.num_rows() is not None, is there any case in which that won't be true (i.e. will cache_entry.value ever be None at that point)?

I'm not sure, I was just following the convention above

discussed offline!

clarkzinzow · 2024-05-07T20:48:01Z

daft/runners/pyrunner.py

+            adaptive_planner = builder.to_adaptive_physical_plan_scheduler(daft_execution_config)
+            while not adaptive_planner.is_done():
+                source_id, plan_scheduler = adaptive_planner.next()
+                # don't store partition sets in variable to avoid reference


Nice, good call.

daft/runners/pyrunner.py

clarkzinzow · 2024-05-07T20:51:30Z

daft/runners/ray_runner.py

-    metadatas = [PartitionMetadata.from_table(p) for p in partitions]
+    assert len(partial_metadatas) == len(partitions), f"{len(partial_metadatas)} vs {len(partitions)}"
+
+    metadatas = [PartitionMetadata.from_table(p).merge_with_partial(m) for p, m in zip(partitions, partial_metadatas)]


Nice! I was just doing something similar in the executor branch. 😄

src/daft-plan/src/physical_plan.rs

clarkzinzow · 2024-05-07T21:45:07Z

src/daft-plan/src/physical_plan.rs

+            }
+            Self::Project(Project { input, .. })
+            | Self::MonotonicallyIncreasingId(MonotonicallyIncreasingId { input, .. }) => {
+                // TODO(sammy), we need the schema to estimate the new size per row


Ah, we can always tweak logical->physical translation to add the schema to the physical Project and MontonicallyIncreasingId structs for this case! Obviously not a blocking issue, though.

src/daft-plan/src/physical_plan.rs

wip

0628788

samster25 force-pushed the sammy/query-stage-emitter branch from 4fc57ee to 0628788 Compare April 25, 2024 23:56

adaptive planner

715b2ab

samster25 changed the title ~~Query Stages~~ [PERF] Query Stages Apr 26, 2024

github-actions bot added the performance label Apr 26, 2024

samster25 added 24 commits April 26, 2024 01:44

wip

48d29db

Merge branch 'main' into sammy/query-stage-emitter

fdf06fe

Merge remote-tracking branch 'origin' into sammy/query-stage-emitter

42431a6

first pass AQE working

08a216b

enable partial plans

21ddbea

propagate materialized results in pset

1de8398

drop gil

911fd43

update ray part set for mat result

1b60667

type clean up

d3002b6

Merge remote-tracking branch 'origin' into sammy/query-stage-emitter

c734eb3

merge

03b17ba

refactor of approx stats module for physical plan

4eb1b11

dont emit query stage at root node

1a08bdf

debug seg fault

4a860e5

fix physical plan seg fault

04390f5

clean for debugging

b20479d

only invoke query boundary if more than 1 partition

475ff2a

clippy fixes

24fdedc

more clippy fixes

e7f7212

more clippy fixes

4ac8640

cargo check fixes

82976ca

fix mypy

dd6cc60

assume foreign key primary key join stats

2cee757

dont hold references across loops

2cf2847

samster25 added 6 commits May 6, 2024 13:10

move comments around

2bb1b56

refactor ray runner to methods

ab44272

refactor ray runner to methods 2

1580aeb

fix partial metadata bug

4d33950

enable env variable detection and drop log level

dbc68d7

add checks

dab51a1

samster25 changed the title ~~[PERF] Query Stages~~ [PERF] Adaptive Query Execution May 6, 2024

merge in unpivot

b92ab62

samster25 marked this pull request as ready for review May 6, 2024 23:35

samster25 requested review from clarkzinzow and jaychia May 6, 2024 23:38

clarkzinzow approved these changes May 7, 2024

View reviewed changes

samster25 added 2 commits May 7, 2024 15:10

remove unwrap 100

4c0f090

merge in main

0efd84b

samster25 enabled auto-merge (squash) May 7, 2024 22:19

samster25 merged commit b61461f into main May 7, 2024
27 checks passed

samster25 deleted the sammy/query-stage-emitter branch May 7, 2024 22:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PERF] Adaptive Query Execution #2176

[PERF] Adaptive Query Execution #2176

samster25 commented Apr 24, 2024 •

edited

Loading

codecov bot commented Apr 26, 2024 •

edited

Loading

clarkzinzow left a comment

clarkzinzow May 7, 2024

samster25 May 7, 2024

samster25 May 7, 2024

clarkzinzow May 7, 2024

clarkzinzow May 7, 2024

clarkzinzow May 7, 2024

[PERF] Adaptive Query Execution #2176

[PERF] Adaptive Query Execution #2176

Conversation

samster25 commented Apr 24, 2024 • edited Loading

codecov bot commented Apr 26, 2024 • edited Loading

Codecov Report

clarkzinzow left a comment

Choose a reason for hiding this comment

clarkzinzow May 7, 2024

Choose a reason for hiding this comment

samster25 May 7, 2024

Choose a reason for hiding this comment

samster25 May 7, 2024

Choose a reason for hiding this comment

clarkzinzow May 7, 2024

Choose a reason for hiding this comment

clarkzinzow May 7, 2024

Choose a reason for hiding this comment

clarkzinzow May 7, 2024

Choose a reason for hiding this comment

samster25 commented Apr 24, 2024 •

edited

Loading

codecov bot commented Apr 26, 2024 •

edited

Loading