This repository has been archived by the owner on Jun 14, 2024. It is now read-only.

[Gold Standard] Updated plans for all tpcds queries with spark-only setup #377

Open
wants to merge 28 commits into base: master
28 commits
208ea5e
gold standard initial commit
apoorvedave1 Feb 19, 2021
cc14991
fix q32
apoorvedave1 Feb 19, 2021
7c8ee78
Merge branch 'master' of github.com:apoorvedave1/hyperspace-1 into gs
apoorvedave1 Feb 19, 2021
fa9bd4a
Merge branch 'master' of github.com:apoorvedave1/hyperspace-1 into gs
apoorvedave1 Mar 10, 2021
4b007a5
keep only tpcds v1.4 and remove others
apoorvedave1 Mar 10, 2021
59207f6
Merge remote-tracking branch 'upstream/master' into gs_initial
apoorvedave1 Mar 10, 2021
5c0cee9
Trigger Build
apoorvedave1 Mar 10, 2021
900539d
build error: test with sequential run
apoorvedave1 Mar 10, 2021
530dfa7
revert previous commit
apoorvedave1 Mar 10, 2021
add01f1
update plans
apoorvedave1 Mar 10, 2021
8b58b6b
update plan
apoorvedave1 Mar 10, 2021
97e8441
added sorting for fixing build pipeline
apoorvedave1 Mar 11, 2021
411450f
udpated plans with sorting
apoorvedave1 Mar 11, 2021
b70f00b
fix q49
apoorvedave1 Mar 11, 2021
7880f40
updated instructions on how to run tests
apoorvedave1 Mar 11, 2021
3dcd2d5
test with updated plans for q47, q49
apoorvedave1 Mar 12, 2021
a7a1149
update q47, 49 plans
apoorvedave1 Mar 12, 2021
dd295a9
remove rogue query
apoorvedave1 Mar 12, 2021
f47fbd3
remove q49
apoorvedave1 Mar 12, 2021
d15a4e8
fix scalastyle
apoorvedave1 Mar 12, 2021
4c511fb
add query files for q49.sql
apoorvedave1 Mar 12, 2021
e48c713
restructuring
apoorvedave1 Mar 12, 2021
6c5a2f7
cleanup before rebase
apoorvedave1 Mar 13, 2021
25cb361
Merge remote-tracking branch 'upstream/master' into gs_initial
apoorvedave1 Mar 13, 2021
0b40853
updated plans based on the plan stability suite
apoorvedave1 Mar 13, 2021
32f5899
normalize location: fix
apoorvedave1 Mar 15, 2021
352f2f5
Merge branch 'master' of github.com:apoorvedave1/hyperspace-1 into gs…
apoorvedave1 Mar 16, 2021
d5fbec8
query plans for all queries
apoorvedave1 Mar 16, 2021
296 changes: 42 additions & 254 deletions src/test/resources/tpcds/spark-2.4/approved-plans-v1_4/q1/explain.txt

@@ -1,65 +1,58 @@
TakeOrderedAndProject [c_customer_id]
WholeStageCodegen (9)
WholeStageCodegen
Project [c_customer_id]
BroadcastHashJoin [ctr_customer_sk,c_customer_sk]
BroadcastHashJoin [c_customer_sk,ctr_customer_sk]
Project [ctr_customer_sk]
BroadcastHashJoin [ctr_store_sk,s_store_sk]
Project [ctr_customer_sk,ctr_store_sk]
BroadcastHashJoin [ctr_store_sk,ctr_store_skL,ctr_total_return,(CAST(avg(ctr_total_return) AS DECIMAL(21,6)) * CAST(1.2 AS DECIMAL(21,6)))]
BroadcastHashJoin [(CAST(avg(ctr_total_return) AS DECIMAL(21,6)) * CAST(1.2 AS DECIMAL(21,6))),ctr_store_sk,ctr_store_skL,ctr_total_return]
Filter [ctr_total_return]
HashAggregate [sr_customer_sk,sr_store_sk,sum] [sum(UnscaledValue(sr_return_amt)),ctr_customer_sk,ctr_store_sk,ctr_total_return,sum]
HashAggregate [sr_customer_sk,sr_store_sk,sum,sum(UnscaledValue(sr_return_amt))] [ctr_customer_sk,ctr_store_sk,ctr_total_return,sum,sum(UnscaledValue(sr_return_amt))]
InputAdapter
Exchange [sr_customer_sk,sr_store_sk] #1
WholeStageCodegen (2)
HashAggregate [sr_customer_sk,sr_store_sk,sr_return_amt] [sum,sum]
Project [sr_customer_sk,sr_store_sk,sr_return_amt]
BroadcastHashJoin [sr_returned_date_sk,d_date_sk]
Filter [sr_returned_date_sk,sr_store_sk,sr_customer_sk]
ColumnarToRow
InputAdapter
Scan parquet default.store_returns [sr_returned_date_sk,sr_customer_sk,sr_store_sk,sr_return_amt]
WholeStageCodegen
HashAggregate [sr_customer_sk,sr_return_amt,sr_store_sk,sum,sum] [sum,sum]
Project [sr_customer_sk,sr_return_amt,sr_store_sk]
BroadcastHashJoin [d_date_sk,sr_returned_date_sk]
Project [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk]
Filter [sr_customer_sk,sr_returned_date_sk,sr_store_sk]
Scan parquet default.store_returns [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk] [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk]
InputAdapter
BroadcastExchange #2
WholeStageCodegen (1)
WholeStageCodegen
Project [d_date_sk]
Filter [d_year,d_date_sk]
ColumnarToRow
InputAdapter
Scan parquet default.date_dim [d_date_sk,d_year]
Filter [d_date_sk,d_year]
Scan parquet default.date_dim [d_date_sk,d_year] [d_date_sk,d_year]
InputAdapter
BroadcastExchange #3
WholeStageCodegen (6)
WholeStageCodegen
Filter [(CAST(avg(ctr_total_return) AS DECIMAL(21,6)) * CAST(1.2 AS DECIMAL(21,6)))]
HashAggregate [ctr_store_sk,sum,count] [avg(ctr_total_return),(CAST(avg(ctr_total_return) AS DECIMAL(21,6)) * CAST(1.2 AS DECIMAL(21,6))),ctr_store_skL,sum,count]
HashAggregate [avg(ctr_total_return),count,ctr_store_sk,sum] [(CAST(avg(ctr_total_return) AS DECIMAL(21,6)) * CAST(1.2 AS DECIMAL(21,6))),avg(ctr_total_return),count,ctr_store_skL,sum]
InputAdapter
Exchange [ctr_store_sk] #4
WholeStageCodegen (5)
HashAggregate [ctr_store_sk,ctr_total_return] [sum,count,sum,count]
HashAggregate [sr_customer_sk,sr_store_sk,sum] [sum(UnscaledValue(sr_return_amt)),ctr_store_sk,ctr_total_return,sum]
WholeStageCodegen
HashAggregate [count,count,ctr_store_sk,ctr_total_return,sum,sum] [count,count,sum,sum]
HashAggregate [sr_customer_sk,sr_store_sk,sum,sum(UnscaledValue(sr_return_amt))] [ctr_store_sk,ctr_total_return,sum,sum(UnscaledValue(sr_return_amt))]
InputAdapter
Exchange [sr_customer_sk,sr_store_sk] #5
WholeStageCodegen (4)
HashAggregate [sr_customer_sk,sr_store_sk,sr_return_amt] [sum,sum]
Project [sr_customer_sk,sr_store_sk,sr_return_amt]
BroadcastHashJoin [sr_returned_date_sk,d_date_sk]
Filter [sr_returned_date_sk,sr_store_sk]
ColumnarToRow
InputAdapter
Scan parquet default.store_returns [sr_returned_date_sk,sr_customer_sk,sr_store_sk,sr_return_amt]
WholeStageCodegen
HashAggregate [sr_customer_sk,sr_return_amt,sr_store_sk,sum,sum] [sum,sum]
Project [sr_customer_sk,sr_return_amt,sr_store_sk]
BroadcastHashJoin [d_date_sk,sr_returned_date_sk]
Project [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk]
Filter [sr_returned_date_sk,sr_store_sk]
Scan parquet default.store_returns [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk] [sr_customer_sk,sr_return_amt,sr_returned_date_sk,sr_store_sk]
InputAdapter
ReusedExchange [d_date_sk] #2
ReusedExchange [d_date_sk] [d_date_sk] #2
InputAdapter
BroadcastExchange #6
WholeStageCodegen (7)
WholeStageCodegen
Project [s_store_sk]
Filter [s_state,s_store_sk]
ColumnarToRow
InputAdapter
Scan parquet default.store [s_store_sk,s_state]
Scan parquet default.store [s_state,s_store_sk] [s_state,s_store_sk]
InputAdapter
BroadcastExchange #7
WholeStageCodegen (8)
Filter [c_customer_sk]
ColumnarToRow
InputAdapter
Scan parquet default.customer [c_customer_sk,c_customer_id]
WholeStageCodegen
Project [c_customer_id,c_customer_sk]
Filter [c_customer_sk]
Scan parquet default.customer [c_customer_id,c_customer_sk] [c_customer_id,c_customer_sk]