[GLUTEN-4827][UT] Add Golden Files for TPC-H Spark34 + Gluten Execution Plan #4828

Merged
merged 3 commits into from
Mar 14, 2024

Conversation

zwangsheng
Contributor

What changes were proposed in this pull request?

Close #4827

As title, follow up to add TPC-H + Spark33 Golden Files.

How was this patch tested?

unit tests


github-actions bot commented Mar 1, 2024

#4827


github-actions bot commented Mar 1, 2024

Run Gluten Clickhouse CI

@zwangsheng marked this pull request as draft March 1, 2024 08:41

github-actions bot commented Mar 1, 2024

Run Gluten Clickhouse CI

@zwangsheng marked this pull request as ready for review March 1, 2024 09:14

github-actions bot commented Mar 1, 2024

Run Gluten Clickhouse CI

- name: Clean temp golden files
  if: failure()
  run: |
    rm -rf /tmp/$GITHUB_RUN_ID/spark34/tpch-approved-plan
Contributor

Can this be moved into the "Upload golden files" GHA task above? If feasible, maybe we can unify them (the two or three tasks above) into one task.

Contributor Author

I get your point; I will look further into how to resolve this.


github-actions bot commented Mar 8, 2024

Run Gluten Clickhouse CI

runs-on: velox-self-hosted
needs: [ubuntu2004-test-spark32, ubuntu2004-test-spark33, ubuntu2004-test-spark34]
steps:
  - name: Upload golden files
Contributor

if: failure()

Contributor Author

Right, we should trigger this upload job when a previous job fails.

And according to the GitHub Actions documentation, we should put this condition on the job instead of on each step.
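
A minimal sketch of the job-level condition discussed above, assuming the upload lives in its own job. The job name `upload-golden-files`, the artifact name, and the use of `actions/upload-artifact@v4` are illustrative assumptions and are not taken from this PR; the runner label, the `needs` list, and the path come from the snippets under review. The fragment belongs under the workflow's existing `jobs:` key, next to the three test jobs it depends on.

```yaml
jobs:
  # Hypothetical job name, used for illustration only.
  upload-golden-files:
    runs-on: velox-self-hosted
    needs: [ubuntu2004-test-spark32, ubuntu2004-test-spark33, ubuntu2004-test-spark34]
    # Job-level condition: the whole job runs only when at least one of the
    # jobs listed in `needs` has failed, so individual steps need no `if:`.
    if: failure()
    steps:
      - name: Upload golden files
        # Assumption: the stock artifact upload action; the PR may use another mechanism.
        uses: actions/upload-artifact@v4
        with:
          name: golden-files   # hypothetical artifact name
          # `with:` values are not shell-expanded, so the run ID is read from
          # the `github` context rather than from $GITHUB_RUN_ID.
          path: /tmp/${{ github.run_id }}/spark34/tpch-approved-plan
```

With the condition on the job, a fully green run skips the job entirely instead of starting it and skipping each step.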


github-actions bot commented Mar 8, 2024

Run Gluten Clickhouse CI

@zwangsheng marked this pull request as draft March 8, 2024 10:27

github-actions bot commented Mar 8, 2024

Run Gluten Clickhouse CI

@ulysses-you
Contributor

@zwangsheng is anything blocking this PR?

@zwangsheng
Contributor Author

> @zwangsheng is anything blocking this PR?

I'm still testing the merged upload step. But IMO, we can leave the merge-step job to a follow-up PR. WDYT @ulysses-you @PHILO-HE?

@PHILO-HE
Contributor

> > @zwangsheng is anything blocking this PR?
>
> I'm still testing the merged upload step. But IMO, we can leave the merge-step job to a follow-up PR. WDYT @ulysses-you @PHILO-HE?

OK with me to separate the work into two or more PRs.
I assume the upload action will also work well in the new CI we are working on. cc @zhouyuan.

@ulysses-you
Contributor

I'm fine with separating the PRs. @PHILO-HE IIUC the current action should work; only some code cleanup is left for another PR.


Run Gluten Clickhouse CI

@zwangsheng
Contributor Author

Thanks to both @ulysses-you and @PHILO-HE. I will revert this commit to focus on the Spark 34 golden files; after some testing, I will mark this PR as ready.

@zwangsheng marked this pull request as ready for review March 13, 2024 07:44

Run Gluten Clickhouse CI

@@ -670,3 +673,12 @@ jobs:
        if: ${{ always() }}
        run: |
          $PATH_TO_GLUTEN_TE/$OS_IMAGE_NAME/gha/gha-checkout/clean.sh

  clean-up-tmp:
Contributor

@zwangsheng It seems like overkill to add a common job just for cleanup. The GitHub Actions workflow becomes complex. I'd prefer to revert this change.

Contributor Author

IMO, it is acceptable to add a new job to do the cleanup, especially since we will later incorporate the upload operations into this job.

Contributor

I think we should merge upload and cleanup into one job, and it seems unnecessary to pull the upload out into a new job. There are many similar jobs in the Gluten CI; if we want to make things clearer, we should consider the other jobs as well.
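
One possible reading of this suggestion, sketched under assumptions: upload and cleanup stay as conditional steps inside the existing test job rather than moving to a new job. The job name `ubuntu2004-test-spark34` and the cleanup command come from the snippets in this review; the runner label for this job, the placeholder test step, the artifact name, and `actions/upload-artifact@v4` are assumptions made for illustration.

```yaml
jobs:
  ubuntu2004-test-spark34:
    runs-on: velox-self-hosted   # assumption: the test job's real runner label is not shown in this review
    steps:
      # Hypothetical placeholder standing in for the real build and test steps.
      - name: Run Spark 3.4 tests
        run: echo "build and test steps go here"

      # Runs only if an earlier step in this job failed.
      - name: Upload golden files
        if: failure()
        uses: actions/upload-artifact@v4   # assumption: stock artifact upload action
        with:
          name: golden-files-spark34       # hypothetical artifact name
          path: /tmp/${{ github.run_id }}/spark34/tpch-approved-plan

      # Also runs on failure, after the upload, so the files are removed
      # only once they have been captured.
      - name: Clean temp golden files
        if: failure()
        run: |
          rm -rf /tmp/$GITHUB_RUN_ID/spark34/tpch-approved-plan
```

Keeping both steps in the job that writes the files avoids relying on a later job landing on the same self-hosted machine to find them under `/tmp`.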


Run Gluten Clickhouse CI

@ulysses-you merged commit 09be9b8 into apache:main Mar 14, 2024
17 checks passed
@GlutenPerfBot
Contributor

===== Performance report for TPCH SF2000 with Velox backend, for reference only ====

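| query | log/native_4828_time.csv | log/native_master_03_13_2024_d7ed0844e_time.csv | difference | percentage |
|-------|--------------------------|--------------------------------------------------|------------|------------|
| q1 | 37.75 | 38.81 | 1.059 | 102.80% |
| q2 | 24.20 | 24.06 | -0.135 | 99.44% |
| q3 | 38.00 | 38.18 | 0.177 | 100.47% |
| q4 | 36.25 | 38.49 | 2.234 | 106.16% |
| q5 | 67.82 | 69.71 | 1.885 | 102.78% |
| q6 | 7.41 | 7.45 | 0.047 | 100.64% |
| q7 | 83.82 | 82.46 | -1.363 | 98.37% |
| q8 | 85.28 | 83.21 | -2.068 | 97.57% |
| q9 | 121.80 | 121.83 | 0.031 | 100.03% |
| q10 | 47.07 | 44.36 | -2.710 | 94.24% |
| q11 | 20.03 | 20.90 | 0.870 | 104.34% |
| q12 | 28.41 | 28.06 | -0.350 | 98.77% |
| q13 | 47.25 | 46.88 | -0.370 | 99.22% |
| q14 | 18.59 | 21.98 | 3.386 | 118.21% |
| q15 | 31.40 | 33.08 | 1.685 | 105.37% |
| q16 | 14.31 | 13.84 | -0.468 | 96.73% |
| q17 | 100.67 | 101.80 | 1.131 | 101.12% |
| q18 | 142.34 | 141.05 | -1.291 | 99.09% |
| q19 | 13.85 | 15.07 | 1.215 | 108.77% |
| q20 | 26.38 | 27.03 | 0.642 | 102.43% |
| q21 | 228.38 | 229.52 | 1.146 | 100.50% |
| q22 | 13.89 | 13.86 | -0.032 | 99.77% |
| total | 1234.91 | 1241.63 | 6.720 | 100.54% |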

taiyang-li pushed a commit to bigo-sg/gluten that referenced this pull request Mar 25, 2024
Development

Successfully merging this pull request may close these issues.

[UT] Add Golden Files for TPC-H Spark34 + Gluten Execution Plan
4 participants