Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[VL] Optimize GHA workflow with nightly built docker image (part-1) #4293

Merged
merged 11 commits into from
Jan 10, 2024

Conversation

PZD-CHINA
Copy link
Contributor

The patch targets to do CI cleanup.

.github/workflows/velox_be.yml static-build-test and build-script-test need further debug.

Copy link

github-actions bot commented Jan 5, 2024

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/oap-project/gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

@PHILO-HE PHILO-HE changed the title [VL] GHA CI cleanup [VL] Optimize GHA workflow with nightly built docker image Jan 5, 2024
Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

3 similar comments
Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

3 similar comments
Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

@PHILO-HE
Copy link
Contributor

PHILO-HE commented Jan 5, 2024

Please replace /opt/spark331 by /opt/spark342 in the below two pieces of code.
Using spark 331 binary for spark 342 tests has some potential issues. In the new docker, suppose spark-342 source is downloaded and then built. See code link.

https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L238

https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L276

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

3 similar comments
Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

DOCKER_REGISTRY_PORT: 5000

PATH_TO_GLUTEN_TE: ./tools/gluten-te
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should be able to use the Github environment variables

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hongze once removed the use of github environment in a revert commit: 385b252. I will check this with him.

Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

2 similar comments
Copy link

github-actions bot commented Jan 5, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 6, 2024

Run Gluten Clickhouse CI

Copy link

github-actions bot commented Jan 6, 2024

Run Gluten Clickhouse CI

@zhouyuan
Copy link
Contributor

zhouyuan commented Jan 8, 2024

please help to do a rebase, the CK backend failure is fixed in main branch

Copy link

github-actions bot commented Jan 9, 2024

Run Gluten Clickhouse CI

@ayushi-agarwal
Copy link
Contributor

@PZD-CHINA I'm not sure why these changes caused the failure in spark 34. Could you please disable these two failed unit tests and added them in this list for spark34 failed unit tests #3559? Thanks.

@JkSelf This PR will solve the issue #4203. As part of this PR #4273 we found that these were failing in our local when we give correct path and many other UT's pass when given the correct path.

PHILO-HE and others added 6 commits January 10, 2024 08:37
revert DEFAULT_DOCKER_CACHE_REGISTRY and DEFAULT_DOCKER_PUSH_REGISTRY set.

set  DOCKER_CACHE_REGISTRY and DOCKER_PUSH_REGISTRY in velxo_be.yml

update velxo_be.yml  build-script-test job
Copy link

Run Gluten Clickhouse CI

Copy link

Run Gluten Clickhouse CI

Copy link

Run Gluten Clickhouse CI

Copy link

Run Gluten Clickhouse CI

Copy link

Run Gluten Clickhouse CI

Copy link

Run Gluten Clickhouse CI

@PHILO-HE
Copy link
Contributor

Let's leave the change for yml in a follow-up PR and then really use the new docker after that.

Copy link
Contributor

@PHILO-HE PHILO-HE left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your work!

@PHILO-HE PHILO-HE changed the title [VL] Optimize GHA workflow with nightly built docker image [VL] Optimize GHA workflow with nightly built docker image (part-1) Jan 10, 2024
@PHILO-HE PHILO-HE merged commit f80408c into apache:main Jan 10, 2024
15 of 20 checks passed
@GlutenPerfBot
Copy link
Contributor

===== Performance report for TPCH SF2000 with Velox backend, for reference only ====

query log/native_4293_time.csv log/native_master_01_09_2024_d3df97aa0_time.csv difference percentage
q1 33.60 33.38 -0.225 99.33%
q2 25.39 25.66 0.271 101.07%
q3 38.36 37.88 -0.479 98.75%
q4 39.32 39.11 -0.209 99.47%
q5 73.23 71.57 -1.652 97.74%
q6 6.65 6.96 0.309 104.65%
q7 85.38 86.82 1.441 101.69%
q8 86.09 86.62 0.530 100.62%
q9 126.71 125.62 -1.088 99.14%
q10 43.79 42.71 -1.082 97.53%
q11 20.12 20.21 0.086 100.43%
q12 27.36 29.40 2.047 107.48%
q13 47.13 47.13 0.002 100.00%
q14 19.52 16.36 -3.158 83.82%
q15 29.13 27.94 -1.186 95.93%
q16 14.44 14.99 0.550 103.81%
q17 104.29 156.46 52.173 150.03%
q18 150.89 194.42 43.528 128.85%
q19 12.54 16.64 4.101 132.71%
q20 27.49 28.42 0.923 103.36%
q21 227.92 225.07 -2.850 98.75%
q22 13.84 14.11 0.271 101.96%
total 1253.19 1347.49 94.301 107.52%

@vibhaska
Copy link
Contributor

Please replace /opt/spark331 by /opt/spark342 in the below two pieces of code. Using spark 331 binary for spark 342 tests has some potential issues. In the new docker, suppose spark-342 source is downloaded and then built. See code link.

https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L238

https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L276

Has this been done already?

@vibhaska
Copy link
Contributor

Please replace /opt/spark331 by /opt/spark342 in the below two pieces of code. Using spark 331 binary for spark 342 tests has some potential issues. In the new docker, suppose spark-342 source is downloaded and then built. See code link.
https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L238

https://github.com/oap-project/gluten/blob/29f850086609d8922b30c163ea1687fbd25a2e8d/.github/workflows/velox_be.yml#L276

Has this been done already?

@JkSelf Possibly missed in checkin because its a ".github" folder, ignored by git.

@PHILO-HE
Copy link
Contributor

Hi @vibhaska, the spark-3.4 test home issue will be fixed by another PR: https://github.com/oap-project/gluten/pull/4362/files. Will merge that patch after CI passes.

@PZD-CHINA PZD-CHINA deleted the GHA-workflow branch January 16, 2024 05:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants