[SPARK-33162][INFRA][3.0] Use pre-built image at GitHub Action PySpark jobs #30253
Kubernetes integration test starting
Kubernetes integration test status failure
Test build #130623 has finished for PR 30253 at commit
Hi, @HyukjinKwon.
Thank you, @HyukjinKwon! Merged to branch-3.0.
…k jobs

### What changes were proposed in this pull request?

This is a backport of #30059.

This PR aims to use a `pre-built image` for GitHub Action PySpark jobs. To isolate the changes, `pyspark` jobs are split from the main job. The Docker image is built from the following.

| Item | URL |
| --------------- | ------------- |
| Dockerfile | https://github.com/dongjoon-hyun/ApacheSparkGitHubActionImage/blob/main/Dockerfile |
| Builder | https://github.com/dongjoon-hyun/ApacheSparkGitHubActionImage/blob/main/.github/workflows/build.yml |
| Image Location | https://hub.docker.com/r/dongjoon/apache-spark-github-action-image |

Please note the following.

1. The community will still use `build_and_test.yml` to add new features, as we have done until now. The `Dockerfile` will be updated regularly.
2. When Apache Spark gets an official Docker repository location, we will use it.
3. Also, it would be best to keep this Dockerfile and builder script in a new Apache Spark dev branch instead of an outside GitHub repository.

### Why are the changes needed?

This will reduce the installation time for Python and its packages.

**BEFORE (branch-3.0)**

![Screen Shot 2020-11-04 at 2 28 49 PM](https://user-images.githubusercontent.com/9700541/98174664-17f2e500-1eaa-11eb-9222-018eead9c418.png)

**AFTER (branch-3.0)**

![Screen Shot 2020-11-04 at 2 29 43 PM](https://user-images.githubusercontent.com/9700541/98174758-378a0d80-1eaa-11eb-8e6a-929158c2fea3.png)

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

The GitHub Action jobs on this PR pass without the `package installation steps`.

Closes #30253 from dongjoon-hyun/GHA-3.0.

Authored-by: Dongjoon Hyun <dhyun@apple.com>
Signed-off-by: Dongjoon Hyun <dhyun@apple.com>
Kubernetes integration test starting
Kubernetes integration test status failure
Test build #130635 has finished for PR 30253 at commit
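### What changes were proposed in this pull request?

This is a backport of #30059.

This PR aims to use a `pre-built image` for GitHub Action PySpark jobs. To isolate the changes, `pyspark` jobs are split from the main job. The Docker image is built from the following.

| Item | URL |
| --------------- | ------------- |
| Dockerfile | https://github.com/dongjoon-hyun/ApacheSparkGitHubActionImage/blob/main/Dockerfile |
| Builder | https://github.com/dongjoon-hyun/ApacheSparkGitHubActionImage/blob/main/.github/workflows/build.yml |
| Image Location | https://hub.docker.com/r/dongjoon/apache-spark-github-action-image |

Please note the following.

1. The community will still use `build_and_test.yml` to add new features, as we have done until now. The `Dockerfile` will be updated regularly.
2. When Apache Spark gets an official Docker repository location, we will use it.
3. Also, it would be best to keep this Dockerfile and builder script in a new Apache Spark dev branch instead of an outside GitHub repository.

### Why are the changes needed?

This will reduce the installation time for Python and its packages.

**BEFORE (branch-3.0)**

![Screen Shot 2020-11-04 at 2 28 49 PM](https://user-images.githubusercontent.com/9700541/98174664-17f2e500-1eaa-11eb-9222-018eead9c418.png)

**AFTER (branch-3.0)**

![Screen Shot 2020-11-04 at 2 29 43 PM](https://user-images.githubusercontent.com/9700541/98174758-378a0d80-1eaa-11eb-8e6a-929158c2fea3.png)

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

The GitHub Action jobs on this PR pass without the `package installation steps`.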
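
For readers unfamiliar with the approach, a separate `pyspark` job that runs inside a pre-built image might look roughly like the sketch below. This is an illustration under stated assumptions, not the actual contents of `build_and_test.yml`: the workflow name, trigger, runner, module matrix, and test command are assumed for the example, and `<tag>` is a hypothetical placeholder for an image tag. Only the image repository name comes from the table in the description.

```yaml
# Sketch only. Everything except the image repository name is an assumption.
name: Build and test (sketch)

on:
  pull_request:
    branches:
      - branch-3.0

jobs:
  # PySpark jobs split out from the main job so the change stays isolated.
  pyspark:
    name: "Build modules: ${{ matrix.modules }}"
    runs-on: ubuntu-latest
    # Run every step inside the pre-built image, so Python and its packages
    # do not have to be installed on the runner for each job.
    container:
      image: dongjoon/apache-spark-github-action-image:<tag>  # <tag> is a hypothetical placeholder
    strategy:
      fail-fast: false
      matrix:
        modules:          # assumed module names, for illustration
          - pyspark-sql
          - pyspark-core
    steps:
      - name: Checkout Spark repository
        uses: actions/checkout@v2
      # No package installation steps here: the image already provides them.
      - name: Run PySpark tests
        run: ./dev/run-tests --modules "${{ matrix.modules }}"  # assumed invocation
```

The point of the design is that the time previously spent on installation moves into the image build, which happens once per `Dockerfile` update instead of once per job run.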