Skip to content

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Jul 30, 2021

What changes were proposed in this pull request?

This PR aims to upgrade PySpark GitHub Action job to use the latest docker image 20210730 having sklearn and mlflow additionally.

$ docker run -it --rm dongjoon/apache-spark-github-action-image:20210730 python3.9 -m pip list | grep mlflow
mlflow                    1.19.0

$ docker run -it --rm dongjoon/apache-spark-github-action-image:20210730 python3.9 -m pip list | grep sklearn
sklearn                   0.0

Why are the changes needed?

This will save the installation time.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass the GitHub Action PySpark jobs.

@github-actions github-actions bot added the INFRA label Jul 30, 2021
@dongjoon-hyun
Copy link
Member Author

cc @HyukjinKwon

@SparkQA
Copy link

SparkQA commented Jul 30, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46417/

@HyukjinKwon
Copy link
Member

Merged to master and branch-3.2.

HyukjinKwon pushed a commit that referenced this pull request Jul 30, 2021
…0730

### What changes were proposed in this pull request?

This PR aims to upgrade PySpark GitHub Action job to use the latest docker image `20210730` having `sklearn` and `mlflow` additionally.
- dongjoon-hyun/ApacheSparkGitHubActionImage@5ca9445

```
$ docker run -it --rm dongjoon/apache-spark-github-action-image:20210730 python3.9 -m pip list | grep mlflow
mlflow                    1.19.0

$ docker run -it --rm dongjoon/apache-spark-github-action-image:20210730 python3.9 -m pip list | grep sklearn
sklearn                   0.0
```

### Why are the changes needed?

This will save the installation time.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Pass the GitHub Action PySpark jobs.

Closes #33595 from dongjoon-hyun/SPARK-36345.

Authored-by: Dongjoon Hyun <dhyun@apple.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 0e65ed5)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
@HyukjinKwon
Copy link
Member

the test failure shouldn't be related. it's from my mistake #33596

@ueshin
Copy link
Member

ueshin commented Jul 30, 2021

Oh, wait. Actually pandas in the docker images was upgraded to 1.3.1. I guess it caused the test failures.

pandas                    1.3.1

@SparkQA
Copy link

SparkQA commented Jul 30, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46417/

@HyukjinKwon
Copy link
Member

Oh I a bit rushed to merge. @ueshin would you mind taking a quick look please for pandas 1.3.1 upgrade? I can't work on it at this moment. Reverting it for now is fine to me too.

@ueshin
Copy link
Member

ueshin commented Jul 30, 2021

Or we can disable the tests for now (#33598).

@dongjoon-hyun
Copy link
Member Author

This is reverted.

@ueshin
Copy link
Member

ueshin commented Jul 31, 2021

@dongjoon-hyun This was also merged to 3.2. Could you revert it as well? Thanks!

@HyukjinKwon
Copy link
Member

Thanks guys!!

@SparkQA
Copy link

SparkQA commented Jul 31, 2021

Test build #141908 has finished for PR 33595 at commit 5367960.

  • This patch fails from timeout after a configured wait of 500m.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member

@dongjoon-hyun This was also merged to 3.2. Could you revert it as well? Thanks!

Done 👍

@dongjoon-hyun
Copy link
Member Author

dongjoon-hyun commented Aug 1, 2021

Oh. Thanks, @HyukjinKwon . I missed the ping here. Sorry, @ueshin .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants