Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-37453][INFRA][SQL][TESTS] Split TPC-DS build in GitHub Actions #34698

Closed
wants to merge 2 commits into from

Conversation

HyukjinKwon
Copy link
Member

What changes were proposed in this pull request?

This is kind of a followup for #33510 and #34641. This PR proposes to split TPC-DS build in GitHub Actions.

Why are the changes needed?

Running these queries easily causes out-of-memory in GitHub Actions machines, and make the build flaky. We should deflake it.

Does this PR introduce any user-facing change?

No, dev-only.

How was this patch tested?

GitHub Actions in this PR should test it out.

@HyukjinKwon HyukjinKwon marked this pull request as draft November 24, 2021 10:47
@HyukjinKwon HyukjinKwon changed the title [SPARK-37453][INFRA][SQL][TESTS] Split TPC-DS build in GitHub Actions [WIP][SPARK-37453][INFRA][SQL][TESTS] Split TPC-DS build in GitHub Actions Nov 24, 2021
@SparkQA

This comment has been minimized.

@HyukjinKwon HyukjinKwon changed the title [WIP][SPARK-37453][INFRA][SQL][TESTS] Split TPC-DS build in GitHub Actions [SPARK-37453][INFRA][SQL][TESTS] Split TPC-DS build in GitHub Actions Nov 24, 2021
@HyukjinKwon HyukjinKwon marked this pull request as ready for review November 24, 2021 12:17
@SparkQA

This comment has been minimized.

@HyukjinKwon HyukjinKwon force-pushed the split-tpcds branch 3 times, most recently from 2ffa2da to 36b3473 Compare November 24, 2021 13:00
joinConfSet.tail.foreach { conf =>
System.gc() // SPARK-37368
runQuery(queryString, goldenFile, conf.toSeq, true)
}
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While I am here, I ended up with refactoring this code part here. No behavior change. Everything is as was.

@SparkQA

This comment has been minimized.

@HyukjinKwon
Copy link
Member Author

cc @cloud-fan, @linhongliu-db and @jerqi

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@HyukjinKwon
Copy link
Member Author

Tests passed.

Merged to master.

@SparkQA
Copy link

SparkQA commented Nov 25, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50068/

@SparkQA
Copy link

SparkQA commented Nov 25, 2021

Test build #145596 has finished for PR 34698 at commit 8fa16bb.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon HyukjinKwon deleted the split-tpcds branch January 4, 2022 00:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
4 participants