Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-32741][SQL][FOLLOWUP] Run plan integrity check only for effective plan changes #29928

Closed
wants to merge 1 commit into from

Conversation

maropu
Copy link
Member

@maropu maropu commented Oct 2, 2020

What changes were proposed in this pull request?

(This is a followup PR of #29585) The PR modified RuleExecutor#isPlanIntegral code for checking if a plan has globally-unique attribute IDs, but this check made Jenkins maven test jobs much longer (See the Dongjoon comment and thanks, @dongjoon-hyun !). To recover running time for the Jenkins tests, this PR intends to update the code to run plan integrity check only for effective plans.

Why are the changes needed?

To recover running time for Jenkins tests.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Existing tests.

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

I will check the running time of Jenkins. cc: @dongjoon-hyun

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33940/

@dongjoon-hyun
Copy link
Member

Thank you for this investigation, @maropu .
Also, cc @cloud-fan .

@dongjoon-hyun
Copy link
Member

dongjoon-hyun commented Oct 2, 2020

@maropu . Could you run Maven run? SBT run is parallelized by default.

@dongjoon-hyun dongjoon-hyun changed the title [WIP][SPARK-32741][SQL][FOLLOWUP] Run plan integrity check only for effective plans [WIP][SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plans Oct 2, 2020
@dongjoon-hyun
Copy link
Member

Retest this please

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

Thanks for the update, @dongjoon-hyun

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33940/

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33942/

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33942/

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Test build #129327 has finished for PR 29928 at commit 6582f90.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Test build #129329 has finished for PR 29928 at commit 6582f90.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

okay, it seems the running time got recovered: https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129327/

FYI: latest running test:
https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7-hive-2.3/
#848 => Took 8 hr 1 min (PR #29585 merged)
#847 => Took 8 hr 1 min (PR #29585 merged)
#846 => Took 6 hr 45 min
#845 => Took 6 hr 41 min
#844 => Took 6 hr 43 min

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

retest this please

@maropu maropu changed the title [WIP][SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plans [SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plans Oct 2, 2020
@maropu maropu changed the title [SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plans [SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plan changes Oct 2, 2020
@dongjoon-hyun
Copy link
Member

Oh, is it recovered? It's great, @maropu .

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-32741][SQL][FOLLOWUP][test-maven] Run plan integrity check only for effective plan changes [SPARK-32741][SQL][FOLLOWUP] Run plan integrity check only for effective plan changes Oct 2, 2020
@dongjoon-hyun
Copy link
Member

Thank you so much, @maropu .

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

This PR is still an improvement follow-up. Did I understand correctly, @maropu ?

Yea, right, I think so. The latest test run seems much faster than the previous ones: Took 5 hr 6 min v.s. 6 hr 41-45 min https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129329/
I'm not 100% sure that this is true now, but probably, that's because this update skips the other unencessary integrity checks, too:

plan.find(PlanHelper.specialExpressionsInUnsupportedOperator(_).nonEmpty).isEmpty &&

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Test build #129331 has finished for PR 29928 at commit 6582f90.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

(I will re-run the test a couple of times before merging this..)

@dongjoon-hyun
Copy link
Member

Sure, go ahead~

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33952/

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/33952/

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Test build #129341 has finished for PR 29928 at commit 6582f90.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

I checked Took 5 hr 36 min in https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129341/testReport/
This PR looks fine, so I'll merge this.

@maropu maropu closed this in 82721ce Oct 2, 2020
@maropu
Copy link
Member Author

maropu commented Oct 2, 2020

Thanks, @dongjoon-hyun ! Merged to master.

@SparkQA
Copy link

SparkQA commented Oct 2, 2020

Test build #129340 has finished for PR 29928 at commit 6582f90.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Copy link
Member

dongjoon-hyun commented Oct 2, 2020

Thank you so much again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants