Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-33272][SPARK-36815][3.0][SQL] prune the attributes mapping in QueryPlan.transformUpWithNewOutput #34068

Closed
wants to merge 1 commit into from

Conversation

gaoyajun02
Copy link
Contributor

@gaoyajun02 gaoyajun02 commented Sep 22, 2021

What changes were proposed in this pull request?

This is a backport PR of #30173.

For complex query plans, QueryPlan.transformUpWithNewOutput will keep accumulating the attributes mapping to be propagated, which may hurt performance. This PR prunes the attributes mapping before propagating.

Why are the changes needed?

A simple perf improvement.

Does this PR introduce any user-facing change?

No

How was this patch tested?

existing tests

…mUpWithNewOutput

### What changes were proposed in this pull request?

For complex query plans, `QueryPlan.transformUpWithNewOutput` will keep accumulating the attributes mapping to be propagated, which may hurt performance. This PR prunes the attributes mapping before propagating.

### Why are the changes needed?

A simple perf improvement.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

existing tests

Closes apache#30173 from cloud-fan/bug.

Authored-by: Wenchen Fan <wenchen@databricks.com>
Signed-off-by: Takeshi Yamamuro <yamamuro@apache.org>

(cherry picked from commit 2639ad4)
@viirya viirya changed the title [SPARK-33272][SPARK-36815][SQL] prune the attributes mapping in QueryPlan.transformUpWithNewOutput [SPARK-33272][SPARK-36815][3.0][SQL] prune the attributes mapping in QueryPlan.transformUpWithNewOutput Sep 22, 2021
@viirya
Copy link
Member

viirya commented Sep 22, 2021

ok to test

@SparkQA
Copy link

SparkQA commented Sep 22, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48013/

@SparkQA
Copy link

SparkQA commented Sep 22, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48013/

@SparkQA
Copy link

SparkQA commented Sep 22, 2021

Test build #143502 has finished for PR 34068 at commit a569799.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@viirya
Copy link
Member

viirya commented Sep 22, 2021

Thanks! Merging to 3.0.

viirya pushed a commit that referenced this pull request Sep 22, 2021
…QueryPlan.transformUpWithNewOutput

### What changes were proposed in this pull request?

This is a backport PR of #30173.

For complex query plans, `QueryPlan.transformUpWithNewOutput` will keep accumulating the attributes mapping to be propagated, which may hurt performance. This PR prunes the attributes mapping before propagating.

### Why are the changes needed?

A simple perf improvement.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

existing tests

Closes #34068 from gaoyajun02/SPARK-33272.

Authored-by: Wenchen Fan <wenchen@databricks.com>
Signed-off-by: Liang-Chi Hsieh <viirya@gmail.com>
@viirya viirya closed this Sep 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants