
[SPARK-12991] [SQL] Establish a link between SparkPlan and LogicalPlan nodes #11036

Closed

Conversation

mbautin (Contributor) commented Feb 2, 2016

This is a prerequisite for reusing RDDs corresponding to shared query
fragments between different Spark SQL queries, which helps improve
performance significantly on many analytical workloads even without
explicitly caching any tables.

mbautin and others added 2 commits February 2, 2016 15:48

  • …n nodes
  • Add an accessor method for `_logicalPlan`.
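For context, the shape of the change can be pictured roughly as follows. This is a hypothetical sketch, not the actual patch: only the field name `_logicalPlan` appears in the commit message; the stub trait names and method signatures here are assumptions.

```scala
// Hypothetical sketch: a physical plan node keeps an optional back-reference
// to the logical plan it was generated from. Names other than `_logicalPlan`
// are illustrative stand-ins, not Spark's actual classes.
trait LogicalPlanStub

trait SparkPlanStub {
  // The logical plan this physical plan was planned from, if known.
  private var _logicalPlan: Option[LogicalPlanStub] = None

  // The planner would call this when it converts a logical node.
  def setLogicalPlan(plan: LogicalPlanStub): Unit = {
    _logicalPlan = Some(plan)
  }

  // Accessor method for `_logicalPlan`, as the second commit describes.
  def logicalPlan: Option[LogicalPlanStub] = _logicalPlan
}
```

With such a link, any consumer holding a physical plan (or an RDD derived from one) could walk back to the originating logical fragment.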
cloud-fan (Contributor) commented
Did you link to the correct JIRA ticket? And what's your overall design? This is a big change, and we need to discuss whether it's worth it.


SparkQA commented Feb 3, 2016

Test build #50614 has finished for PR 11036 at commit b79fd07.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mbautin mbautin changed the title [SPARK-12291] [SQL] Establish a link between SparkPlan and LogicalPlan nodes [SPARK-12991] [SQL] Establish a link between SparkPlan and LogicalPlan nodes Feb 3, 2016
mbautin (Contributor, Author) commented Feb 3, 2016

Sorry -- linked to the wrong JIRA ticket. Should be https://issues.apache.org/jira/browse/SPARK-12991.

mbautin (Contributor, Author) commented Feb 3, 2016

@cloud-fan: the overall design for the feature this is needed for (query fragment RDD reuse) is at https://issues.apache.org/jira/browse/SPARK-11838. This seems to be the only part that cannot be done without modifying the Spark SQL code, because we need some way to find the logical plans corresponding to generated RDDs.
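The reuse mechanism mbautin describes could be sketched like this. Everything here is an illustrative assumption, not Spark API: the cache, its key type, and all names are invented; in the real proposal the key would presumably be a (canonicalized) logical plan and the value an RDD.

```scala
import scala.collection.mutable

// Illustrative sketch of query-fragment reuse: once a generated RDD can be
// traced back to its logical plan, the RDD can be keyed by that plan and
// reused when another query contains the same fragment.
final case class Fragment(description: String) // stand-in for a canonicalized logical plan

final class FragmentCache[R] {
  private val cache = mutable.Map.empty[Fragment, R]
  private var computations = 0

  // Return the cached result for this fragment, computing it only on a miss.
  def getOrCompute(fragment: Fragment)(compute: => R): R =
    cache.getOrElseUpdate(fragment, { computations += 1; compute })

  def computeCount: Int = computations
}
```

Two queries sharing a fragment would then materialize it once, which is the performance win the PR description claims for analytical workloads.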

rxin (Contributor) commented Feb 3, 2016

This is not going to work at all with whole-stage codegen, in which we collapse all pipelinable operators into a single generated function.

mbautin (Contributor, Author) commented Feb 3, 2016

Even in that case, we could still obtain RDDs corresponding to SparkPlan nodes at stage boundaries, right? We would still find that useful in our query workload.

rxin (Contributor) commented Feb 4, 2016

How would the change here help you with that?


SparkQA commented Feb 21, 2016

Test build #51606 has finished for PR 11036 at commit b79fd07.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.
