[SPARK-13351] [SQL] fix column pruning on Expand #11225

davies · 2016-02-17T00:24:27Z

Currently, the columns in projects of Expand that are not used by Aggregate are not pruned, this PR fix that.

SparkQA · 2016-02-17T01:53:58Z

Test build #51397 has finished for PR 11225 at commit bfa6896.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

JoshRosen · 2016-02-18T20:59:33Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala

@@ -300,6 +300,16 @@ object SetOperationPushDown extends Rule[LogicalPlan] with PredicateHelper {
 */
 object ColumnPruning extends Rule[LogicalPlan] {
  def apply(plan: LogicalPlan): LogicalPlan = plan transform {
+    case a @ Aggregate(_, _, e @ Expand(projects, output, child))


To summarize my understanding for other reviewers:

This new rule handles the case where you have an expand beneath an aggregate and the expand produces rows with columns which are not referenced by the aggregate operator. In this case, we want to rewrite the expand's projections in order to eliminate the unreferenced column.

JoshRosen · 2016-02-18T21:06:40Z

LGTM. This change looks correct to me.

I'm going to merge this PR into master.

fix column pruning on Expand

bfa6896

JoshRosen reviewed Feb 18, 2016
View reviewed changes

asfgit closed this in 26f38bb Feb 18, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-13351] [SQL] fix column pruning on Expand #11225

[SPARK-13351] [SQL] fix column pruning on Expand #11225

davies commented Feb 17, 2016

SparkQA commented Feb 17, 2016

JoshRosen Feb 18, 2016

JoshRosen commented Feb 18, 2016

[SPARK-13351] [SQL] fix column pruning on Expand #11225

[SPARK-13351] [SQL] fix column pruning on Expand #11225

Conversation

davies commented Feb 17, 2016

SparkQA commented Feb 17, 2016

JoshRosen Feb 18, 2016

Choose a reason for hiding this comment

JoshRosen commented Feb 18, 2016