[SPARK-11313][SQL] implement cogroup on DataSets #9279

cloud-fan · 2015-10-26T12:06:00Z

No description provided.

marmbrus · 2015-10-26T12:11:22Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicOperators.scala

@@ -513,3 +513,16 @@ case class MapGroups[K, T, U](
  override def missingInput: AttributeSet = AttributeSet.empty
 }

+case class CoGroup2[K, T, U, V](


Consider adding a factory like in the other cases to simplify implicit passing.

Also I'd probably just call it cogroup as we'll probably use a single variadic operator if we decide to do more than 2.

marmbrus · 2015-10-26T12:16:47Z

Looking pretty good so far! Let's wait to add cogroup for more than 2 datasets.

SparkQA · 2015-10-26T14:28:50Z

Test build #44351 has finished for PR 9279 at commit d4b6920.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):\n * case class CoGroup2[K, T, U, V](\n * case class CoGroup2[K, T, U, V](\n

SparkQA · 2015-10-27T11:17:36Z

Test build #44424 has finished for PR 9279 at commit 1c7f4c0.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-10-27T11:22:19Z

Test build #44425 has finished for PR 9279 at commit 2f1cef0.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):\n * case class CoGroup[K, R](\n * case class CoGroup[K, T, U, R](\n

SparkQA · 2015-10-27T14:08:28Z

Test build #44426 has finished for PR 9279 at commit 2fb01ec.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):\n * case class CoGroup[K, R](\n * case class CoGroup[K, T, U, R](\n

cloud-fan · 2015-10-27T15:32:37Z

cc @marmbrus

cloud-fan · 2015-10-27T15:32:43Z

retest this please.

SparkQA · 2015-10-27T18:04:33Z

Test build #44442 has finished for PR 9279 at commit 2fb01ec.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):\n * case class CoGroup[K, R](\n * case class CoGroup[K, T, U, R](\n

cloud-fan · 2015-10-28T06:33:07Z

will open it again when we need to support cogroup on more than 2 datasets.

A simpler version of apache#9279, only support 2 datasets. Author: Wenchen Fan <wenchen@databricks.com> Closes apache#9324 from cloud-fan/cogroup2.

A simpler version of apache/spark#9279, only support 2 datasets. Author: Wenchen Fan <wenchen@databricks.com> Closes #9324 from cloud-fan/cogroup2.

marmbrus reviewed Oct 26, 2015
View reviewed changes

cloud-fan force-pushed the cogroup branch 2 times, most recently from 1c7f4c0 to 2f1cef0 Compare October 27, 2015 11:12

cloud-fan changed the title ~~[SPARK-11313][SQL][WIP] implement cogroup on DataSets~~ [SPARK-11313][SQL] implement cogroup on DataSets Oct 27, 2015

implement cogroup

2fb01ec

cloud-fan force-pushed the cogroup branch from 2f1cef0 to 2fb01ec Compare October 27, 2015 11:28

cloud-fan mentioned this pull request Oct 28, 2015

[SPARK-11313][SQL] implement cogroup on DataSets (support 2 datasets) #9324

Closed

cloud-fan closed this Oct 28, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-11313][SQL] implement cogroup on DataSets #9279

[SPARK-11313][SQL] implement cogroup on DataSets #9279

cloud-fan commented Oct 26, 2015

marmbrus Oct 26, 2015

marmbrus Oct 26, 2015

marmbrus commented Oct 26, 2015

SparkQA commented Oct 26, 2015

SparkQA commented Oct 27, 2015

SparkQA commented Oct 27, 2015

SparkQA commented Oct 27, 2015

cloud-fan commented Oct 27, 2015

cloud-fan commented Oct 27, 2015

SparkQA commented Oct 27, 2015

cloud-fan commented Oct 28, 2015

[SPARK-11313][SQL] implement cogroup on DataSets #9279

[SPARK-11313][SQL] implement cogroup on DataSets #9279

Conversation

cloud-fan commented Oct 26, 2015

marmbrus Oct 26, 2015

Choose a reason for hiding this comment

marmbrus Oct 26, 2015

Choose a reason for hiding this comment

marmbrus commented Oct 26, 2015

SparkQA commented Oct 26, 2015

SparkQA commented Oct 27, 2015

SparkQA commented Oct 27, 2015

SparkQA commented Oct 27, 2015

cloud-fan commented Oct 27, 2015

cloud-fan commented Oct 27, 2015

SparkQA commented Oct 27, 2015

cloud-fan commented Oct 28, 2015