[SPARK-42942][SQL] Support coalesce table cache stage partitions #40574

ulysses-you · 2023-03-28T05:42:47Z

What changes were proposed in this pull request?

Add a new rule CoalesceCachePartitions to support coalesce partitions with TableCacheQueryStageExec. In order to reuse the code path with CoalesceShufflePartitions, this pr also does a small refactor about how we coalesce partitions.

RDD cache use the RDD id and partition id as the block id, so it seems not possible to split skewd partitions like shuffle. To reduce complexity, this pr does not allow coalesce partitions with both shuffle and cache stage since shuffle read may contain skewed partition spec.

For example, the follow case can not be coalesced by both CoalesceCachePartitions and CoalesceShufflePartitions.

SMJ
  ShuffleQueryStage
  TableCacheStage

Why are the changes needed?

Make AQE support coalesce table cache stage partitions.

Does this PR introduce any user-facing change?

yes, add a new config to control if coalesce partitions for table cache stage.

How was this patch tested?

add tests

ulysses-you · 2023-03-28T06:08:13Z

cc @dongjoon-hyun @cloud-fan @viirya @yaooqinn thank you

cloud-fan · 2023-03-28T13:33:46Z

sql/core/src/main/scala/org/apache/spark/sql/execution/CachedRDD.scala

+/**
+ * The [[Partition]] used by [[CachedRDD]].
+ */
+case class CachedRDDPartition(


can we add some PR comments to indicate which code are copied from somewhere and which code are new?

Most are from CoalescedRDD,

spark/core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala

Lines 74 to 78 in 5a56c17

private[spark] class CoalescedRDD[T: ClassTag](

@transient var prev: RDD[T],

maxPartitions: Int,

partitionCoalescer: Option[PartitionCoalescer] = None)

extends RDD[T](prev.context, Nil) { // Nil since we implement getDependencies

that has similar use case to coalesce RDD partition. The main difference is CachedRDD know its target partitions which make things simple. All core methods in CachedRDD can reuse the origin of previous RDD.

So if that's the case, is there a reason why we can not implement this with CoalescedRDD and supplying the correct partitionCoalescer: Option[PartitionCoalescer]? It feels a little strange having an RDD classes code copied from core to SQL.

jaceklaskowski

There is so much to review...just skimmed over the PR.

jaceklaskowski · 2023-03-31T13:30:42Z

sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AQERead.scala

+      // `RoundRobinPartitioning` but we don't need to retain the number of partitions.
+      case r: RoundRobinPartitioning =>
+        r.copy(numPartitions = numPartitions)
+      case other@SinglePartition =>


nit: spaces around @?

jaceklaskowski · 2023-03-31T13:31:18Z

sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AQERead.scala

+        r.copy(numPartitions = numPartitions)
+      case other@SinglePartition =>
+        throw new IllegalStateException(
+          "Unexpected partitioning for coalesced shuffle read: " + other)


nit: s/Unexpected/Illegal ?

jaceklaskowski · 2023-03-31T13:32:00Z

sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AQERead.scala

+      case _ =>
+        // Spark plugins may have custom partitioning and may replace this operator
+        // during the postStageOptimization phase, so return UnknownPartitioning here
+        // rather than throw an exception


Can we make this comment a DEBUG message?

holdenk · 2023-07-08T23:21:42Z

sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AQERead.scala

+abstract class AQERead extends UnaryExecNode {
+  def child: SparkPlan
+  def partitionSpecs: Seq[ShufflePartitionSpec]


Can we have a comment about what AQERead is/does?

holdenk · 2023-07-08T23:24:24Z

sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceCachePartitions.scala

+    if (specsMap.nonEmpty) {
+      updateCacheReads(plan, specsMap)
+    } else {
+      plan
+    }


So this only applies to reads?

github-actions · 2023-10-17T00:17:57Z

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

github-actions bot added the SQL label Mar 28, 2023

github-actions bot added CORE PYTHON labels Mar 28, 2023

cloud-fan reviewed Mar 28, 2023

View reviewed changes

Support coalesce table cache stage partitions

229a57c

ulysses-you force-pushed the coalesce-cache-partition branch from 75409df to 229a57c Compare March 29, 2023 01:49

jaceklaskowski reviewed Mar 31, 2023

View reviewed changes

holdenk reviewed Jul 8, 2023

View reviewed changes

github-actions bot added the Stale label Oct 17, 2023

github-actions bot closed this Oct 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-42942][SQL] Support coalesce table cache stage partitions #40574

[SPARK-42942][SQL] Support coalesce table cache stage partitions #40574

ulysses-you commented Mar 28, 2023

ulysses-you commented Mar 28, 2023

cloud-fan Mar 28, 2023

ulysses-you Mar 29, 2023

holdenk Jul 8, 2023

jaceklaskowski left a comment

jaceklaskowski Mar 31, 2023

jaceklaskowski Mar 31, 2023

jaceklaskowski Mar 31, 2023

holdenk Jul 8, 2023

holdenk Jul 8, 2023

github-actions bot commented Oct 17, 2023

	private[spark] class CoalescedRDD[T: ClassTag](
	@transient var prev: RDD[T],
	maxPartitions: Int,
	partitionCoalescer: Option[PartitionCoalescer] = None)
	extends RDD[T](prev.context, Nil) { // Nil since we implement getDependencies

[SPARK-42942][SQL] Support coalesce table cache stage partitions #40574

[SPARK-42942][SQL] Support coalesce table cache stage partitions #40574

Conversation

ulysses-you commented Mar 28, 2023

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

ulysses-you commented Mar 28, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jaceklaskowski left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Oct 17, 2023