[SPARK-25240][SQL] Fix for a deadlock in RECOVER PARTITIONS #22233
Conversation
@@ -1131,7 +1135,7 @@ abstract class DDLSuite extends QueryTest with SQLTestUtils {
   }

   test("alter table: recover partition (parallel)") {
-    withSQLConf("spark.rdd.parallelListingThreshold" -> "1") {
+    withSQLConf("spark.rdd.parallelListingThreshold" -> "0") {
@MaxGekk, out of curiosity, why does this have to be 0?
On the recursive calls this condition in
spark/sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala (lines 685 to 686 in 131ca14):

val result = if (partitionNames.length > 1 &&
    statuses.length > threshold || partitionNames.length > 2) {

is evaluated with statuses.length of 1 and threshold of 1, so 1 > 1 is false and it leads to sequential listing of files. I just enforce parallel scanning even for 1 file/folder by setting the threshold to 0.
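To see the guard concretely, here is a standalone sketch (hypothetical variable names; the real values come from the code quoted above) of why a threshold of 1 picks the sequential path while 0 picks the parallel one:

// Sketch of the parallel-vs-sequential guard from ddl.scala, evaluated
// with the values seen on the recursive calls: two partition columns
// remain and there is one sub-directory per level.
val partitionNamesLength = 2
val statusesLength = 1

def goesParallel(threshold: Int): Boolean =
  partitionNamesLength > 1 && statusesLength > threshold ||
    partitionNamesLength > 2

goesParallel(threshold = 1) // false: 1 > 1 fails, files are listed sequentially
goesParallel(threshold = 0) // true: 1 > 0 holds, listing runs in parallel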
Test build #95251 has finished for PR 22233 at commit
@@ -671,7 +674,7 @@ case class AlterTableRecoverPartitionsCommand(
         val value = ExternalCatalogUtils.unescapePathName(ps(1))
         if (resolver(columnName, partitionNames.head)) {
           scanPartitions(spark, fs, filter, st.getPath, spec ++ Map(partitionNames.head -> value),
-            partitionNames.drop(1), threshold, resolver)
+            partitionNames.drop(1), threshold, resolver, listFilesInParallel = false)
This change might introduce a performance regression. Do you know why it works when using .par previously?
This change might introduce a performance regression.

Right, if there is a significant imbalance among sub-folders, scanning will probably be slower.

Do you know why it works when using .par previously?

Scala parallel collections can cope with nested calls. See slide 12 of https://www.slideshare.net/AleksandarProkopec/scala-parallel-collections

@gatorsmile I can revert to Scala parallel collections here since we use them on the driver, and parmap is not necessary here.
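As a rough illustration of that point (a sketch, not code from this PR; it assumes Scala 2.12, where .par is part of the standard library): parallel collections schedule subtasks on a work-stealing ForkJoinPool, so a worker that waits on a nested computation helps execute queued subtasks instead of parking its thread.

// Nested .par calls complete even when nesting exceeds the pool size:
// a ForkJoinPool worker blocked on an inner computation steals and runs
// other pending subtasks rather than sitting idle.
object NestedParDemo extends App {
  val dirs = (1 to 100).toVector

  val total = dirs.par.map { d =>
    // The inner parallel collection shares the ForkJoinPool with the outer one.
    (1 to 100).toVector.par.map(f => d * f).sum
  }.sum

  println(total) // 25502500 = 5050 * 5050
}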
@MaxGekk Do you have a stack trace of each thread from when a deadlock occurs?
Yes, I do: jstack.txt
Thank you for attaching the stack trace. I have just looked at it, and it looks strange to me. Every thread is waiting on a scala.concurrent.impl.Promise$CompletionLatch. There is no blocker thread; only one locked monitor exists.
In a typical case, a deadlock occurs due to the existence of a blocker, as in the stack trace attached in #22221.
I will investigate it further tomorrow to decide whether we need this implementation instead of reverting to the original one based on Scala parallel collections.
...
- parking to wait for <0x0000000793c0d610> (a scala.concurrent.impl.Promise$CompletionLatch)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
at scala.concurrent.impl.Promise$DefaultPromise.tryAwait(Promise.scala:206)
at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:222)
at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:227)
at org.apache.spark.util.ThreadUtils$.awaitResult(ThreadUtils.scala:220)
at org.apache.spark.util.ThreadUtils$.parmap(ThreadUtils.scala:317)
at org.apache.spark.sql.execution.command.AlterTableRecoverPartitionsCommand.scanPartitions(ddl.scala:690)
at org.apache.spark.sql.execution.command.AlterTableRecoverPartitionsCommand.run(ddl.scala:626)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:70)
- locked <0x0000000793b04e88> (a org.apache.spark.sql.execution.command.ExecutedCommandExec)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:68)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.executeCollect(commands.scala:79)
...
Does it mean there is no available thread in the given thread pool when the program tries to execute a new Future?
@kiszk Right, all Futures do the same thing: each one tries to execute another Future on the same fixed thread pool.
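For reference, here is a minimal, self-contained repro of that failure mode (hypothetical names, not the Spark code): every Future submits a child Future to the same fixed-size pool and then blocks on it, so once all pool threads are held by blocked parents, the queued children can never run.

import java.util.concurrent.Executors
import scala.concurrent.duration.Duration
import scala.concurrent.{Await, ExecutionContext, Future}

object NestedFutureDeadlock extends App {
  // A fixed pool: once all of its threads are blocked, nothing else runs.
  implicit val ec: ExecutionContext =
    ExecutionContext.fromExecutor(Executors.newFixedThreadPool(2))

  def scan(depth: Int): Future[Int] = Future {
    if (depth == 0) 1
    else {
      // The parent holds a pool thread while awaiting its child, and the
      // child waits in the queue behind other blocked parents.
      Await.result(scan(depth - 1), Duration.Inf) + 1
    }
  }

  // Two parents occupy both pool threads and wait forever: the program hangs.
  val parents = (1 to 2).map(_ => scan(depth = 1))
  parents.foreach(p => println(Await.result(p, Duration.Inf)))
}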
I am worried whether a similar deadlock may occur in other places due to
- parallelism larger than the fixed thread pool
- nested parallelism like this one

I also realized there is another parmap implementation that uses a thread pool. Can we use the other implementation?
Can we use the other implementation?

This is what @zsxwing proposed. Please look at my comment: #22233 (comment)
Got it, sorry for overlooking that. Are the other places safe because their parallelism would not reach the fixed thread pool size?
Test build #95343 has finished for PR 22233 at commit
@@ -52,23 +52,24 @@ class InMemoryCatalogedDDLSuite extends DDLSuite with SharedSQLContext with Befo
   protected override def generateTable(
       catalog: SessionCatalog,
       name: TableIdentifier,
-      isDataSource: Boolean = true): CatalogTable = {
+      isDataSource: Boolean = true,
+      partitionCols: Seq[String] = Seq("a", "b")): CatalogTable = {
A question about the changes in this file. Are they related to the work of this PR?
@gatorsmile Yes, the changes are related to an existing test that was modified to reproduce the issue. In particular, this line adds support for any number of partition columns.
@@ -60,7 +60,8 @@ class HiveCatalogedDDLSuite extends DDLSuite with TestHiveSingleton with BeforeA
   protected override def generateTable(
       catalog: SessionCatalog,
       name: TableIdentifier,
-      isDataSource: Boolean): CatalogTable = {
+      isDataSource: Boolean,
+      partitionCols: Seq[String] = Seq("a", "b")): CatalogTable = {
The interface of this function looks strange. The original one is also hacky. We should refine them later.
Yeah, please don't override a method with a default parameter. It's very easy to end up with different default values, and then the value that gets picked up will depend on the type you are using...
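A small sketch of that gotcha (hypothetical classes, not the suite code): in Scala, default-argument getters take part in overriding, so when an override declares a different default, the value actually used follows the runtime type of the instance, which is easy to miss once the two declarations drift apart.

class BaseSuite {
  def generateTable(partitionCols: Seq[String] = Seq("a", "b")): String =
    partitionCols.mkString(",")
}

class HiveSuite extends BaseSuite {
  // The override quietly declares a different default value.
  override def generateTable(partitionCols: Seq[String] = Seq("x")): String =
    partitionCols.mkString(",")
}

object DefaultParamPitfall extends App {
  val s: BaseSuite = new HiveSuite
  // Prints "x", not "a,b": the default comes from HiveSuite at runtime,
  // even though the static type of s is BaseSuite.
  println(s.generateTable())
}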
Basically, this PR is to revert the code to the original .par-based solution. LGTM, thanks! Merged to master.
What changes were proposed in this pull request?

In the PR, I propose not to perform recursive parallel listing of files in the scanPartitions method because it can cause a deadlock. Instead, I propose to run scanPartitions in parallel for top-level partitions only.

How was this patch tested?

I extended an existing test to trigger the deadlock.
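A simplified sketch of the resulting shape (the listChildren helper is hypothetical, not the actual Spark code): only the first level of partition directories is fanned out to the pool, and each task recurses sequentially on its own thread, so no pool thread ever blocks waiting on a task queued behind it.

import java.util.concurrent.Executors
import scala.concurrent.duration.Duration
import scala.concurrent.{Await, ExecutionContext, Future}

object TopLevelParallelScan {
  implicit val ec: ExecutionContext =
    ExecutionContext.fromExecutor(Executors.newFixedThreadPool(8))

  // Placeholder for a real file-system listing (an assumption, not Spark API).
  def listChildren(dir: String): Seq[String] = Seq.empty

  def scan(dir: String, parallel: Boolean): Seq[String] = {
    val children = listChildren(dir)
    if (children.isEmpty) Seq(dir)
    else if (parallel) {
      // Fan out exactly once, from the caller's thread: each top-level child
      // is scanned by one task, and all recursion inside it stays sequential.
      val futures = children.map(c => Future(scan(c, parallel = false)))
      Await.result(Future.sequence(futures), Duration.Inf).flatten
    } else {
      children.flatMap(c => scan(c, parallel = false))
    }
  }
}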