
[SPARK-28699][CORE][2.3] Fix a corner case for aborting indeterminate stage #25508

Closed

Conversation

@xuanyuanking (Member) commented Aug 20, 2019

What changes were proposed in this pull request?

Change the logic for collecting indeterminate stages: when handling a FetchFailed event, we should start the traversal from mapStage, not failedStage.

Why are the changes needed?

In the FetchFailed error-handling logic, the indeterminate stages were originally collected starting from the stage that hit the fetch failure. When the fetch failure happens in the first task of that stage, this logic causes the indeterminate stage to be resubmitted only partially, which can eventually lead to a correctness bug.
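To make the idea concrete, here is a rough, self-contained sketch; it is a toy model only, and the `Stage` case class and `collectStagesToRollback` helper below are hypothetical, not Spark's actual DAGScheduler internals. The only point it illustrates is which stage seeds the rollback traversal.

```scala
// Hypothetical, stripped-down model of the rollback collection that happens
// when a FetchFailed event arrives; names mirror the idea, not Spark's internals.
case class Stage(id: Int, isIndeterminate: Boolean, parents: Seq[Stage] = Nil)

// Walk back from each active (leaf) stage and keep every stage on a chain that
// reaches `root`; those stages all have to be rolled back together.
def collectStagesToRollback(root: Stage, activeStages: Seq[Stage]): Set[Stage] = {
  def chain(s: Stage): Set[Stage] =
    if (s == root) Set(s)
    else {
      val parentChains = s.parents.map(chain).filter(_.nonEmpty)
      if (parentChains.isEmpty) Set.empty[Stage] else parentChains.flatten.toSet + s
    }
  activeStages.flatMap(chain).toSet
}

// mapStage produced the lost shuffle output; failedStage is where FetchFailed was seen.
val mapStage    = Stage(1, isIndeterminate = true)
val failedStage = Stage(2, isIndeterminate = false, parents = Seq(mapStage))
val resultStage = Stage(3, isIndeterminate = false, parents = Seq(failedStage))

// Seeding the traversal from failedStage misses mapStage itself, so an
// indeterminate map stage can end up resubmitted only partially; seeding from
// mapStage picks up the whole chain, letting the scheduler abort it as a unit.
val rollback =
  if (mapStage.isIndeterminate) collectStagesToRollback(mapStage, Seq(resultStage))
  else Set(failedStage, mapStage)
assert(rollback.map(_.id) == Set(1, 2, 3))
```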

Does this PR introduce any user-facing change?

It makes the indeterminate stage abort as expected in this corner case.

How was this patch tested?

A new UT in DAGSchedulerSuite.
Also, run the integration test below with `local-cluster[5, 2, 5120]` and `spark.sql.execution.sortBeforeRepartition=false`; it aborts the indeterminate stage as expected:

```scala
import scala.sys.process._
import org.apache.spark.TaskContext

val res = spark.range(0, 10000 * 10000, 1).map { x => (x % 1000, x) }
// kill an executor in the stage that performs repartition(239)
val df = res.repartition(113).map { x => (x._1 + 1, x._2) }.repartition(239).map { x =>
  if (TaskContext.get.attemptNumber == 0 && TaskContext.get.partitionId < 1 && TaskContext.get.stageAttemptNumber == 0) {
    throw new Exception("pkill -f -n java".!!)
  }
  x
}
val r2 = df.distinct.count()
```


Closes apache#25498 from xuanyuanking/SPARK-28699-followup.

Authored-by: Yuanjian Li <xyliyuanjian@gmail.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit 0d3a783)
Signed-off-by: Yuanjian Li <xyliyuanjian@gmail.com>
@SparkQA commented Aug 20, 2019

Test build #109394 has finished for PR 25508 at commit a4d7360.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun (Member)

Retest this please.

@dongjoon-hyun changed the title from [SPARK-28699][CORE][BACKPORT-2.3] Fix a corner case for aborting indeterminate stage to [SPARK-28699][CORE][2.3] Fix a corner case for aborting indeterminate stage on Aug 20, 2019
@SparkQA commented Aug 20, 2019

Test build #109422 has finished for PR 25508 at commit a4d7360.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun (Member)

Could you fix the UT failure?

  • org.apache.spark.scheduler.DAGSchedulerSuite.SPARK-23207: retry all the succeeding stages when the map stage is indeterminate

@xuanyuanking (Member, Author)

Yeah, I'm looking into this; it seems the behavior is not the same between 2.3 and 2.4.

```
@@ -2521,33 +2521,19 @@ class DAGSchedulerSuite extends SparkFunSuite with LocalSparkContext with TimeLi
  (Success, makeMapStatus("hostD", 2))))
assert(mapOutputTracker.findMissingPartitions(shuffleId2) === Some(Seq.empty))

// Simulate the scenario of executor lost
runEvent(ExecutorLost("exec-hostC", ExecutorKilled))
```
@xuanyuanking (Member, Author)

The behavior difference between 2.3 and 2.4 is related to #21758, which moves the output-status cleanup forward: https://github.com/apache/spark/pull/21758/files#diff-6a9ff7fb74fd490a50462d45db2d5e11L1390.
So I fixed the behavior by simulating an executor loss, because here we want a scenario where some partitions are missing while rerunning the shuffle map stage.
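
For intuition only, here is a toy sketch of why losing an executor leaves the rerun of the shuffle map stage with missing partitions; the `outputs` map and both helpers below are made up for illustration and are not the DAGSchedulerSuite or MapOutputTracker APIs.

```scala
// Toy model: which executor currently hosts each map partition's shuffle output.
var outputs = Map(0 -> "exec-hostA", 1 -> "exec-hostC")

// Losing an executor invalidates every output that was registered on it.
def executorLost(execId: String): Unit =
  outputs = outputs.filter { case (_, host) => host != execId }

// Partitions with no registered output must be recomputed on the next attempt.
def findMissingPartitions(numPartitions: Int): Seq[Int] =
  (0 until numPartitions).filterNot(p => outputs.contains(p))

executorLost("exec-hostC")
assert(findMissingPartitions(2) == Seq(1)) // only partition 1 needs rerunning
```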

@SparkQA commented Aug 21, 2019

Test build #109480 has finished for PR 25508 at commit b5413e7.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun (Member)

cc @cloud-fan

@xuanyuanking (Member, Author)

Supplied a UT for this cherry-pick in the last commit.

@dongjoon-hyun (Member)

Thank you for updating this, too, @xuanyuanking.
cc @kiszk

@SparkQA commented Aug 22, 2019

Test build #109542 has finished for PR 25508 at commit b7b7150.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan (Contributor)

thanks, merging to 2.3!

cloud-fan pushed a commit that referenced this pull request Aug 22, 2019

Closes #25508 from xuanyuanking/spark-28699-backport-2.3.

Authored-by: Yuanjian Li <xyliyuanjian@gmail.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
@cloud-fan closed this Aug 22, 2019
@xuanyuanking deleted the spark-28699-backport-2.3 branch on August 22, 2019 06:37