[SPARK-28699][Core] Cache an indeterminate RDD could lead to incorrect result while stage rerun #25420

Closed · wants to merge 2 commits

Conversation

xuanyuanking (Member) commented Aug 12, 2019

What changes were proposed in this pull request?

This is another case of incorrect results from an indeterminate stage/RDD when a stage rerun happens. In CachedRDDBuilder, we fail to propagate the isOrderSensitive characteristic to the newly created MapPartitionsRDD.
This patch is just a safeguard; if we need full support for stage rerunning, it should be done after #24892.
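
For background, here is a minimal sketch of how the isOrderSensitive flag feeds into an RDD's determinism, modeled loosely on Spark's DeterministicLevel and MapPartitionsRDD (the outputLevel helper below is illustrative, not Spark's actual code):

// Sketch modeled on org.apache.spark.rdd.DeterministicLevel; simplified.
object DeterministicLevel extends Enumeration {
  val DETERMINATE, UNORDERED, INDETERMINATE = Value
}

// Illustrative helper (not Spark's code): an order-sensitive function over
// UNORDERED input (e.g. rows from a shuffle, whose order can change between
// attempts) must yield INDETERMINATE output, because a stage rerun can feed
// the same rows to the function in a different order.
def outputLevel(parent: DeterministicLevel.Value,
                isOrderSensitive: Boolean): DeterministicLevel.Value =
  if (isOrderSensitive && parent == DeterministicLevel.UNORDERED)
    DeterministicLevel.INDETERMINATE
  else
    parent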

How was this patch tested?

An integration test: with this patch, the job below fails with an exception instead of silently returning a wrong answer.

import scala.sys.process._
import org.apache.spark.TaskContext

val res = spark.range(0, 10000 * 10000, 1).map { x => (x % 1000, x) }
// kill an executor in the stage that performs repartition(239)
val df = res.repartition(113).cache.repartition(239).map { x =>
  if (TaskContext.get.attemptNumber == 0 && TaskContext.get.partitionId < 1 && TaskContext.get.stageAttemptNumber == 0) {
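    // ".!!" (from scala.sys.process) runs the shell command and returns its
    // output: it kills the newest java process (an executor) to force a retry.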
    throw new Exception("pkill -f -n java".!!)
  }
  x
}
val r2 = df.distinct.count()
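
For context: killing the executor forces a rerun of the shuffle stage feeding repartition(239). Because a shuffle's row order can differ between attempts, recomputing only the lost partitions can change the data, and df.distinct.count() may silently return a wrong result. With the isOrderSensitive fix, Spark treats the cached RDD as indeterminate and fails with an exception instead of returning an incorrect count.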

(cherry picked from commit b543fab)
Signed-off-by: Yuanjian Li <xyliyuanjian@gmail.com>
@@ -2710,7 +2710,7 @@ class DAGSchedulerSuite extends SparkFunSuite with LocalSparkContext with TimeLi
     assert(countSubmittedMapStageAttempts() === 2)
   }
 
-  test("SPARK-23207: retry all the succeeding stages when the map stage is indeterminate") {
+  ignore("SPARK-23207: retry all the succeeding stages when the map stage is indeterminate") {
xuanyuanking (Member Author) commented:

Ignore this test because of the behavior change: with the current approach, we need to abort the stage of the current mapStage.
Since we will eventually support stage rerun, I suggest skipping this behavior for now; we can directly support the cache scenario after SPARK-25341 is merged.

@@ -131,7 +131,7 @@ case class CachedRDDBuilder(
 
         def hasNext: Boolean = rowIterator.hasNext
       }
-    }.persist(storageLevel)
+    }, isOrderSensitive = true).persist(storageLevel)
A contributor commented:
I don't quite get it. The isOrderSensitive flag describes the map function. Why is this map function order sensitive?
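
For context on that question: packing rows into cached batches is order sensitive because batch boundaries depend on the order rows arrive, so the same set of rows grouped in a different order produces different cached content. A self-contained illustration (the toBatches helper is hypothetical, not Spark code):

// Hypothetical helper: fixed-size batching, analogous to packing rows into CachedBatches.
def toBatches(rows: Seq[Long], batchSize: Int): Seq[Seq[Long]] =
  rows.grouped(batchSize).toSeq

val attempt0 = Seq(1L, 2L, 3L, 4L) // row order seen by the first attempt
val attempt1 = Seq(3L, 1L, 4L, 2L) // same rows, reordered after a stage rerun
// The batches, and thus the cached partition contents, differ across attempts.
assert(toBatches(attempt0, 2) != toBatches(attempt1, 2))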

SparkQA commented Aug 12, 2019

Test build #108978 has finished for PR 25420 at commit b3ea90f.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -86,7 +86,7 @@ case class CachedRDDBuilder(
 
   private def buildBuffers(): RDD[CachedBatch] = {
     val output = cachedPlan.output
-    val cached = cachedPlan.execute().mapPartitionsInternal { rowIterator =>
+    val cached = cachedPlan.execute().mapPartitionsWithIndexInternal({ (_, rowIterator) =>
A member commented:
I think we only run the cached plan once, so it should be determinate? If so, this map should effectively not be order sensitive?
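
For context: a cached RDD is computed once only if its cached blocks survive. If an executor holding those blocks is lost, the affected partitions are recomputed from lineage, and that recomputation can see a different row order from the upstream shuffle, which is why the map function still needs the order-sensitive flag.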

dongjoon-hyun (Member) commented:

Retest this please.

dongjoon-hyun (Member) commented:

@xuanyuanking Can we have a test case to prevent a future regression on this?

SparkQA commented Aug 12, 2019

Test build #108993 has finished for PR 25420 at commit b3ea90f.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

xuanyuanking (Member Author) commented Aug 19, 2019

Sorry for the late reply, and thanks for the comments and reviews! After further investigation, I found the root cause of this issue is the radix sort in ShuffleExchangeExec; please review it in #25491.
The bug fix in DAGScheduler is still needed; I'll open a follow-up PR and add a UT for it. Done in #25498.
After all the fixes are done, I'll give a detailed explanation in the JIRA description.

xuanyuanking deleted the SPARK-28699 branch on Aug 19, 2019, 09:05.