[SPARK-18928][branch-2.0]Check TaskContext.isInterrupted() in FileScanRDD, JDBCRDD & UnsafeSorter #16357
Conversation
…DD & UnsafeSorter

In order to respond to task cancellation, Spark tasks must periodically check `TaskContext.isInterrupted()`, but this check is missing on a few critical read paths used in Spark SQL, including `FileScanRDD`, `JDBCRDD`, and UnsafeSorter-based sorts. This can cause interrupted / cancelled tasks to continue running and become zombies (as also described in apache#16189).

This patch aims to fix this problem by adding `TaskContext.isInterrupted()` checks to these paths. Note that I could have used `InterruptibleIterator` to simply wrap a bunch of iterators but in some cases this would have an adverse performance penalty or might not be effective due to certain special uses of Iterators in Spark SQL. Instead, I inlined `InterruptibleIterator`-style logic into existing iterator subclasses.

Tested manually in `spark-shell` with two different reproductions of non-cancellable tasks, one involving scans of huge files and another involving sort-merge joins that spill to disk. Both causes of zombie tasks are fixed by the changes added here.

Author: Josh Rosen <joshrosen@databricks.com>

Closes apache#16340 from JoshRosen/sql-task-interruption.
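For readers skimming the thread, here is a minimal sketch of what inlining an `InterruptibleIterator`-style check into an iterator subclass looks like. This is illustrative only, not the actual patch; the class name is hypothetical.

```scala
import org.apache.spark.{TaskContext, TaskKilledException}

// Illustrative sketch only: the class name is hypothetical, not one of the
// classes touched by this patch.
class CancellableRowIterator[T](underlying: Iterator[T], context: TaskContext)
  extends Iterator[T] {

  override def hasNext: Boolean = {
    // Read the (volatile) interruption flag inline rather than wrapping the
    // whole iterator in InterruptibleIterator.
    if (context != null && context.isInterrupted()) {
      throw new TaskKilledException
    }
    underlying.hasNext
  }

  override def next(): T = underlying.next()
}
```

The trade-off debated below is that this puts a volatile read on the per-tuple path.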
LGTM
```java
// `getNumRecords()` instead of `hasNext()` to know when to stop.
if (taskContext != null && taskContext.isInterrupted()) {
  throw new TaskKilledException();
}
```
Won't this be in the internal tight loop for reading data?
If so, dereferencing a volatile for each tuple processed is worrying.
We already have this in tight loops in the form of InterruptibleIterator wrapping all over the place.
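For context, the existing wrapping pattern looks roughly like this. A sketch under the assumption that the developer-API `org.apache.spark.InterruptibleIterator` is in scope; `computeRows` is a hypothetical producer of the underlying iterator.

```scala
import org.apache.spark.{InterruptibleIterator, TaskContext}

// Sketch of the existing wrapping pattern: every hasNext call goes through the
// wrapper, which checks the interruption flag before delegating, so a
// per-tuple volatile read is already common on Spark's read paths.
def readPartition(context: TaskContext, computeRows: () => Iterator[Int]): Iterator[Int] = {
  new InterruptibleIterator(context, computeRows())
}
```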
In an admittedly non-scientific benchmark, I tried running
```scala
import org.apache.spark._

sc.parallelize(1 to (1000 * 1000 * 1000), 1).mapPartitions { iter =>
  val tc = TaskContext.get()
  iter.map { x =>
    tc.isInterrupted()
    x + 1
  }
}.count()
```
a few times with and without the `tc.isInterrupted()` check, and there wasn't a measurable time difference in my environment. While I imagine that the volatile read could incur higher costs in certain circumstances, I think that the overhead of all of the virtual function calls, iterator interfaces, etc. will mask any gains from optimizing this read.
As you mentioned, this is an unscientific microbenchmark :-)
Also, the `isInterrupted()` call was probably optimized away anyway, since its result is unused?
Since we don't need guarantees on how soon interruption is honoured, batching its application (if possible) would be better in sorting?
Yes, we do have `InterruptibleIterator` - unfortunately, we don't have a way to optimize that (afaik).
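One way the batching idea floated above could look, purely as an illustration: this is not part of the patch, and the class name and default interval are made up.

```scala
import org.apache.spark.{TaskContext, TaskKilledException}

// Check the volatile interruption flag only once every `checkInterval` records,
// trading a little cancellation latency for fewer volatile reads per tuple.
class BatchedInterruptCheckIterator[T](
    underlying: Iterator[T],
    context: TaskContext,
    checkInterval: Int = 1024) extends Iterator[T] {

  private var sinceLastCheck = 0

  override def hasNext: Boolean = {
    sinceLastCheck += 1
    if (sinceLastCheck >= checkInterval) {
      sinceLastCheck = 0
      if (context != null && context.isInterrupted()) {
        throw new TaskKilledException
      }
    }
    underlying.hasNext
  }

  override def next(): T = underlying.next()
}
```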
```scala
// to avoid performance overhead.
if (context.isInterrupted()) {
  throw new TaskKilledException
}
```
Why not move this to next() instead of hasNext()? The latter is not guaranteed to be called, as seen here: #16252
This particular iterator already won't work unless hasNext() is called, since otherwise nobody will call nextIterator().
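To make that concrete, here is a simplified, hypothetical sketch of the kind of iterator being discussed (not the real FileScanRDD code): hasNext() is what advances to the next underlying file, so a caller that never calls hasNext() would never reach nextIterator() in the first place.

```scala
import org.apache.spark.{TaskContext, TaskKilledException}

// Hypothetical sketch: hasNext() both performs the cancellation check and
// advances to the next file once the current iterator is exhausted.
abstract class MultiFileIterator[T](context: TaskContext) extends Iterator[T] {
  protected var currentIterator: Iterator[T] = Iterator.empty

  // Opens the next file; returns false when there is nothing left to read.
  protected def nextIterator(): Boolean

  override def hasNext: Boolean = {
    if (context.isInterrupted()) {
      throw new TaskKilledException
    }
    currentIterator.hasNext || (nextIterator() && hasNext)
  }

  override def next(): T = currentIterator.next()
}
```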
Yes, it looks like this iterator is already broken in that respect, and we are adding to that now.
Test build #70417 has finished for PR 16357 at commit
Merging to branch 2.0.
…anRDD, JDBCRDD & UnsafeSorter

This is a branch-2.0 backport of #16340; the original description follows:

## What changes were proposed in this pull request?

In order to respond to task cancellation, Spark tasks must periodically check `TaskContext.isInterrupted()`, but this check is missing on a few critical read paths used in Spark SQL, including `FileScanRDD`, `JDBCRDD`, and UnsafeSorter-based sorts. This can cause interrupted / cancelled tasks to continue running and become zombies (as also described in #16189).

This patch aims to fix this problem by adding `TaskContext.isInterrupted()` checks to these paths. Note that I could have used `InterruptibleIterator` to simply wrap a bunch of iterators but in some cases this would have an adverse performance penalty or might not be effective due to certain special uses of Iterators in Spark SQL. Instead, I inlined `InterruptibleIterator`-style logic into existing iterator subclasses.

## How was this patch tested?

Tested manually in `spark-shell` with two different reproductions of non-cancellable tasks, one involving scans of huge files and another involving sort-merge joins that spill to disk. Both causes of zombie tasks are fixed by the changes added here.

Author: Josh Rosen <joshrosen@databricks.com>

Closes #16357 from JoshRosen/sql-task-interruption-branch-2.0.
@yhuai as I already mentioned a couple of days back, please allow more time for review - not everyone is in the same time zone or has the same working hours. Thanks.
@mridulm ok. I merged this because it is a backport (the original patch has already been merged to 2.1 and master) and I believe Josh has already addressed your concerns. If you want us to hold the merge, it would be good to explicitly mention it next time. Thanks!
@yhuai I had not seen the original PR, else I would have commented there.
This is a branch-2.0 backport of #16340; the original description follows:

What changes were proposed in this pull request?

In order to respond to task cancellation, Spark tasks must periodically check `TaskContext.isInterrupted()`, but this check is missing on a few critical read paths used in Spark SQL, including `FileScanRDD`, `JDBCRDD`, and UnsafeSorter-based sorts. This can cause interrupted / cancelled tasks to continue running and become zombies (as also described in #16189).

This patch aims to fix this problem by adding `TaskContext.isInterrupted()` checks to these paths. Note that I could have used `InterruptibleIterator` to simply wrap a bunch of iterators but in some cases this would have an adverse performance penalty or might not be effective due to certain special uses of Iterators in Spark SQL. Instead, I inlined `InterruptibleIterator`-style logic into existing iterator subclasses.

How was this patch tested?

Tested manually in `spark-shell` with two different reproductions of non-cancellable tasks, one involving scans of huge files and another involving sort-merge joins that spill to disk. Both causes of zombie tasks are fixed by the changes added here.