[SPARK-7988][STREAMING] Round-robin scheduling of receivers by default #6607
Conversation
@@ -107,8 +107,8 @@ abstract class Receiver[T](val storageLevel: StorageLevel) extends Serializable
  */
 def onStop()

 /** Override this to specify a preferred location (hostname). */
 def preferredLocation : Option[String] = None
I think this breaks binary compatibility. This is not feasible.
Adding a new private var in the receiver feels like the simplest workaround for the compatibility issue.
ok to test
Test build #34180 has finished for PR 6607 at commit
Test build #34236 has finished for PR 6607 at commit
// Run the dummy Spark job to ensure that all slaves have registered.
// This avoids all the receivers to be scheduled on the same node.
if (!ssc.sparkContext.isLocal) {
  ssc.sparkContext.makeRDD(1 to 50, 50).map(x => (x, 1)).reduceByKey(_ + _, 20).collect()
}

// Right now, we only honor preferences if all receivers have them
val hasLocationPreferences = receivers.map(_.preferredLocation.isDefined).reduce(_ && _)
It would be nice if this new functionality were put into a function by itself, so that we can unit test it individually against different combinations of executors, and maybe different policies in the future.
I added some comments. At a high level, two things are needed.
…into master_nravi Conflicts: streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverTracker.scala
Test build #34270 has finished for PR 6607 at commit
Test build #34271 has finished for PR 6607 at commit
Test build #34273 has finished for PR 6607 at commit
Test build #34274 has finished for PR 6607 at commit
@@ -17,8 +17,9 @@
 package org.apache.spark.streaming.scheduler

-import scala.collection.mutable.{HashMap, SynchronizedMap}
+import scala.collection.mutable.{HashMap, SynchronizedMap, ArrayBuffer}
nit: ordering
This looks pretty good. I left one comment that I felt can improve code readability and makes the logic easier to understand.
Thanks for the review. Will look into the failing test cases soon.
Test build #34300 has finished for PR 6607 at commit
val locations = new Array[ArrayBuffer[String]](receivers.length)
if (!executors.isEmpty) {
  var i = 0
  for (i <- 0 to (receivers.length - 1)) {
nit: cleaner to use 0 until X rather than 0 to X-1
val executors = getExecutors(ssc)
val locations = scheduleReceivers(receivers, executors)
val tempRDD =
  if (locations(0) != null) {
Under what condition will location(0) be null?
sparkContext.isLocal == true
Then it's more intuitive to check that directly: if !local, then schedule and makeRDD; otherwise, just makeRDD.
The location(0) check is all-encompassing (it makes no assumptions about when it may be true). We can add a comment next to it to clarify that it can be null in local mode.
It may be so, but the condition is hard to read and understand (which is why I asked). Also, checking for null at location(0) to detect whether an ArrayBuffer was assigned to that position is a very brittle check that ties the logic deep into the implementation of scheduleReceiver. If someone changes the implementation of the function to allocate receivers differently (say, always assigning an ArrayBuffer even if it is empty), this may totally break. So this condition makes non-intuitive assumptions about the implementation logic of scheduleReceiver. This is bad code design.
We try to design the code to be as intuitive and modular as possible, so that others can easily contribute. That's the only way to manage a large open source project with so many contributors.
Generally speaking, what if there were numerous conditions under which locations(0) could be null? Would you enlist them all? It's common practice to do: Obj x = f(); if (x) { do blah }. If we don't want a check on locations(0), the right way would be to return null (or some such) from scheduleReceiver when locations(0) is null, so we can check if (locations) instead of if (locations(0)). Better still, we can check if (!executors.isEmpty) before invoking scheduleReceiver, so no further check is needed.
First of all, in Scala we try not to rely on null; we use Option and None instead. Here is a suggestion which I think is a cleaner design with clean semantics. makeRDD is designed to take a sequence of (item, locations). If for an item the locations are empty (not null, just an empty seq), that automatically means there is no preferred location. That's intuitive.
So scheduleReceiver can be designed as follows:
- scheduleReceiver always returns an Array[ArrayBuffer[String]] where any of the buffers can be empty, but there are no nulls.
- the logic at this location becomes
if (sparkContext is local) {
// make RDD
} else {
// schedule receivers
// make RDD with returned result
}
If there were no executors at that point in time, all the buffers will be empty, which is perfectly okay to pass on to makeRDD. The code stays simple with only one condition, and it just works whether or not the executors are empty.
If you want to be extra careful, you can simply add a check that none of the returned locations are null. That is still just one line of easy-to-understand code rather than introducing another level of conditions.
How does this sound?
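A minimal sketch of the contract suggested above (the method name matches the discussion, but the signature and round-robin policy here are illustrative, not the exact PR code):

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical sketch: every receiver always gets a (possibly empty) buffer
// of candidate hosts, so the caller never has to check for null.
def scheduleReceivers(
    numReceivers: Int,
    executors: Seq[String]): Array[ArrayBuffer[String]] = {
  // Allocate a buffer per receiver even when no executors exist yet.
  val locations = Array.fill(numReceivers)(new ArrayBuffer[String])
  if (executors.nonEmpty) {
    // Assign executors to receivers round-robin.
    for (i <- 0 until numReceivers) {
      locations(i) += executors(i % executors.length)
    }
  }
  locations
}
```

With this shape, an empty buffer simply means "no preferred location", which is exactly what makeRDD expects.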
That's fine. The following, I think, is slightly cleaner:
if (!executors.isEmpty) {
  // schedule receivers
  // make RDD with returned result
} else {
  // make RDD
}
It avoids the redundant invocation of scheduleReceivers and the subsequent memory allocations, and it optimizes away the extra logic (the check on local) and the assumptions about how locations is formatted. If this sounds good, I think we can get the final iteration of this PR going.
SGTM!
Test build #35036 has finished for PR 6607 at commit
Test build #35037 has finished for PR 6607 at commit
This should contain all the changes we have discussed so far. ReceiverTrackerSuite now contains two instead of four tests (intentionally keeping the "some preferred location" test separate).
Test build #35516 has finished for PR 6607 at commit
Test build #35518 has finished for PR 6607 at commit
  }
  loc
}
def testScheduler(numReceivers: Int, preferredLocation: Boolean, allocation: String) {
Add empty line.
Few nits, otherwise good to go.
Test build #35537 has finished for PR 6607 at commit
LGTM! Merging this. Thanks!
  }
}
var count = 0
for (i <- 0 until max(receivers.length, executors.length)) {
Why is max used here? Isn't receivers.length enough?
Because we want to allocate more executors per receiver, so that receiver tasks can fail over to other executors without conflicting with other running receivers.
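A small sketch of why the loop bound is max(receivers.length, executors.length) rather than receivers.length (this is an illustrative reconstruction of the idea, not the exact PR code): when there are more executors than receivers, the extra iterations assign the surplus executors round-robin, so each receiver accumulates several candidate hosts for failover.

```scala
import scala.collection.mutable.ArrayBuffer
import scala.math.max

// Hypothetical sketch of the round-robin allocation discussed above.
def roundRobin(numReceivers: Int, executors: Seq[String]): Array[ArrayBuffer[String]] = {
  val locations = Array.fill(numReceivers)(new ArrayBuffer[String])
  if (executors.nonEmpty) {
    // Looping to max(...) assigns every executor, not just the first
    // numReceivers of them.
    for (i <- 0 until max(numReceivers, executors.length)) {
      locations(i % numReceivers) += executors(i % executors.length)
    }
  }
  locations
}
```

For example, with 2 receivers and 4 executors, each receiver ends up with 2 candidate hosts; looping only to receivers.length would leave 2 executors unused as failover targets.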
Thanks!
Minimal PR for round-robin scheduling of receivers. Dense scheduling can be enabled by setting preferredLocation, so a new config parameter isn't really needed. Tested this on a cluster of 6 nodes and saw a 20-25% gain in throughput compared to random scheduling.
@tdas @pwendell