Add IndexedSet to help manage workers/jobs. #463

Closed
wants to merge 5 commits

Conversation

stephenh
Contributor

This is admittedly somewhat cute, but I like the idea of more strictly/DRYly applying the "remove the job from all xxxToJob indexes" logic, instead of having to remember to make the "xxxToJob -= job" calls by hand.
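
As a rough sketch of the idea (the class name and API here are illustrative, not the exact code in this PR), an IndexedSet keeps a primary set plus any number of secondary indexes, so a single remove(e) clears e out of every index:

    import scala.collection.mutable

    class IndexedSet[T] {
      private val elements = mutable.Set[T]()
      private val indexes = mutable.Buffer[(T => Any, mutable.Map[Any, T])]()

      // Register a secondary index keyed by `key`; returns a lookup function.
      def addIndex[K](key: T => K): K => Option[T] = {
        val map = mutable.Map[Any, T]()
        elements.foreach(e => map(key(e)) = e)
        indexes += ((key, map))
        k => map.get(k)
      }

      def add(e: T): Unit = {
        elements += e
        indexes.foreach { case (key, map) => map(key(e)) = e }
      }

      // One call removes e from the set and from every xxxToJob-style index.
      def remove(e: T): Unit = {
        elements -= e
        indexes.foreach { case (key, map) => map -= key(e) }
      }
    }

A scheduler could then keep one executor set plus idToExecutor/hostToExecutor-style lookups, and a removal could never leave a stale index entry behind.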

@mateiz
Member

mateiz commented Feb 14, 2013

This is definitely interesting. Let me think about it a bit more (haven't had a lot of time in the past few days), but it might be worth going for if we can use this throughout.

Stephen Haberman added 2 commits February 18, 2013 16:26
This also fixes a bug where a StatusUpdate message after an executor had already
been removed would result in a NoSuchElementException when updating freeCores.
Conflicts:
	core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala
@stephenh
Contributor Author

Yesterday I had a job run into a race condition in StandaloneSchedulerBackend where freeCores was being decremented in StatusUpdate, but the executor had already been removed, so it failed with NoSuchElementException:

13/02/17 07:03:23 ERROR cluster.StandaloneSchedulerBackend$DriverActor: key not found: 13
java.util.NoSuchElementException: key not found: 13
  at scala.collection.MapLike$class.default(MapLike.scala:225)
  at scala.collection.mutable.HashMap.default(HashMap.scala:45)
  at scala.collection.MapLike$class.apply(MapLike.scala:135)
  at scala.collection.mutable.HashMap.apply(HashMap.scala:45)
  at spark.scheduler.cluster.StandaloneSchedulerBackend$DriverActor$$anonfun$receive$1.apply(StandaloneSchedulerBackend.scala:60)

I thought this would be a good excuse for more IndexedSet cuteness, so I fixed the bug by keeping just one "executors" map and ensuring the executor still exists before updating it.
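
The shape of the fix, as a hedged sketch (the type and method names below are made up for illustration, not the actual StandaloneSchedulerBackend code): look the executor up with get, and only touch freeCores if it is still registered, instead of indexing into the map and hitting its default:

    import scala.collection.mutable

    case class ExecutorInfo(host: String, var freeCores: Int)

    val executors = mutable.Map[String, ExecutorInfo]()

    // executors.get returns an Option, so a late StatusUpdate for an
    // already-removed executor is dropped instead of blowing up with
    // NoSuchElementException the way executors(executorId) would.
    def onTaskFinished(executorId: String): Unit = {
      executors.get(executorId) match {
        case Some(exec) => exec.freeCores += 1 // still registered: safe
        case None       => // already removed: ignore the stale update
      }
    }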

Also merged in master and ran the tests.

Conflicts:
	core/src/main/scala/spark/deploy/master/Master.scala
@stephenh
Contributor Author

Remerged master again, picking up your job->app/split->partition changes (nice!).

  totalCoreCount.addAndGet(cores)
- makeOffers()
+ makeOffers(e)
Contributor Author


I believe this change is right: we only have one new executor, so calling makeOffers(e) is fine, versus the previous behavior, which would re-run makeOffers() for the new executor plus all existing ones.
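
A hedged sketch of the distinction (names simplified for illustration; the real method builds resource offers and hands them to the task scheduler):

    case class Executor(id: String, freeCores: Int)

    def offer(execs: Seq[Executor]): Unit =
      execs.foreach(e => println(s"offering ${e.freeCores} cores on ${e.id}"))

    // Previous behavior on a new registration: re-offer every executor.
    def makeOffers(all: Iterable[Executor]): Unit = offer(all.toSeq)

    // New behavior: only the newly registered executor has new cores to
    // offer, so offering just for it is sufficient.
    def makeOffers(e: Executor): Unit = offer(Seq(e))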

Conflicts:
	core/src/main/scala/spark/deploy/master/Master.scala
@AmplabJenkins

Can one of the admins verify this patch?

@AmplabJenkins

I'm the Jenkins test bot for the UC Berkeley AMPLab. I've noticed your pull request and will test it once an admin authorizes me to. Thanks for your submission!


@AmplabJenkins

Thank you for your pull request. An admin will review this request soon.

@stephenh closed this Oct 8, 2019