[SPARK-4030] Make destroy public for broadcast variables #2922

shivaram · 2014-10-24T06:42:16Z

This change makes the destroy function public for broadcast variables. Motivation for the change is described in https://issues.apache.org/jira/browse/SPARK-4030.
This patch also logs where destroy was called from if a broadcast variable is used after destruction.

Also log where destroy was called from if a broadcast variable is used after destruction.

shivaram · 2014-10-24T06:42:27Z

cc @pwendell @rxin for review

SparkQA · 2014-10-24T06:50:05Z

Test build #22125 has started for PR 2922 at commit e80c1ab.

This patch merges cleanly.

SparkQA · 2014-10-24T06:51:13Z

Test build #22125 has finished for PR 2922 at commit e80c1ab.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- abstract class Broadcast[T: ClassTag](val id: Long) extends Serializable with Logging

AmplabJenkins · 2014-10-24T06:51:14Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22125/
Test FAILed.

pwendell · 2014-10-24T06:53:29Z

core/src/main/scala/org/apache/spark/broadcast/Broadcast.scala

   */
-  private[spark] def destroy(blocking: Boolean) {
+  def destroy(blocking: Boolean) {


should we only expose a version where blocking is set to true for users? It seems like asynchronous destroy is a bit more complex. @shivaram does your app need the async version?

No - I actually prefer the synchronous version in my applications. I will add another destroy() which is always synchronous and make it public.

Okay if that's the case I'd propose making destroy() public and keeping destroy(blocking: Boolean) as private. That way we minimize the surface area of public APIs.

pwendell · 2014-10-24T06:54:05Z

small question - this looks good overall

pwendell · 2014-10-24T06:54:39Z

oh one thing - can we add a java version of this? should be pretty simple, right?

srowen · 2014-10-24T14:22:10Z

core/src/main/scala/org/apache/spark/broadcast/Broadcast.scala

@@ -60,6 +62,8 @@ abstract class Broadcast[T: ClassTag](val id: Long) extends Serializable {
   */
  @volatile private var _isValid = true

+  private var _destroySite = ""


How useful is it to store this? it only helps in case of an invalid Broadcast instance. (PS you can use string interpolation instead of format in these changes if you care to)

@pwendell requested this in the JIRA -- The main reason is that we'd like to make it easy to debug if users call destroy by mistake.

Oh and I used .format to be consistent with the rest of the file. I'm not very sure what our policy is on this ?

@srowen yeah I asked for tracking the callsite because we now have this case where someone can try to use a destroyed broadcast (destroyed by e.g. another thread) and it will be very hard for users to debug this. Tracking the callsite has almost no overhead here and it seemed like it might be useful for debugging.

SparkQA · 2014-10-27T05:57:30Z

Test build #22276 has started for PR 2922 at commit bed9c9d.

This patch merges cleanly.

SparkQA · 2014-10-27T05:58:27Z

Test build #22276 has finished for PR 2922 at commit bed9c9d.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2014-10-27T05:58:28Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22276/
Test FAILed.

SparkQA · 2014-10-27T06:07:27Z

Test build #22277 has started for PR 2922 at commit a11abab.

This patch merges cleanly.

shivaram · 2014-10-27T06:09:56Z

@pwendell - I made destroy blocking by default and only made that version public (its not clear we need the non-blocking version to also be public -- we can add it later if required)

Also all the Broadcast stuff in the Java API seems to come directly from the java classes ? Let me know if I missed something.

pwendell · 2014-10-27T06:30:28Z

Oh right yeah. Great, LGTM.

SparkQA · 2014-10-27T07:17:48Z

Test build #22277 has finished for PR 2922 at commit a11abab.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2014-10-27T07:17:51Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22277/
Test PASSed.

shivaram · 2014-10-27T15:47:07Z

Thanks. Merging this

Make destroy public for broadcast variables

e80c1ab

Also log where destroy was called from if a broadcast variable is used after destruction.

pwendell reviewed Oct 24, 2014
View reviewed changes

srowen reviewed Oct 24, 2014
View reviewed changes

Make destroy blocking by default

bed9c9d

Fix scala style in Utils.scala

a11abab

asfgit closed this in 9aa340a Oct 27, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-4030] Make destroy public for broadcast variables #2922

[SPARK-4030] Make destroy public for broadcast variables #2922

shivaram commented Oct 24, 2014

shivaram commented Oct 24, 2014

SparkQA commented Oct 24, 2014

SparkQA commented Oct 24, 2014

AmplabJenkins commented Oct 24, 2014

pwendell Oct 24, 2014

shivaram Oct 25, 2014

pwendell Oct 26, 2014

pwendell commented Oct 24, 2014

pwendell commented Oct 24, 2014

srowen Oct 24, 2014

shivaram Oct 25, 2014

pwendell Oct 26, 2014

SparkQA commented Oct 27, 2014

SparkQA commented Oct 27, 2014

AmplabJenkins commented Oct 27, 2014

SparkQA commented Oct 27, 2014

shivaram commented Oct 27, 2014

pwendell commented Oct 27, 2014

SparkQA commented Oct 27, 2014

AmplabJenkins commented Oct 27, 2014

shivaram commented Oct 27, 2014

[SPARK-4030] Make destroy public for broadcast variables #2922

[SPARK-4030] Make destroy public for broadcast variables #2922

Conversation

shivaram commented Oct 24, 2014

shivaram commented Oct 24, 2014

SparkQA commented Oct 24, 2014

SparkQA commented Oct 24, 2014

AmplabJenkins commented Oct 24, 2014

pwendell Oct 24, 2014

Choose a reason for hiding this comment

shivaram Oct 25, 2014

Choose a reason for hiding this comment

pwendell Oct 26, 2014

Choose a reason for hiding this comment

pwendell commented Oct 24, 2014

pwendell commented Oct 24, 2014

srowen Oct 24, 2014

Choose a reason for hiding this comment

shivaram Oct 25, 2014

Choose a reason for hiding this comment

pwendell Oct 26, 2014

Choose a reason for hiding this comment

SparkQA commented Oct 27, 2014

SparkQA commented Oct 27, 2014

AmplabJenkins commented Oct 27, 2014

SparkQA commented Oct 27, 2014

shivaram commented Oct 27, 2014

pwendell commented Oct 27, 2014

SparkQA commented Oct 27, 2014

AmplabJenkins commented Oct 27, 2014

shivaram commented Oct 27, 2014