SPARK-5358: Rework the classloader implementation. #4166

MattWhelan · 2015-01-22T22:44:22Z

The fundamental issue is that you can't change the delegation scheme
without overriding loadClass (rather than findClass). And, if you
override loadClass, you kind of have to do it in Java, because you need
a static initializer call to register yourself as parallel-capable.

This is an alternative to PR #4165. That PR requires Java 1.7. This
one sticks with 1.6, and implements the classloader as a trait, because
it can.
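For readers unfamiliar with the distinction, here is a minimal sketch of a child-first loader that overrides loadClass and registers itself as parallel-capable from a static initializer. It assumes Java 7 or later (which this PR deliberately avoids), and the class name `ChildFirstClassLoader` is hypothetical; it is not the code from this PR or from #4165.

```java
import java.net.URL;
import java.net.URLClassLoader;

// Hypothetical illustration only; not the loader proposed in this PR.
class ChildFirstClassLoader extends URLClassLoader {
    static {
        // Java 7+: registration has to happen in the loader class's own
        // static initializer, which is awkward to express from Scala.
        ClassLoader.registerAsParallelCapable();
    }

    ChildFirstClassLoader(URL[] urls, ClassLoader parent) {
        super(urls, parent);
    }

    @Override
    protected Class<?> loadClass(String name, boolean resolve)
            throws ClassNotFoundException {
        // Per-class-name lock, available because the loader is parallel-capable.
        synchronized (getClassLoadingLock(name)) {
            Class<?> c = findLoadedClass(name);        // 1. cache
            if (c == null) {
                try {
                    c = findClass(name);               // 2. self (our own URLs)
                } catch (ClassNotFoundException e) {
                    c = super.loadClass(name, false);  // 3. parent (default delegation)
                }
            }
            if (resolve) {
                resolveClass(c);
            }
            return c;
        }
    }
}
```

Overriding only findClass would not achieve this, because the default loadClass consults the parent before it ever calls findClass.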

AmplabJenkins · 2015-01-22T22:47:10Z

Can one of the admins verify this patch?

MattWhelan · 2015-01-22T22:52:17Z

BTW, I spent a few minutes pondering the deadlock scenario, and the delegation changes you see here. I'm pretty sure we're safe with 1.6-style coarse locking, because no class from a parent classloader will ever be able to refer to any of the child classloader's classes (statically - instances don't really matter). We don't have funky cyclic references between CLs, so I don't think we can have funky deadlocks.
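As a rough illustration of what "1.6-style coarse locking" means here, the sketch below uses a single lock (the loader instance itself) for every load instead of a per-class-name lock. The class name `CoarseLockChildFirstLoader` is hypothetical, and this is an assumption about the general shape, not the trait from this PR.

```java
import java.net.URL;
import java.net.URLClassLoader;

// Hypothetical illustration only; works on Java 1.6 because it needs
// neither registerAsParallelCapable nor getClassLoadingLock.
class CoarseLockChildFirstLoader extends URLClassLoader {
    CoarseLockChildFirstLoader(URL[] urls, ClassLoader parent) {
        super(urls, parent);
    }

    // 'synchronized' on the method locks the whole loader for each load.
    @Override
    protected synchronized Class<?> loadClass(String name, boolean resolve)
            throws ClassNotFoundException {
        Class<?> c = findLoadedClass(name);            // 1. cache
        if (c == null) {
            try {
                c = findClass(name);                   // 2. self
            } catch (ClassNotFoundException e) {
                c = super.loadClass(name, false);      // 3. parent
            }
        }
        if (resolve) {
            resolveClass(c);
        }
        return c;
    }
}
```

The deadlock argument above rests on lock ordering: a thread holding this loader's lock may go on to take the parent's lock, but if parent classes never refer back to child classes, nothing takes the two locks in the opposite order.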

stephenh · 2015-02-04T15:41:57Z

@MattWhelan I believe you that loadClass is preferable to findClass, but can you articulate why? And maybe as a comment on this SO post so that future people that copy/paste from the 1st answer can know as well? :-)

http://stackoverflow.com/questions/5445511/how-do-i-create-a-parent-last-child-first-classloader-in-java-or-how-to-overr

Tangentially, I'd eventually like to port/copy/paste the Hadoop ApplicationClassLoader (which I believe they copy/pasted from Jetty) so that we can ignore any org.apache.spark.* classes that might accidentally be in the user's jar:

https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/util/ApplicationClassLoader.java

I'm also half-tempted to copy/paste their ApplicationClassLoader into a separate utility project, so that going forward Jetty/Hadoop/Spark/etc. could all reuse the same tested functionality instead of copy/pasting the code around. But then I'd have to publish to maven central, it'd be another dependency to worry about, etc., etc...
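The idea of ignoring org.apache.spark.* classes in a user's jar could be sketched as a prefix filter that a child-first loader consults before looking at its own URLs. The class `ParentFirstFilter` and its method are hypothetical names chosen for this example; this is not the Hadoop or Jetty implementation.

```java
// Hypothetical illustration only; not Hadoop's ApplicationClassLoader API.
final class ParentFirstFilter {
    private final String[] prefixes;

    // Each prefix should end with '.' so that "org.apache.spark." does not
    // also match an unrelated package such as "org.apache.sparkling".
    ParentFirstFilter(String... prefixes) {
        this.prefixes = prefixes.clone();
    }

    // True when the class must always come from the parent loader,
    // even if the user's jar happens to contain a copy.
    boolean matches(String className) {
        for (String prefix : prefixes) {
            if (className.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }
}
```

A child-first loadClass would call `matches(name)` first and, when it returns true, delegate straight to the parent without trying its own URLs.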

pwendell · 2015-02-08T18:48:47Z

core/src/main/scala/org/apache/spark/executor/ExecutorURLClassLoader.scala

+ * (cache, self, parent).  This lets this CL "hide" or "override"
+ * class defs that also exist in the parent loader.
+ */
+private[spark] trait GreedyClassLoader extends ClassLoader {


Rather than making this a trait, can you just modify ChildExecutorURLClassLoader to have this logic? We don't need it anywhere else at this point, and this is the main intended goal of ChildExecutorURLClassLoader (to do greedy loading like this).

pwendell · 2015-02-08T18:49:08Z

If you bring this one up to date we may be able to slip it into 1.3. Just let me know, thanks!

MattWhelan · 2015-02-16T22:43:29Z

Hey, sorry I was away for a bit.

After an extended discussion with Vanzin on his #3233, I'm convinced that PR covers the problem that this one solves. Since that's been merged, let's go with that.

Matt Whelan added 3 commits January 22, 2015 12:01

Added copyright header.

9283a36

vanzin mentioned this pull request Jan 22, 2015

[SPARK-2996] Implement userClassPathFirst for driver, yarn. #3233

Closed

pwendell reviewed Feb 8, 2015

MattWhelan closed this Feb 16, 2015

MattWhelan deleted the greedyCL1.6 branch February 16, 2015 22:43

srowen mentioned this pull request Feb 16, 2015

SPARK-5358: Rework the classloader implementation. #4165

Closed
