[SPARK-21928][CORE] Set classloader on SerializerManager's private kryo #19280

squito · 2017-09-19T18:15:59Z

What changes were proposed in this pull request?

We have to make sure that SerializerManager's private instance of
kryo also uses the right classloader, regardless of the current thread
classloader. In particular, this fixes serde during remote cache
fetches, as those occur in netty threads.
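
The failure mode described above can be sketched outside Spark. The example below is only an illustration under stated assumptions: it uses plain Java serialization rather than Kryo, and every name in it (`ClassLoaderSerdeSketch`, `Payload`, `LoaderInputStream`, `deserializeOnRestrictedThread`) is hypothetical and not part of this patch. It imitates a netty-style thread by giving a worker thread a context classloader that cannot see application classes, then compares resolving classes through that thread's loader with resolving them through a loader that was chosen explicitly.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.ObjectStreamClass;
import java.io.Serializable;
import java.net.URL;
import java.net.URLClassLoader;

class ClassLoaderSerdeSketch {
    // Hypothetical application class, standing in for a user type stored in a cached partition.
    static final class Payload implements Serializable {
        private static final long serialVersionUID = 1L;
        final int value;
        Payload(int value) { this.value = value; }
    }

    // Resolves classes through a fixed loader instead of whatever the calling thread provides.
    static final class LoaderInputStream extends ObjectInputStream {
        private final ClassLoader loader;
        LoaderInputStream(InputStream in, ClassLoader loader) throws IOException {
            super(in);
            this.loader = loader;
        }
        @Override
        protected Class<?> resolveClass(ObjectStreamClass desc) throws ClassNotFoundException {
            return Class.forName(desc.getName(), false, loader);
        }
    }

    static byte[] serialize(Object o) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(o);
        }
        return bytes.toByteArray();
    }

    static Object deserialize(byte[] data, ClassLoader loader)
            throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new LoaderInputStream(new ByteArrayInputStream(data), loader)) {
            return in.readObject();
        }
    }

    // Deserializes on a thread whose context classloader only sees bootstrap classes.
    static String deserializeOnRestrictedThread(byte[] data, boolean useContextLoader)
            throws InterruptedException {
        ClassLoader appLoader = ClassLoaderSerdeSketch.class.getClassLoader();
        ClassLoader restricted = new URLClassLoader(new URL[0], null);
        String[] result = new String[1];
        Thread worker = new Thread(() -> {
            ClassLoader chosen = useContextLoader
                    ? Thread.currentThread().getContextClassLoader()
                    : appLoader;
            try {
                Payload p = (Payload) deserialize(data, chosen);
                result[0] = "ok:" + p.value;
            } catch (ClassNotFoundException e) {
                result[0] = "class not found";
            } catch (IOException e) {
                result[0] = "io error";
            }
        });
        worker.setContextClassLoader(restricted);
        worker.start();
        worker.join();
        return result[0];
    }

    public static void main(String[] args) throws Exception {
        byte[] data = serialize(new Payload(42));
        System.out.println(deserializeOnRestrictedThread(data, true));
        System.out.println(deserializeOnRestrictedThread(data, false));
    }
}
```

In this sketch the first call depends on the worker thread's context classloader and reports a missing class, while the second call uses the explicitly chosen loader and succeeds. That is the same idea as pinning the classloader on the serializer instead of trusting the current thread.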

How was this patch tested?

Manual tests and the existing suite via Jenkins. I haven't been able to reproduce this in a unit test, because when a remote RDD partition can't be fetched, there is a warning message and then the partition is just recomputed locally. I manually verified that the warning message is no longer present.

SparkQA · 2017-09-19T20:42:14Z

Test build #81944 has finished for PR 19280 at commit acbaf8b.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2017-09-20T00:00:52Z

Test build #81949 has finished for PR 19280 at commit 20e3585.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

squito · 2017-09-20T19:42:39Z

reaching out to some potential reviewers: @vanzin @srowen @JoshRosen @mridulm @tgravescs

vanzin · 2017-09-20T22:27:13Z

Looks ok to me, assuming the "default serializer" in SerializerManager is configured correctly through other means.

Title would sound better with a possessive: "SerializerManager's private kryo"

squito · 2017-09-21T02:35:58Z

> Looks ok to me, assuming the "default serializer" in SerializerManager is configured correctly through other means.

I think that part is fine. The serializer is created here:
https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/SparkEnv.scala#L279

The same instance is assigned to SparkEnv.serializer: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/SparkEnv.scala#L374

Which has its default classloader set in Executor.scala, right by the part I'm changing: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/executor/Executor.scala#L131

vanzin · 2017-09-21T17:19:55Z

LGTM, merging to master / 2.2 / 2.1.

## What changes were proposed in this pull request?

We have to make sure that SerializerManager's private instance of kryo also uses the right classloader, regardless of the current thread classloader. In particular, this fixes serde during remote cache fetches, as those occur in netty threads.

## How was this patch tested?

Manual tests and the existing suite via Jenkins. I haven't been able to reproduce this in a unit test, because when a remote RDD partition can't be fetched, there is a warning message and then the partition is just recomputed locally. I manually verified that the warning message is no longer present.

Author: Imran Rashid <irashid@cloudera.com>

Closes #19280 from squito/SPARK-21928_ser_classloader.

(cherry picked from commit b75bd17)
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>

vanzin · 2017-09-21T17:21:17Z

Didn't merge to 2.1, please open a PR against that branch if you want it there.

fix tests

20e3585

squito changed the title ~~[SPARK-21928][CORE] Set classloader on SerializerManager private kryo~~ [SPARK-21928][CORE] Set classloader on SerializerManager's private kryo Sep 21, 2017

asfgit closed this in b75bd17 Sep 21, 2017

squito deleted the SPARK-21928_ser_classloader branch September 25, 2017 20:57
