[SPARK-26489][CORE] Use ConfigEntry for hardcoded configs for python/r categories #23428

Closed · wants to merge 4 commits into apache:master from HeartSaVioR:SPARK-26489

Conversation

HeartSaVioR (Contributor)

What changes were proposed in this pull request?

This PR converts the hardcoded configs below to use ConfigEntry.

  • spark.pyspark
  • spark.python
  • spark.r

This patch doesn't change configs that are not relevant to SparkConf (e.g., system properties, Python source code).
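
For illustration, a minimal sketch of the conversion pattern, using the PYTHON_WORKER_REUSE entry from the diff below (the call sites shown are assumptions for illustration, not lines from the patch):

```scala
// Before: every call site repeats the raw key and its default value.
val reuse = conf.getBoolean("spark.python.worker.reuse", true)

// After: key, type, and default are declared once as a typed ConfigEntry...
private[spark] val PYTHON_WORKER_REUSE = ConfigBuilder("spark.python.worker.reuse")
  .booleanConf
  .createWithDefault(true)

// ...and call sites read it through the entry, getting the type and
// default for free.
val reuse = conf.get(PYTHON_WORKER_REUSE)
```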

How was this patch tested?

Existing tests.

@SparkQA commented Jan 2, 2019

Test build #100640 has finished for PR 23428 at commit b0fba98.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -733,4 +733,45 @@ package object config {
.stringConf
.toSequence
.createWithDefault(Nil)

private[spark] val PYTHON_WORKER_REUSE = ConfigBuilder("spark.python.worker.reuse")
  .booleanConf
  .createWithDefault(true)
Contributor (vanzin)

I'd rather have separate files for these (e.g. Python.scala, R.scala) to avoid polluting this object. See other examples in the config package.

Contributor Author (HeartSaVioR)

Thanks for the nice suggestion. I was thinking the config package object is already all-in-one (except History, Kafka, Status), so adding these configs here would be OK and we might get a chance to sort everything out later, but I agree we can do this now for clear-cut cases. Will address.
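
For reference, a minimal sketch of what such a split-out file could look like, following the per-topic objects already in the config package (the file path and object layout are assumptions; only PYTHON_WORKER_REUSE is taken from this PR's diff):

```scala
// Sketch of core/src/main/scala/org/apache/spark/internal/config/Python.scala:
// spark.python.* entries grouped in their own object instead of the
// catch-all `config` package object.
package org.apache.spark.internal.config

private[spark] object Python {
  val PYTHON_WORKER_REUSE = ConfigBuilder("spark.python.worker.reuse")
    .booleanConf
    .createWithDefault(true)
}
```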

@SparkQA commented Jan 3, 2019

Test build #100670 has finished for PR 23428 at commit f82d269.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Jan 3, 2019

Test build #100672 has finished for PR 23428 at commit 85ea372.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Jan 3, 2019

Test build #100675 has finished for PR 23428 at commit d0a345d.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon (Member)

retest this please

@@ -47,10 +48,8 @@ private[spark] class RBackend {

def init(): (Int, RAuthHelper) = {
val conf = new SparkConf()
val backendConnectionTimeout = conf.getInt(
"spark.r.backendConnectionTimeout", SparkRDefaults.DEFAULT_CONNECTION_TIMEOUT)
Member (HyukjinKwon)

The whole SparkRDefaults.scala file doesn't seem to be referenced anymore. Can we delete this file?

Contributor Author (HeartSaVioR)

Nice find! Will remove.
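
For context, the replacement pattern looks roughly like this (R_BACKEND_CONNECTION_TIMEOUT is an illustrative name for the new entry, and the default is assumed to mirror SparkRDefaults.DEFAULT_CONNECTION_TIMEOUT):

```scala
// The default now lives on the typed entry itself, so the constants in
// SparkRDefaults become dead code and the file can be deleted.
val R_BACKEND_CONNECTION_TIMEOUT = ConfigBuilder("spark.r.backendConnectionTimeout")
  .intConf
  .createWithDefault(6000) // assumed value of SparkRDefaults.DEFAULT_CONNECTION_TIMEOUT

// RBackend.init then reads the timeout through the entry:
val backendConnectionTimeout = conf.get(R_BACKEND_CONNECTION_TIMEOUT)
```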

@@ -733,4 +729,5 @@ package object config {
.stringConf
.toSequence
.createWithDefault(Nil)

Member (HyukjinKwon)

not a big deal at all but I'd remove this line.

Contributor Author (HeartSaVioR)

OK, will address.

@HyukjinKwon (Member) left a comment

Hm, shouldn't we move the other Python/R configurations into Python.scala and R.scala too? For instance, I'm seeing spark.pyspark.driver.python and spark.executor.pyspark.memory.

@SparkQA commented Jan 3, 2019

Test build #100677 has finished for PR 23428 at commit d0a345d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HeartSaVioR (Contributor Author) commented Jan 3, 2019

> shouldn't we move other Python/R configurations into Python.scala and R.scala?

I'm being a bit careful to avoid conflicts with another PR (#23415 addresses spark.executor). If #23415 is merged sooner, I can rebase and move the other entries as well.

@SparkQA commented Jan 3, 2019

Test build #100691 has finished for PR 23428 at commit c919a1c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin (Contributor) commented Jan 3, 2019

Looks good. Merging to master.

@asfgit closed this in 05372d1 on Jan 3, 2019
@HeartSaVioR (Contributor Author)

Thanks @vanzin and @HyukjinKwon for reviewing and merging!

@HeartSaVioR deleted the SPARK-26489 branch on January 3, 2019 at 22:43
@HyukjinKwon (Member)

LGTM too

jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request on Feb 18, 2019:

[SPARK-26489][CORE] Use ConfigEntry for hardcoded configs for python/r categories


Closes apache#23428 from HeartSaVioR/SPARK-26489.

Authored-by: Jungtaek Lim (HeartSaVioR) <kabhwan@gmail.com>
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>