[SPARK-22756] [Build] [SparkR] Run SparkR tests if hive_thriftserver module has code changes #19944

gatorsmile · 2017-12-11T23:25:44Z

What changes were proposed in this pull request?

A recent PR that changed hive_thriftserver caused a test failure in the CRAN requirements check. To some extent, the SparkR module also depends on the hive_thriftserver module, which can output log files, so we should run the SparkR tests whenever the hive_thriftserver module has code changes.

How was this patch tested?

N/A

gatorsmile · 2017-12-11T23:26:24Z

dev/sparktestsupport/modules.py

@@ -481,7 +481,7 @@ def __hash__(self):

 sparkr = Module(
    name="sparkr",
-    dependencies=[hive, mllib],
+    dependencies=[hive, mllib, hive_thriftserver],


Since SparkR already depends on hive, this PR just adds the dependency on hive_thriftserver.
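To illustrate why adding a dependency changes which tests run, here is a minimal sketch of dependency-driven test selection. It is not the actual dev/sparktestsupport code: the `Module` class is heavily simplified, `modules_to_test`, `build_graph`, and `affected_names` are hypothetical helpers, and the assumption that hive_thriftserver depends on hive is only for illustration.

```python
# Simplified, hypothetical sketch; NOT the real dev/sparktestsupport/modules.py.
class Module:
    def __init__(self, name, dependencies=()):
        self.name = name
        self.dependencies = list(dependencies)
        self.dependent_modules = set()
        # Register this module as a dependent of each of its dependencies.
        for dep in self.dependencies:
            dep.dependent_modules.add(self)


def modules_to_test(changed):
    """Return the changed modules plus everything that transitively depends on them."""
    to_test = set()
    stack = list(changed)
    while stack:
        module = stack.pop()
        if module not in to_test:
            to_test.add(module)
            stack.extend(module.dependent_modules)
    return to_test


def build_graph(sparkr_depends_on_thriftserver):
    """Build a tiny module graph, with or without the dependency added in this PR."""
    hive = Module("hive")
    mllib = Module("mllib")
    # Assumption for illustration: the thrift server module depends on hive.
    thriftserver = Module("hive-thriftserver", [hive])
    deps = [hive, mllib]
    if sparkr_depends_on_thriftserver:
        deps.append(thriftserver)
    sparkr = Module("sparkr", deps)
    return {m.name: m for m in (hive, mllib, thriftserver, sparkr)}


def affected_names(graph, changed_name):
    """Names of all modules whose tests would run if `changed_name` changed."""
    return sorted(m.name for m in modules_to_test([graph[changed_name]]))


before = build_graph(sparkr_depends_on_thriftserver=False)
after = build_graph(sparkr_depends_on_thriftserver=True)

print(affected_names(before, "hive-thriftserver"))  # ['hive-thriftserver']
print(affected_names(after, "hive-thriftserver"))   # ['hive-thriftserver', 'sparkr']
```

In this sketch, a change to the thrift server module selects the SparkR tests only once sparkr lists it as a dependency, which is the effect the one-line diff is after.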

SparkQA · 2017-12-11T23:27:49Z

Test build #84735 has finished for PR 19944 at commit 1fd2d53.

This patch fails due to an unknown error code, 255.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2017-12-12T02:51:52Z

Test build #84736 has finished for PR 19944 at commit 6439458.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-12-12T05:46:41Z

cc @felixcheung Very weird... I can reproduce it in my local environment. Could you take a look at why SparkR failed?

Caused by: org.apache.spark.SparkException: R computation failed with
 Error : requireNamespace("e1071", quietly = TRUE) is not TRUE
    at org.apache.spark.api.r.RRunner.compute(RRunner.scala:108)
    at org.apache.spark.api.r.BaseRRDD.compute(RRDD.scala:51)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:324)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:288)
    at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)
    at org.apache.spark.scheduler.Task.run(Task.scala:108)
    at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:345)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    ... 1 more

HyukjinKwon · 2017-12-12T07:38:09Z

Hm, @gatorsmile. BTW, do you happen to know how the changes in the thrift server cause the CRAN check to fail? I was double-checking to be sure, but it seems orthogonal to me now.

The test failure above seems to be due to the e1071 package missing from your local environment.

felixcheung · 2017-12-12T12:06:54Z

Yes this seems unrelated - it’s just saying you need the package e1071 to run the test. Anyone know why SparkR tests are failing all of a sudden? There is nothing in the log file like this one https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84726/console

felixcheung · 2017-12-12T12:08:26Z

To clarify, it looks like the SparkR tests are failing in other PRs too.

HyukjinKwon · 2017-12-12T12:20:21Z

Yup, and it looks like it is passing again now (as of roughly a couple(?) of hours ago).

Seems the problem was this one(?)

* checking CRAN incoming feasibility ...Error in .check_package_CRAN_incoming(pkgdir) : 
  dims [product 39] do not match the length of object [0]
Execution halted
Loading required package: methods

HyukjinKwon · 2017-12-12T12:21:04Z

retest this please

SparkQA · 2017-12-12T15:39:17Z

Test build #84768 has finished for PR 19944 at commit 6439458.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-12-12T16:58:12Z

<property>
    <name>hive.server2.logging.operation.enabled</name>
    <value>true</value>
</property>
<property>
    <name>hive.server2.logging.operation.level</name>
    <value>VERBOSE</value>
</property>
<property>
    <name>hive.querylog.location</name>
    <value>/data/logs/hive/${user.name}</value>
</property>

I am afraid that the thriftserver PR could break it, because it writes to the log if hive.server2.logging.operation.enabled is set to true.
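As a rough way to check whether a given configuration turns this logging on, the sketch below parses Hadoop-style `<property>` entries and reports whether the flag is explicitly set to true. The enclosing `<configuration>` root element and the helper names `read_properties` and `explicitly_enables_operation_logging` are assumptions for illustration; the sketch deliberately says nothing about what Hive does when the property is absent.

```python
import xml.etree.ElementTree as ET

# The properties quoted above, wrapped in an assumed <configuration> root.
HIVE_SITE = """<configuration>
  <property>
    <name>hive.server2.logging.operation.enabled</name>
    <value>true</value>
  </property>
  <property>
    <name>hive.server2.logging.operation.level</name>
    <value>VERBOSE</value>
  </property>
  <property>
    <name>hive.querylog.location</name>
    <value>/data/logs/hive/${user.name}</value>
  </property>
</configuration>"""


def read_properties(xml_text):
    """Collect name/value pairs from every <property> element."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def explicitly_enables_operation_logging(props):
    """True only if the flag is present and set to 'true' (case-insensitive)."""
    value = props.get("hive.server2.logging.operation.enabled")
    return value is not None and value.lower() == "true"


props = read_properties(HIVE_SITE)
print(explicitly_enables_operation_logging(props))
print(props["hive.querylog.location"])
```

Variable references such as `${user.name}` are left unexpanded here; resolving them is outside the scope of this sketch.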

gatorsmile · 2017-12-12T16:59:29Z

How about merging this PR first and letting @zouchenjun resubmit the PR?

gatorsmile · 2017-12-12T18:48:10Z

cc @srowen @liancheng

fix.

1fd2d53

change the order

6439458

gatorsmile closed this Dec 12, 2017
