
"Failed to create Hive context" warning on EC2 instance (/tmp/hive not writable) #386

Closed
fereshtehRS opened this issue Dec 15, 2016 · 2 comments

@fereshtehRS

sparklyr version 0.4.41

Logging this for reference.

  • Brought up an EC2 cluster with Spark 1.6.1
  • After going through the setup (install packages, ...), connecting to Spark produced the following warning:
Warning messages:
1: In value[[3L]](cond) :
  java.lang.RuntimeException: java.io.IOException: Filesystem closed
    at org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:522)
    at org.apache.spark.sql.hive.client.ClientWrapper.<init>(ClientWrapper.scala:204)
    at org.apache.spark.sql.hive.client.IsolatedClientLoader.createClient(IsolatedClientLoader.scala:238)
    at org.apache.spark.sql.hive.HiveContext.executionHive$lzycompute(HiveContext.scala:218)
    at org.apache.spark.sql.hive.HiveContext.executionHive(HiveContext.scala:208)
    at org.apache.spark.sql.hive.HiveContext.functionRegistry$lzycompute(HiveContext.scala:462)
    at org.apache.spark.sql.hive.HiveContext.functionRegistry(HiveContext.scala:461)
    at org.apache.spark.sql.UDFRegistration.<init>(UDFRegistration.scala:40)
    at org.apache.spark.sql.SQLContext.<init>(SQLContext.scala:330)
    at org.apache.spark.sql.hive.HiveContext.<init>(HiveContext.scala:90)
    at org.apache.spark.sql.hive.HiveContext.<init>(HiveContext.scala:101)
    at sparklyr.Backend$.getOrCreateHiveCont [... truncated]
2: In create_hive_context_v1(sc) :
  Failed to create Hive context, falling back to SQL. Some operations, like window-functions, will not work
  • The root cause is not clear from that warning alone, but running the following surfaces it:
ctx <- spark_context(sc)

# Attempt to create the HiveContext directly so the underlying exception surfaces
invoke_new(
  sc,
  "org.apache.spark.sql.hive.HiveContext",
  ctx
)

which is:

 java.lang.RuntimeException: java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwx--x--x
  • The HDFS directory mentioned in the error had these permissions and ownership:
drwx--x--x   - rstudio supergroup          0 2016-12-15 16:32 /tmp/hive
  • Changed the permissions and group as follows:
drwxrwxr-x   - rstudio hadoop          0 2016-12-15 16:32 /tmp/hive

but even this was not enough. The only thing that worked was full write permissions:

drwxrwxrwx   - rstudio supergroup          0 2016-12-15 16:32 /tmp/hive
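
For reference, a minimal sketch of applying that last workaround from the same R session (this assumes the hadoop binary is on the PATH and the session user may modify /tmp/hive; the "777" mode mirrors the drwxrwxrwx listing above):

# Make the Hive scratch dir world-writable on HDFS, then retry creating the HiveContext
system2("hadoop", c("fs", "-chmod", "777", "/tmp/hive"))
invoke_new(sc, "org.apache.spark.sql.hive.HiveContext", spark_context(sc))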
@edgararuiz-zz
Contributor

Hi @fereshtehRS, alternatively, the hadoop fs command can be used to update the folder's ownership and permissions.
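
Something like the following (a sketch, not tested here; it assumes hadoop is on the PATH and the current user may change /tmp/hive) would reset the owner and group from R:

# Restore the owner/group shown in the original listing
system2("hadoop", c("fs", "-chown", "rstudio:supergroup", "/tmp/hive"))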

@javierluraschi
Collaborator

We've been using EMR heavily and haven't hit this one; perhaps an older EMR version triggers it. I think we can close this at this point.
