Sample does not work for Mesos Cluster #23

yanglei99 · 2015-12-03T19:59:01Z

I can run sample using spark-submit against a --master=local[2]. However when I target it to my mesos cluster, I got NPE: by class water.parser.ParseSetup$GuessSetupTsk; class java.lang.NullPointerException: null
at water.parser.ParseSetup$GuessSetupTsk.map(ParseSetup.java:269)
at water.MRTask.compute2(MRTask.java:624)
at water.H2O$H2OCountedCompleter.compute(H2O.java:1017)

If the issue is related to how the sample is loading the data, I wonder if we should be creating some sample that will work with remote clusters, e.g. hosting the data on S3..

Thank you .

Yang.

mmalohlava · 2015-12-03T22:55:45Z

Hi Yang,

if you would like to point to some data store on HDFS/S3, please point URI to them:

val hf = new H2OFrame(new java.net.URI("hdfs//mynamenode/mydirectory/myfile.csv"))

The same for S3.

If you would like to parse a local file, it has to be distributed to each node in the cluster:
val hf = new H2OFrame(new java.io.File("/my/cluster/datastore/file.csv"))

Right now, we our API does not provide a shortcut for upload of local file (open issue:
https://0xdata.atlassian.net/browse/SW-56).

Thank you
Michal

On 12/3/15 11:59 AM, Yang Lei wrote:

I can run sample using spark-submit against a --master=local[2]. However when I target it to my
mesos cluster, I got NPE: by class water.parser.ParseSetup$GuessSetupTsk; class
java.lang.NullPointerException: null
at water.parser.ParseSetup$GuessSetupTsk.map(ParseSetup.java:269)
at water.MRTask.compute2(MRTask.java:624)
at water.H2O$H2OCountedCompleter.compute(H2O.java:1017)

If the issue is related to how the sample is loading the data, I wonder if we should be creating
some sample that will work with remote clusters, e.g. hosting the data on S3..

Thank you .

Yang.

—
Reply to this email directly or view it on GitHub
#23.

yanglei99 · 2015-12-04T15:11:34Z

Thanks Michal,

So basically we are saying the samples only work for local mode. That is the reason I asked if we should host the sample data somewhere like s3 and so it can run out of box.

I will lose the issue now. Thank you.

Yang

yanglei99 · 2015-12-04T15:18:31Z

Another thought is if the sample can read the "SPARKLING_WATER_HOME" to construct the full path of where the file is. So that as long as the target slaves also having the Sparking Water installed, it will be able to load the file.

Thanks. Yang.

yanglei99 · 2015-12-04T18:19:29Z

verified the sample works after changing the file location to be downloadable.

crystalfuns · 2017-01-25T16:56:22Z

How to connect sparklingwater to DCOS Mesos Spark Cluster ?

mmalohlava · 2017-01-25T17:10:09Z

Right now, we do not provide any explicit support for DC/OS. However, any feedback, recommendations, or requirements are welcomed.

yanglei99 closed this as completed Dec 4, 2015

yanglei99 reopened this Dec 4, 2015

yanglei99 closed this as completed Dec 4, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sample does not work for Mesos Cluster #23

Sample does not work for Mesos Cluster #23

yanglei99 commented Dec 3, 2015

mmalohlava commented Dec 3, 2015

yanglei99 commented Dec 4, 2015

yanglei99 commented Dec 4, 2015

yanglei99 commented Dec 4, 2015

crystalfuns commented Jan 25, 2017

mmalohlava commented Jan 25, 2017

Sample does not work for Mesos Cluster #23

Sample does not work for Mesos Cluster #23

Comments

yanglei99 commented Dec 3, 2015

mmalohlava commented Dec 3, 2015

yanglei99 commented Dec 4, 2015

yanglei99 commented Dec 4, 2015

yanglei99 commented Dec 4, 2015

crystalfuns commented Jan 25, 2017

mmalohlava commented Jan 25, 2017