
jupyter-scala with Spark does not see imports #63

Closed

mmatloka opened this issue Mar 16, 2016 · 5 comments

@mmatloka

I am running Jupyter Notebook on a machine that has a configured Spark distribution available (spark-submit, spark-defaults, etc.). The Python and R kernels work, but I have issues with jupyter-scala. I try to run the following code inside a notebook:

import org.apache.spark.SparkConf
import org.apache.spark.SparkContext
import org.apache.spark.SparkContext._
import com.datastax.spark.connector._

val conf = new SparkConf(true)
val sc = new SparkContext("spark://127.0.0.1:7077", "test", conf)

but it seems the jars are not visible to the kernel:

Main.scala:29: object datastax is not a member of package com
 ; import org.apache.spark.SparkConf ; import org.apache.spark.SparkContext ; import com.datastax.spark.connector._ ; val conf = { () =>
                                                                                         ^
Main.scala:29: object apache is not a member of package org
 ; import org.apache.spark.SparkConf ; import org.apache.spark.SparkContext ; import com.datastax.spark.connector._ ; val conf = { () =>
              ^
Main.scala:30: not found: type SparkConf
new SparkConf(true) 
    ^
Main.scala:29: object apache is not a member of package org
 ; import org.apache.spark.SparkConf ; import org.apache.spark.SparkContext ; import com.datastax.spark.connector._ ; val conf = { () =>
                                                  ^
Main.scala:33: not found: type SparkContext
new SparkContext("spark://127.0.0.1:7077", "test", conf) 
    ^

I have tried adding the following before it:

classpath.addPath("/var/lib/spark/lib/spark-assembly-1.6.0-hadoop2.6.0.jar")
classpath.add("datastax" % "spark-cassandra-connector" % "1.6.0-M1-s_2.11")

but nothing changes.

I am running jupyter-scala from https://git.io/vzhRi .
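
For reference, more recent jupyter-scala kernels resolve dependencies through Ammonite-style import $ivy statements rather than classpath.add. A minimal sketch, assuming such a kernel and assuming the Maven coordinates and versions below match this cluster:

// Resolve Spark and the Cassandra connector from Maven Central
// (coordinates and versions here are assumptions; adjust to the cluster)
import $ivy.`org.apache.spark::spark-core:1.6.0`
import $ivy.`com.datastax.spark::spark-cassandra-connector:1.6.0-M1`

import org.apache.spark.{SparkConf, SparkContext}
import com.datastax.spark.connector._

// Connect to the standalone master used above
val conf = new SparkConf(true)
val sc = new SparkContext("spark://127.0.0.1:7077", "test", conf)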

@drizham

drizham commented Mar 16, 2016

This is how I went about solving a similar issue: I installed Apache Toree from here: https://github.com/apache/incubator-toree

@mmatloka
Author

@drizham Thanks, but unfortunately Toree supports only Scala 2.10?

@alexarchambault
Member

@mmatloka @drizham Sorry, I've been kind of lacking time recently for jupyter-scala (although my colleagues and I are using it quite a lot - with Spark in particular - at my $$ job).

This notebook illustrates how I usually use it to run things with Spark on a YARN cluster. I intend to improve the docs after the 1.0 final release of coursier, which jupyter-scala uses extensively.
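
For a rough idea of what such a session looks like, here is a generic sketch, not the linked notebook's actual content: artifact names, versions, and the yarn-client master are assumptions, and the real notebook relies on jupyter-scala's own helpers to ship the kernel's jars to the cluster.

// Resolve Spark with YARN support inside the kernel
// (versions are assumptions; match them to the cluster)
import $ivy.`org.apache.spark::spark-core:1.6.0`
import $ivy.`org.apache.spark::spark-yarn:1.6.0`

import org.apache.spark.{SparkConf, SparkContext}

// yarn-client mode: the driver stays in the notebook process,
// executors run on the YARN cluster (HADOOP_CONF_DIR must be visible)
val conf = new SparkConf()
  .setMaster("yarn-client")
  .setAppName("notebook-session")

val sc = new SparkContext(conf)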

@alexarchambault
Member

Also, you should follow the updated README to install it.

@dimon777

dimon777 commented Mar 29, 2018

I ran into the same issue with Spark 2.3 and Scala 2.11. I used this repo to integrate with Jupyter but I am seeing the same error. Is there a Toree-agnostic way to integrate Spark+Scala with Jupyter?
Also, the Spark assembly jar is no longer shipped with the Spark distribution. Perhaps your README and this notebook need to be updated.
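
One Toree-agnostic route, sketched below, is the almond kernel (the successor of jupyter-scala) with its almond-spark module; the coordinates and versions here are assumptions and should be checked against the almond docs:

// Resolve Spark 2.x and the almond-spark helper inside the kernel
// (versions below are assumptions)
import $ivy.`org.apache.spark::spark-sql:2.3.0`
import $ivy.`sh.almond::almond-spark:0.3.0`

import org.apache.spark.sql._

// NotebookSparkSession is almond-spark's wrapper around SparkSession;
// it reports job progress in the notebook frontend
val spark = NotebookSparkSession.builder()
  .master("local[*]")
  .getOrCreate()

val sc = spark.sparkContext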
