java.util.NoSuchElementException: key not found: path #230

Closed
onsDridi opened this issue Jul 11, 2016 · 10 comments

Comments

@onsDridi

onsDridi commented Jul 11, 2016

I'm trying to test this code:

from pyspark.sql import SQLContext
from pyspark import SparkContext
sc = SparkContext(appName="Connect Spark with Redshift")
sql_context = SQLContext(sc)
sc._jsc.hadoopConfiguration().set("fs.s3n.awsAccessKeyId", "ACCESSID")
sc._jsc.hadoopConfiguration().set("fs.s3n.awsSecretAccessKey", "ACEESKEY")
df = sql_context.read \
    .option("url", "jdbc:redshift://example.coyf2i236wts.eu-central1.redshift.amazonaws.com:5439/agcdb?user=user&password=pwd") \
    .option("dbtable", "table_name") \
    .option("tempdir", "s3://bucket/path") \
    .load()

but I'm getting this error:

[Screenshot of the stack trace ending in: java.util.NoSuchElementException: key not found: path]

Any ideas?

@JoshRosen
Contributor

I think that you need to add .format("com.databricks.spark.redshift") to your sql_context.read call; my hunch is that Spark can't infer the format for this data source, so you need to explicitly specify that we should use the spark-redshift connector.

(This is an unhelpful error message in Spark; I'll see if there's a way to provide a more helpful one).
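
For example, keeping the placeholder connection URL, table name, and tempdir from your snippet, the read would look roughly like this:

df = sql_context.read \
    .format("com.databricks.spark.redshift") \
    .option("url", "jdbc:redshift://example.coyf2i236wts.eu-central1.redshift.amazonaws.com:5439/agcdb?user=user&password=pwd") \
    .option("dbtable", "table_name") \
    .option("tempdir", "s3://bucket/path") \
    .load()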

@onsDridi
Author

Thank you Josh, I tried adding it, but now I'm getting this error: java.lang.ClassNotFoundException: Failed to find data source: com.databricks.spark.redshift

@JoshRosen
Contributor

Is the Redshift connector JAR on your Spark driver's classpath?

@onsDridi
Author

Yes, I used this command to run the script:
bin/spark-submit --driver-class-path path/RedshiftJDBC41-1.1.17.1017.jar script.py
I also tried
bin/spark-submit redshiftTestCode/sparkRedshift.py --jars path/RedshiftJDBC41-1.1.17.1017.jar
but I still get the same error.

@JoshRosen
Contributor

You also need to add the spark-redshift JAR; the Redshift JDBC driver is not sufficient by itself.
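
For example, something along these lines (the JAR names and paths below are just the files mentioned in this thread; yours will depend on the versions you downloaded):

bin/spark-submit --jars path/spark-redshift_2.10-1.0.0.jar,path/RedshiftJDBC41-1.1.17.1017.jar script.py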

@onsDridi
Author

I did that too, but I still have the same error.

@JoshRosen
Contributor

Can you post the exact command that you tried most recently and which didn't work?

@onsDridi
Author

I ran these commands:
bin/spark-submit redshiftTestCode/sparkRedshift.py --jars /Users/od/Documents/Work/spark-redshift_2.10-1.0.0.jar
bin/spark-submit redshiftTestCode/sparkRedshift.py --jars /Users/od/Documents/Work/RedsphiftJDBC41-1.1.17.1017.jar

Both commands give me the same error: java.util.NoSuchElementException: key not found: path

@JoshRosen
Contributor

Okay, and you also added .format("com.databricks.spark.redshift") to your code?

@tokland

tokland commented Sep 29, 2016

Note that you put --jars after the Python script, but it is an option of spark-submit and must come before the script path. For the record, this worked for me:

$ spark-submit --jars spark-redshift_2.10-1.1.0.jar,RedshiftJDBC.jar,minimal-json-0.9.4.jar test-redshift.py
