Skip to content

This project is an example for reading data from Impala (using impala for transformation) as a Spark DataFrame and writing objects from Spark into oracle database using JDBC.

Notifications You must be signed in to change notification settings

alisheykhi/Spark-Impala-Example

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark-Impala-Example

This project is an example for reading data from Impala (using impala for transformation) as a Spark DataFrame and writing objects from Spark into oracle database using JDBC.

Version Compatibility

Scala Spark Impala JDBC Oracle JDBC sbt
2.11.12 2.4.0 2.6.23.1028 odbc8 1.3.13

Build from Source

$ sbt assembly

Run

in order to run this application in kerberos enabled environment use the following command. you have to create your jaas.config file based on your production.

spark-submit
 --deploy-mode cluster
 --files "spark_jaas.conf#spark_jaas.conf,your_keytabfile..keytab#your_keytabfile..keytab"
 --driver-java-options "-Djava.security.auth.login.config=./spark_jaas.conf"
 --conf "spark.executor.extraJavaOptions=-Djava.security.auth.login.config=./spark_jaas.conf"
 --conf spark.yarn.submit.waitAppCompletion=false
 --driver-memory 16G
 --name SPARK_IMPALA
 --files config.ini,log4j.properties,spark_jaas.conf,your_keytabfile..keytab
 path_to_your_jar_file.jar path_to_log4j.properties path_to_config.ini

About

This project is an example for reading data from Impala (using impala for transformation) as a Spark DataFrame and writing objects from Spark into oracle database using JDBC.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages