WuKong

WuKong is a big data pipeline toolkit implementation by Spark and Scala.

Modules

Add your key $ ssh-add your-key-path
Send the jar to cluster $ rsync -avz /path/wukong-assembly-0.1.jar username@ip:/path
Submit Job to Yarn $ /opt/spark-1.5/bin/spark-submit --class com.alvin.wukong.apps.ETLApp --master yarn-cluster --num-executors 8 --executor-memory 12g --driver-memory 4g --executor-cores 8 --driver-java-options "-XX:MaxPermSize=2048m -Dconfig.resource=/dev.conf " --files /path/ETLApp.conf /path/wukong-assembly-0.1.jar --env dev --configFile ETLApp.conf

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
project		project
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt