GitHub - ranvirm/scala-spark-titanic-example-project: A Scala-Spark example project using the Kaggle Titanic dataset.

Overview

This project serves as an example of a scala-spark project using the Kaggle Titanic dataset

sbt package

spark-submit --class ModelTrain --master local[*] --driver-memory 4G target/scala-2.11/scalasparktitanicproject_2.11-1.0.jar

spark-submit --class ModelPredict --master local[*] --driver-memory 4G target/scala-2.11/scalasparktitanicproject_2.11-1.0.jar

Predictions data will be saved as a csv file in the predictions directory found in project root dir

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src/main		src/main
.gitignore		.gitignore
README.md		README.md
build.sbt		build.sbt