Learning to work with Spark
Jupyter Notebook Shell
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
data
downloads
notebooks
scripts
.gitignore
README.md
Vagrantfile

README.md

SparkVM

Starting the VM

The VM is built with Vagrant and VirtualBox. Start the VM with vagrant up. The provision scripts will install Python 3.5, Java, Spark 2.1, and pip.

Datasets

  • Credit Card Fraud data set downloaded from Kaggle on 03-May-2017.