Demo project showcasing the use of different Apache Spark libraries.
sparking-travelbug-data
- contains the sample data ingested/processed by the Spark jobssparking-travelbug-etl
- contains ETL (extract-transform-load) Spark jobs using Spark DataFrames and RDDs
Pre-requisites:
- Maven 3.x
- Java 1.7+
To build the project, just execute the following command:
$ mvn clean install
(In-progress..)