This project use Scala (with sbt), Spark and MLlib to predict customers clicks on advertising.
artifacts
: contains the compiled jarsoracle.jar
: the jar to predict new customers clicksmodel.jar
: the jar to build the model
common
: contains the common code such as data cleaning stuffmodel
: contains the code to build the modeloracle
: contains the code to predict new customers clicks
This repo comes with a prebuilt model. You should have:
- Java 1.8;
- Scala 2.11.11;
- sbt 1.0.3;
- Spark 2.2.0.
- Download or clone this repo
git clone https://github.com/bern-ads/DataAnalyser
- Go into the repo
cd DataAnalyser
- Go to the
artifacts
directorycd artifacts
- Submit the spark job
spark-submit --class "oracle.Main" --master local oracle.jar "<path_to_the_data_file>" "<path_to_selector_directory>" "<path_to_model_directory>"
You can find <path_to_selector_directory>
at the root the repository.
You can find <path_to_model_directory>
at the root the repository.
So the command should look like this:
spark-submit --class "oracle.Main" --master local oracle.jar "<path_to_the_data_file>" "../bernads.spark.selector" "../bernads.spark.model"