Recommendation example with Apache Spark2.3.1 and Apache Mahout0.13.0

As Apache Mahout0.13.0 is build on Scala2.10 which is no longer supported in Apache Spark2.x. we need to build mahout 0.13.0 in order to run with scala2.11 and Spark2.x.

This is an example for Correlated Cross-Occurrence algorithm playing Apache Mahout0.13.0 with the latest Apache Spark. I will update this project when Apache Mahout0.14.0 comes up.

Build Mahout0.13.0

$ git clone http://github.com/apache/mahout
$ cd mahout
$ mvn clean install -Pscala-2.11,spark-2.1 -DskipTests

or just download prebuilt mahout with Scala2.11 from
https://github.com/heroku/predictionio-buildpack/tree/master/repo/org/apache/mahout

Build this repository

$ mvn clean scala:compile package

Set up your datasources

Copy all datasources from kaggle to /opt/nfs.

You need to set up your NFS server and nfs directory as /opt/nfs when your spark is in cluster mode.

Run the driver program on your local machine

$ mvn exec:exec@run-local

Run the driver program on your Spark Cluster

$ mvn exec:exec@run-cluster

Confirm that your master node has a hostname as 'master' or you need to change the master url specifiled in pom.xml

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src/main/scala/com/sparkexample/recommendations		src/main/scala/com/sparkexample/recommendations
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Recommendation example with Apache Spark2.3.1 and Apache Mahout0.13.0

Build Mahout0.13.0

Build this repository

Set up your datasources

Run the driver program on your local machine

Run the driver program on your Spark Cluster

About

Releases

Packages

Languages

george-j-zhu/Spark2.3.1-Mahout0.13.0-example

Folders and files

Latest commit

History

Repository files navigation

Recommendation example with Apache Spark2.3.1 and Apache Mahout0.13.0

Build Mahout0.13.0

Build this repository

Set up your datasources

Run the driver program on your local machine

Run the driver program on your Spark Cluster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages