code used to generate the minrva team's submission to the 2018 RecSys Challenge in the creative track
load-data
LICENSE
README.md
data.md
generate-rec.md

RecSys-Challenge-2018-creative-track

Engine:

Apache PredictionIO (http://predictionio.apache.org/) with its 'similar product' template (https://github.com/apache/predictionio-template-similar-product), which uses the Alternating Least Squares (ALS) algorithm as implemented in Spark MLlib (https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html). The system relied on PostgreSQL 10 (https://www.postgresql.org/) for data persistence.
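To illustrate the idea behind an ALS-based "similar item" engine: ALS factorizes the interaction data into one latent vector per item, and similar items are then those whose vectors point in a similar direction. The sketch below is plain Python rather than the Scala code of the template, and it only covers the ranking step. The item ids and factor values are hypothetical, and the use of cosine similarity as the score is an assumption about how such a template typically ranks items, not a statement of what this submission ran.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(item_factors, query_id, top_n=3):
    """Rank every other item by cosine similarity to the query item's vector."""
    query = item_factors[query_id]
    scored = [
        (other_id, cosine(query, vec))
        for other_id, vec in item_factors.items()
        if other_id != query_id
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Hypothetical rank-2 item factors, standing in for the output of an ALS run.
item_factors = {
    "item-a": [1.0, 0.0],
    "item-b": [0.9, 0.1],
    "item-c": [0.0, 1.0],
    "item-d": [-1.0, 0.0],
}

ranked = most_similar(item_factors, "item-a", top_n=2)
for item_id, score in ranked:
    print(item_id, round(score, 4))
```

In a real deployment the factor vectors would come from the trained ALS model instead of a hand-written dictionary, and the rank (vector length) would be a tuning parameter of the engine.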

Software:

Data Set:

VIAF: http://viaf.org/viaf/data/

The Virtual International Authority File (VIAF) is an OCLC service, built in cooperation with national libraries and other partners, that virtually combines multiple LAM (library, archive, and museum) name authority files into a single name authority service.

System:

  • Ubuntu 18.04 LTS (GNU/Linux 4.15.0-20-generic x86_64)
  • java-8-openjdk-amd64
  • 300GB RAM, 16 CPU Cores, 340GB SSD
  • psql (10.4 (Ubuntu 10.4-0ubuntu0.18.04))