Movie Recommender System

Version: Spark – 2.2.1, Python – 2.7

Command to run on terminal:

spark-submit [CF python file] [input file] [testing file]

The above command will generate an output file in the current directory

Implementation of User-User, Item-Item and Model Collaborative Filtering methods on the MovieLens Database.
Locality Sensitive Hashing was used to speed up computation of Item-Item pairs
Item-Item performs best with lowest RMSE of 0.94

Approximate running times:

Note: testing file must be a subset of input file and both should resemble the ratings.csv file on MovieLens Database

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
README.md		README.md
item-item-CF.py		item-item-CF.py
model-based-CF.py		model-based-CF.py
user-user-CF.py		user-user-CF.py

Provide feedback