- This practice consists of carrying out a study of a recommendation system based on ALS based on data from Movielens movies that can be downloaded from https://movielens.org.
- The data of the study constitute two files (ratings1k.csv and ratings100k.csv) composed of three columns identified by userId, movieId and rating.
- In order to carry out the whole process, the Cloudera environment has been used and within it the Anaconda environment has been integrated with Jupyter Notebook, Spark and the Python API for access to Spark called PySpark.
*You agree not to republish and copy partly or totally this project without express authorization from owner