Skip to content

A deep dive into how Netflix, Youtube and other videos providers recommend relevant material to their audience

Notifications You must be signed in to change notification settings

BeTechLabs/Movies-Recommendation-Case-Study-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Movies Recommendations Case Study

Movies Recommendations Case Study

Introduction

What movie should I watch this evening?, Have you ever had to answer this question at least once when you came home from work? As for me — yes, and more than once. From Netflix to Hulu, the need to build robust movie recommendation systems is extremely important given the huge demand for personalized content of modern consumers.

An example of recommendation system is such as this:

  • User A watches Game of Thrones and Breaking Bad.
  • User B does search on Game of Thrones, then the system suggests Breaking Bad from data collected about user A.

Recommendation systems are used not only for movies, but on multiple other products and services like Goodreads (Books) Amazon (Items), Pandora/Spotify (Music), Google (News, Search), YouTube or Netflix (Videos) etc.

Two most ubiquitous types of personalized recommendation systems are Content-Based and Collaborative Filtering. Collaborative filtering produces recommendations based on the knowledge of users’ attitude to items, that is it uses the wisdom of the crowd to recommend items. In contrast, content-based recommendation systems focus on the attributes of the items and give you recommendations based on the similarity between them.

In this repository, We will attempt at implementing these two systems to recommend movies and evaluate them to see which one performs better.

Let’s get started ..

Dataset

One of the most common datasets that is available on the internet for building a Recommender System is the MovieLens DataSet. This version of the dataset that I'm working with (1M) contains 1,000,209 anonymous ratings of approximately 3,900 movies made by 6,040 MovieLens users who joined MovieLens in 2000.

The data was collected by GroupLens researchers over various periods of time, depending on the size of the set. This 1M version was released on February 2003. Users were selected at random for inclusion. All users selected had rated at least 20 movies. Each user is represented by an id, and no other information is provided.

Models

  • we worked in building 2 types of recommednation systems:
    • Content Based Filtering :

      that finds similars for specific item based on its content.

    • Collaborative Filtering :

      that finds similarity based on history of user and his prefrences with another users those interacts with same movies. and represent it by two methods :

      • Matrix Factorzation :

        uses SVD to predict approxmate user ratings.

      • Deep Learning :

        uses deep neural network to apply embedding on users and movies to predict approxmate user ratings.

Requirements

About

A deep dive into how Netflix, Youtube and other videos providers recommend relevant material to their audience

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published