Series data from IMDB

IMDB Ratings for TV/Streaming Series

A dataset of IMDB user ratings for episodes of popular TV and streaming series, along with the R code to generate your own version of it.

Each row in the dataset is one episode of a series taken from a handcrafted list of series. The variables are as follows:

* series_name <chr> Name of the series
* series_ep   <int> Episode index in the series, from 1 onwards
* season      <int> Season number, from 1 onwards
* season_ep   <int> Episode index in the season
* url         <chr> IMDB URL for the episode (e.g. "http://www.imdb.com/title/tt5174246/")
* Episode     <chr> Episode title
* UserRating  <dbl> IMDB user rating, calculated [as explained on their site](http://www.imdb.com/help/show_leaf?votestopfaq)
* UserVotes   <dbl> Number of votes behind the rating
* r1          <dbl> Proportion of users who rated this episode with score 1
* r2          <dbl> Proportion of users who rated this episode with score 2
* ...
* r10         <dbl> Proportion of users who rated this episode with score 10
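As a rough illustration of how the r1 to r10 columns can be used, the sketch below computes a plain arithmetic mean score from the proportions and checks that each row's proportions add up to roughly 1. The helper names, the tolerance, and the toy row are hypothetical and not part of this repository; the sum-to-one check is an assumption about the data, and the plain mean may differ from UserRating, which IMDB calculates in its own way.

```r
# Column names r1 ... r10, as listed in the variable description.
rating_cols <- paste0("r", 1:10)

# Hypothetical helper: arithmetic mean score implied by the vote proportions.
mean_from_proportions <- function(episodes) {
  props <- as.matrix(episodes[, rating_cols])
  as.vector(props %*% (1:10))
}

# Hypothetical helper: TRUE for rows whose proportions sum to about 1.
# The tolerance is an assumption that allows for rounding in the data.
proportions_sum_to_one <- function(episodes, tol = 0.01) {
  abs(rowSums(episodes[, rating_cols]) - 1) <= tol
}

# Toy row with made-up values: half the votes are 1s, half are 10s.
toy <- as.data.frame(
  matrix(0, nrow = 1, ncol = 10, dimnames = list(NULL, rating_cols))
)
toy$r1 <- 0.5
toy$r10 <- 0.5

print(mean_from_proportions(toy))
print(proportions_sum_to_one(toy))
```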

Analysing series

Open data/series_from_imdb.csv to start analysing series and episode data.
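A minimal starting point might look like the sketch below, which averages UserRating per series and season. It uses base R only so that it runs without extra packages; the helper name `summarise_seasons` and the toy rows are hypothetical, with made-up values that only mimic the column layout described above.

```r
# Hypothetical helper: mean user rating per series and season.
summarise_seasons <- function(episodes) {
  out <- aggregate(UserRating ~ series_name + season, data = episodes, FUN = mean)
  names(out)[names(out) == "UserRating"] <- "mean_rating"
  out[order(out$series_name, out$season), ]
}

# Toy rows with made-up values, only to show the expected columns.
toy <- data.frame(
  series_name = c("Show A", "Show A", "Show A", "Show B"),
  season      = c(1L, 1L, 2L, 1L),
  UserRating  = c(8.0, 9.0, 7.0, 6.0),
  stringsAsFactors = FALSE
)
toy_summary <- summarise_seasons(toy)
print(toy_summary)

# With the real file (path as given in this README):
# episodes <- read.csv("data/series_from_imdb.csv", stringsAsFactors = FALSE)
# summarise_seasons(episodes)
```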

Fetching the data / more data yourself

Dependencies

You'll need tidyverse and rvest.
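One way to check whether both packages are already available is sketched below. The helper `missing_packages` is hypothetical; the install call is left commented out so that nothing is downloaded without you choosing to do so.

```r
# Hypothetical helper: return the packages from `pkgs` that are not installed.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(c("tidyverse", "rvest"))
print(to_install)

# Uncomment to install whatever is missing from CRAN:
# if (length(to_install) > 0) install.packages(to_install)
```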

Running

Run get_series_data.R. It will fetch ratings for every episode of the series in series_urls.csv and save the result in data/series_from_imdb.csv.
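From an R session, the run could be sketched as follows. It assumes the working directory is the repository root and that the script resolves its input and output paths relative to that directory; neither assumption is confirmed by this README.

```r
script <- "get_series_data.R"
output <- "data/series_from_imdb.csv"

if (file.exists(script)) {
  # Fetches the ratings and writes the CSV, as described above.
  source(script)
  stopifnot(file.exists(output))
} else {
  message("Run this from the repository root; ", script, " was not found here.")
}
```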
