This folder contains all of the data used in The Pudding essay Film or Digital: Breaking Down Hollywood's Choice of Shooting Medium published in August 2018.

Below you'll find metadata for each file.

top_movies_data.csv

  • What is this?: Data on the top 100 movies at the US box office for each year from 2006 to 2017, along with the related information used for the analysis in the essay.
  • Source(s) & Methods: IMDb and The Numbers.
    • The Numbers' list of US cumulative box office records is used to determine the top 100 movies each year.
    • Genres, directors, cameras, and negative formats were collected from IMDb bulk data and processed using IMDbPY.
    • Budget information was collected from The Numbers (the primary source) and from IMDb (when the information was missing from The Numbers).
    • Film type is determined from the camera and negative format information.
  • Last Modified: August 12, 2018
  • Contact Information: Damar Aji Pramudita
  • Spatial Applicability: United States
  • Temporal Applicability: 2006 - 2017
  • Observations (Rows): Each row represents a movie in the top 100 US box office between 2006 and 2017.
  • Variables (Columns):
| Header | Description | Data Type |
| --- | --- | --- |
| production_year | Production year of a movie | number |
| id | Unique number assigned to a movie | number |
| title | Title of a movie | text |
| directors | Director(s) of a movie. Multiple directors are separated by a `\|`. | text |
| genres | Genre(s) of a movie. Multiple genres are separated by a `\|`. | text |
| cameras | Camera(s) and lens(es) used in a movie, as listed in the IMDb technical specs section. Multiple cameras/lenses are separated by a `\|`. | text |
| negative_format | Negative format(s) in which a movie was recorded, as listed in the IMDb technical specs section. Multiple formats are separated by a `\|`. | text |
| budget | Budget of a movie (in nominal US$, not adjusted for inflation) | number |
| budget_source | Source of the budget information. `the-numbers`: taken from The Numbers website; `imdb`: taken from IMDb. | text |
| film_type | Medium of a movie. `D`: digital; `F`: film; `D\|F`: both digital and film; `U`: unknown medium. This field is determined from the cameras and negative formats used in a movie. | text |
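As a rough illustration of how the multi-value columns might be consumed, the sketch below reads rows with Python's standard `csv` module, splits the multi-value fields into lists, and tallies `film_type` per production year. It assumes the multi-value separator is the pipe character (`|`). The sample rows, the helper names `load_movies` and `film_type_counts_by_year`, and every value shown are made up for illustration and are not taken from the dataset.

```python
import csv
import io
from collections import Counter

# Made-up placeholder rows that follow the column layout described above;
# they are NOT taken from top_movies_data.csv.
SAMPLE_CSV = """production_year,id,title,directors,genres,cameras,negative_format,budget,budget_source,film_type
2010,1,Example Movie A,Director One|Director Two,Action|Drama,Camera X,35 mm,1000000,the-numbers,F
2012,2,Example Movie B,Director Three,Comedy,Camera Y,Digital,2000000,imdb,D
2014,3,Example Movie C,Director One,Drama,Camera X|Camera Y,35 mm|Digital,3000000,the-numbers,D|F
"""

MULTI_VALUE_COLUMNS = ("directors", "genres", "cameras", "negative_format")
SEPARATOR = "|"  # assumption: separator used inside multi-value fields


def load_movies(handle):
    """Read movie rows, splitting multi-value fields into lists."""
    movies = []
    for row in csv.DictReader(handle):
        for column in MULTI_VALUE_COLUMNS:
            # Empty cells become empty lists rather than [""].
            row[column] = [v.strip() for v in row[column].split(SEPARATOR) if v.strip()]
        row["production_year"] = int(row["production_year"])
        # A missing budget is kept as None instead of being guessed.
        row["budget"] = float(row["budget"]) if row["budget"] else None
        movies.append(row)
    return movies


def film_type_counts_by_year(movies):
    """Count film_type codes (D, F, D|F, U) for each production year."""
    counts = {}
    for movie in movies:
        year_counter = counts.setdefault(movie["production_year"], Counter())
        year_counter[movie["film_type"]] += 1
    return counts


movies = load_movies(io.StringIO(SAMPLE_CSV))
counts = film_type_counts_by_year(movies)
```

To try this on the actual file, an open file handle can be passed instead of the in-memory sample, for example `open("top_movies_data.csv", newline="")`. The file's encoding is not documented here, so it may need to be specified explicitly.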

top_directors_data.csv

  • What is this?: Data on the filmographies of top directors, along with the related information used for the analysis in the essay.
  • Source(s) & Methods: IMDb and The Numbers.
    • Top directors are those with at least one movie in the top movies list above.
    • Genres, directors, cameras, and negative formats were collected from IMDb bulk data and processed using IMDbPY.
    • Film type is determined from the camera and negative format information.
  • Last Modified: August 12, 2018
  • Contact Information: Damar Aji Pramudita
  • Spatial Applicability: United States
  • Temporal Applicability: 2006 - 2017
  • Observations (Rows): Each row represents a movie made by a top director between 2006 and 2017.
  • Variables (Columns):
| Header | Description | Data Type |
| --- | --- | --- |
| production_year | Production year of the movie | number |
| id | Unique number assigned to the movie | number |
| title | Title of the movie | text |
| director | Name of one director of the movie. This field contains only one director. If a movie has more than one director, the title appears in multiple rows, one for each director listed in this column. | text |
| director_id | Unique number assigned to the director listed in the director column | number |
| co_directors | List of all directors involved in the movie. Multiple directors are separated by a `\|`. | text |
| co_directors_id | Unique number assigned to each director listed in the co_directors column. Multiple director ids are separated by a `\|`. | text |
| genres | Genre(s) of a movie. Multiple genres are separated by a `\|`. | text |
| cameras | Camera(s) and lens(es) used in a movie, as listed in the IMDb technical specs section. Multiple cameras/lenses are separated by a `\|`. | text |
| negative_format | Negative format(s) in which a movie was recorded, as listed in the IMDb technical specs section. Multiple formats are separated by a `\|`. | text |
| film_type | Medium of a movie. `D`: digital; `F`: film; `D\|F`: both digital and film; `U`: unknown medium. This field is determined from the cameras and negative formats used in a movie. | text |
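Because a co-directed movie appears once per director in this file, counting rows is not the same as counting movies. The sketch below shows one way this layout might be handled: grouping rows by `director_id` to build a filmography, and collapsing rows by `id` to recover unique movies. It assumes the pipe character (`|`) is the separator in `co_directors` and `co_directors_id`. The sample rows and the helper names `filmography_by_director` and `unique_movies` are made up for illustration and do not come from the dataset.

```python
import csv
import io
from collections import defaultdict

# Made-up placeholder rows that follow the column layout described above;
# they are NOT taken from top_directors_data.csv.
SAMPLE_CSV = """production_year,id,title,director,director_id,co_directors,co_directors_id,genres,cameras,negative_format,film_type
2010,1,Example Movie A,Director One,101,Director One|Director Two,101|102,Drama,Camera X,35 mm,F
2010,1,Example Movie A,Director Two,102,Director One|Director Two,101|102,Drama,Camera X,35 mm,F
2014,3,Example Movie C,Director One,101,Director One,101,Drama,Camera Y,Digital,D
"""


def filmography_by_director(rows):
    """Map each director_id to a year-sorted list of (year, title, film_type)."""
    films = defaultdict(list)
    for row in rows:
        entry = (int(row["production_year"]), row["title"], row["film_type"])
        films[row["director_id"]].append(entry)
    for entries in films.values():
        entries.sort()
    return dict(films)


def unique_movies(rows):
    """Keep the first row seen for each movie id, dropping per-director repeats."""
    seen = {}
    for row in rows:
        seen.setdefault(row["id"], row)
    return list(seen.values())


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
films = filmography_by_director(rows)
movies = unique_movies(rows)
```

In this made-up sample, three rows collapse to two unique movies, and the director with id `101` has two entries in `films` while the director with id `102` has one.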