Skip to content

LucDemortier/MoviesAndStars

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MoviesAndStars

Project Luther for Spring 2015 Metis Data Science Bootcamp. See blog post at lucdemortier.github.io for a description of the results.

iPython notebooks used to generate the results and plots for the MoviesAndStars project:

  1. MovieScrape:
  • Scrapes BoxOfficeMojo website for info about all movies released during a given year.
  • Stores the result in a json file of the form XXXX_all_movie_dict.json, where XXXX is the year.
  1. OscarScrape:
  • Scrapes the Oscars webpages and collects all nominations and wins in categories containing the keywords "Actor", "Actress", and "Directing" from 1950 until 2015.
  • Stores the result in json file oscars.json.
  • Corrects various mistakes and stores the result in oscars_c1.json.
  • Creates a dictionary keyed by Actor/Actress/Director name, and with values equal to [no. of nominations, no. of wins].
  • Stores the result in aad_oscars.json.
  1. AnalyzeThis:
  • Reads in the json files created by MovieScrape and OscarScrape and creates a dictionary of movie features, including the star value of a movie's cast, which is defined as the total number of oscar wins and nominations by actors, actresses, and director of the movie, won in years previous to the movie's release.
  • Turns above dictionary into a Pandas dataframe for further processing.
  • Generates all the plots shown on the blog, and a few more.
  • Performs a quantile regression of Total Domestic Revenue versus Star Value of Cast.
  • Performs an ordinary linear regression of Total Domestic Revenue versus Production Budget.

In addition to these iPython notebooks, the repository also contains a short document, BrainStorming.pdf, with the results of a brainstorming session done at the beginning of this project with a few Metis colleagues.

About

Can we use the "star value" of a movie's cast to predict revenue?

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published