MoviesAndStars

Project Luther for Spring 2015 Metis Data Science Bootcamp. See blog post at lucdemortier.github.io for a description of the results.

iPython notebooks used to generate the results and plots for the MoviesAndStars project:

MovieScrape:

Scrapes BoxOfficeMojo website for info about all movies released during a given year.
Stores the result in a json file of the form XXXX_all_movie_dict.json, where XXXX is the year.

OscarScrape:

Scrapes the Oscars webpages and collects all nominations and wins in categories containing the keywords "Actor", "Actress", and "Directing" from 1950 until 2015.
Stores the result in json file oscars.json.
Corrects various mistakes and stores the result in oscars_c1.json.
Creates a dictionary keyed by Actor/Actress/Director name, and with values equal to [no. of nominations, no. of wins].
Stores the result in aad_oscars.json.

AnalyzeThis:

Reads in the json files created by MovieScrape and OscarScrape and creates a dictionary of movie features, including the star value of a movie's cast, which is defined as the total number of oscar wins and nominations by actors, actresses, and director of the movie, won in years previous to the movie's release.
Turns above dictionary into a Pandas dataframe for further processing.
Generates all the plots shown on the blog, and a few more.
Performs a quantile regression of Total Domestic Revenue versus Star Value of Cast.
Performs an ordinary linear regression of Total Domestic Revenue versus Production Budget.

In addition to these iPython notebooks, the repository also contains a short document, BrainStorming.pdf, with the results of a brainstorming session done at the beginning of this project with a few Metis colleagues.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
documents		documents
.gitignore		.gitignore
AnalyzeThis.ipynb		AnalyzeThis.ipynb
MovieScrape.ipynb		MovieScrape.ipynb
OscarScrape.ipynb		OscarScrape.ipynb
Pipfile		Pipfile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

documents

documents

.gitignore

.gitignore

AnalyzeThis.ipynb

AnalyzeThis.ipynb

MovieScrape.ipynb

MovieScrape.ipynb

OscarScrape.ipynb

OscarScrape.ipynb

Pipfile

Pipfile

README.md

README.md

Repository files navigation

MoviesAndStars

About

Releases

Packages

Languages

LucDemortier/MoviesAndStars

Folders and files

Latest commit

History

Repository files navigation

MoviesAndStars

About

Resources

Stars

Watchers

Forks

Languages