TMDb Analysis

Investigating a dataset that contains information about 10,000+ movies collected from The Movie Database (TMDb).

Project Overview

This project is connected with the Introduction to Data Analysis course, but depending on your background knowledge, you may not need to take the whole class to complete this project.

In this project, I will analyze a dataset and then communicate my findings about it. I will use the Python libraries NumPy, pandas, and Matplotlib to make my analysis easier.

I will conduct my own data analysis and create a file to share that documents my findings. I will start by taking a look at the dataset and brainstorming what questions I could answer using it. Then I will use pandas and NumPy to answer the questions I am most interested in, and create a report sharing the answers.

About the dataset: The TMDb movie dataset contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue.

Download: TMDb movie data (cleaned from original data on Kaggle).
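
As a rough illustration of the kind of question the dataset can answer, here is a minimal sketch that loads the CSV with pandas and averages a numeric column per release year. The function names and the column names `release_year` and `revenue` are assumptions made for this example; check them against the header of the downloaded file before relying on them.

```python
import pandas as pd


def load_movies(csv_path):
    """Read the movie CSV into a DataFrame (the path is supplied by the caller)."""
    return pd.read_csv(csv_path)


def mean_by_year(movies, value_col, year_col="release_year"):
    """Average `value_col` for each release year.

    The column names are assumptions for this sketch, not confirmed by the README.
    Missing values (NaN) are skipped by pandas' mean() by default.
    """
    return movies.groupby(year_col)[value_col].mean()
```

For example, `mean_by_year(load_movies(path_to_csv), "revenue")` would return a Series indexed by year, which can then be plotted with Matplotlib. Here `path_to_csv` stands for wherever the downloaded file was saved.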

Requirements:

  • Python 3.7.9
  • NumPy 1.19.1
  • pandas 1.1.2
  • matplotlib 3.3.1
  • seaborn 0.11.0
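
To see whether the local environment matches the versions listed above, a small check along these lines may help. It is only a sketch: the helper names `installed_version` and `compare_versions` are made up for this example, and it assumes each package exposes a `__version__` attribute (which these four do at the listed versions).

```python
import importlib

# Versions pinned in the Requirements list above.
PINNED = {
    "numpy": "1.19.1",
    "pandas": "1.1.2",
    "matplotlib": "3.3.1",
    "seaborn": "0.11.0",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it is missing or cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def compare_versions(pinned=PINNED):
    """Map each package name to a (pinned, installed) pair."""
    return {name: (wanted, installed_version(name)) for name, wanted in pinned.items()}


if __name__ == "__main__":
    for name, (wanted, found) in compare_versions().items():
        status = "ok" if found == wanted else "differs"
        print(f"{name}: pinned {wanted}, installed {found} ({status})")
```

A different installed version does not necessarily break the notebook; the check only flags where the environment departs from the one the analysis was written against.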

Execution:

In a terminal or command window, navigate to the project directory and run one of the following commands:

ipython notebook TMDb-Analysis.ipynb

OR

jupyter notebook TMDb-Analysis.ipynb