Skip to content

Investigating a dataset contains information about 10,000+ movies collected from The Movie Database (TMDb).

Notifications You must be signed in to change notification settings

kfrawee/TMDb-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TMDb Analysis made-with-python

Investigating a dataset contains information about 10,000+ movies collected from The Movie Database (TMDb).

Project Overview

This project is connected with the Introduction to Data Analysis course, but depending on your background knowledge, you may not need to take the whole class to complete this project.

In this project, I will analyze a dataset and then communicate my findings about it. I will use the Python libraries NumPy, pandas, and Matplotlib to make my analysis easier.

I will conduct my own data analysis and create a file to share that documents your findings. I will start by taking a look at your dataset and brainstorming what questions I could answer using it. Then I will use pandas and NumPy to answer the questions I am most interested in, and create a report sharing the answers.

About the dataset: TMDb movies data: This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue.

Download: TMDb movie data (cleaned from original data on Kaggle).

Requirments:

  • Python 3.7.9
  • NumPy 1.19.1
  • Pandas 1.1.2
  • matplotlib 3.3.1
  • seaborn 0.11.0

Execution:

In a terminal or command window, navigate to the project directory and run one of the following commands:

ipython notebook TMDb-Analysis.ipynb

OR

jupyter notebook TMDb-Analysis.ipynb

About

Investigating a dataset contains information about 10,000+ movies collected from The Movie Database (TMDb).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published