Welcome to the Netflix Dataset Analysis repository! This project is dedicated to cleaning, analyzing, and visualizing data from Netflix. Our analysis focuses on uncovering insights into the types of shows available, the distribution of movies and TV shows, director contributions, and much more.
The dataset used for this analysis is sourced from Kaggle, a platform for predictive modelling and analytics competitions. It contains detailed information about the movies and TV shows available on Netflix up to 2021. You can access the dataset using the following link:
Netflix Data: Cleaning, Analysis and Visualization
This repository contains the following files:
-
analyse.ipynb
: A Jupyter notebook containing all the Python code used for the data cleaning, analysis, and visualization. This notebook is well-documented with markdown cells explaining each step of the process. -
netflix1.csv
: The dataset file that contains data about Netflix movies and TV shows.
The analysis covers various aspects of the Netflix data, including:
- Data Cleaning: Renaming columns, handling missing values, and converting data types.
- Exploratory Data Analysis (EDA): Analyzing the distribution of content types, the popularity of different genres, release patterns, and ratings.
- Visualization: Creating visual representations for a better understanding of the data using libraries like Matplotlib and Seaborn.
This project includes a variety of visualizations to aid in understanding the data better:
- Pie charts showing the proportion of Movies vs. TV Shows.
- Bar charts depicting the top countries producing Netflix content.
- Histograms visualizing the distribution of release years.
- More complex graphs showing the frequency of different ratings among directors and genres.
To view the analysis:
- Ensure that you have Jupyter Notebook installed, or use an IDE that supports IPython Notebooks.
- Download
netflix1.csv
andanalyse.ipynb
. - Open the
analyse.ipynb
file in Jupyter Notebook to view and run the code.
If you have any feedback or would like to discuss this project further, feel free to connect with me on my professional and social platforms:
Feel free to contribute or suggest improvements!
Thank you for visiting this repository, and I hope you find the insights from this analysis useful!