This project analyzes the Netflix Movies and TV Shows Dataset (from Kaggle) to explore trends in content type, release year, genre, and country distribution.
The goal is to understand how Netflixβs content library has evolved over time.
- Pandas β for data cleaning and manipulation
- NumPy β for numerical operations
- Matplotlib β for static visualizations
- Seaborn β for advanced and beautiful charts
- Data Understanding β explored dataset structure, shape, and column info.
- Data Cleaning β handled missing values and inconsistent data.
- Exploratory Data Analysis (EDA) β examined patterns in release years, genres, and countries.
- Visualization β plotted charts for better understanding.
- Insights & Conclusion β summarized key findings.
- Movies make up the majority of Netflixβs content.
- The United States and India contribute the highest number of titles.
- Most Netflix releases occurred between 2017β2019, showing rapid content expansion.
- Drama and Comedy are among the most popular genres.
πΉ : The Netflix library has more Movies than TV Shows. The pie chart shows that roughly 60β70% of the content is Movies and 30β40% are TV Shows. πΉ : United States and India have the highest number of titles. The top 5 countries hold a major share of the total content. πΉ : Drama and Comedy are the most popular genres. The genre-wise content is unevenly distributed, providing useful insights for strategic decisions. πΉ : Between 2017 and 2019, the rate of content addition was high. In recent years, Netflix has continued to add new titles consistently.
Recommendations: πΉ : Focus on popular genres and marketing efforts in high-content countries. πΉ : Explore new content opportunities in countries with lower content representation.
Kaggle - Netflix Movies and TV Shows Dataset
Python, Pandas, NumPy, Matplotlib, Seaborn, Data Cleaning, EDA, Data Visualization