Skip to content

sid7219/Netflix-Data-Analysis-Using-SQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎬 Netflix Content Analysis Using SQL Server

πŸ“– Project Overview

This project explores Netflix's content catalog using SQL Server to uncover trends in content distribution, genre popularity, audience targeting, country-wise contribution, and platform growth over time.

The objective was to transform raw Netflix data into actionable business insights using SQL-based exploratory and analytical techniques.


🎯 Business Questions

This project aims to answer the following questions:

  1. What percentage of Netflix content consists of Movies and TV Shows?
  2. Which countries contribute the most content?
  3. Which genres dominate Netflix's catalog?
  4. Which ratings are most common?
  5. How much content is targeted toward Kids, Teens, and Adults?
  6. How has content production evolved over time?
  7. Which decades contributed the most content?
  8. Which directors have the highest number of titles?
  9. Which actors appear most frequently?
  10. How quickly has Netflix expanded its content library?
  11. Which genres dominated different release years?

πŸ—‚ Dataset Information

Metric Value
Total Titles 8,807
Movies 6,131
TV Shows 2,676

Dataset Features

  • Title
  • Type
  • Director
  • Cast
  • Country
  • Date Added
  • Release Year
  • Rating
  • Listed In (Genres)

πŸ›  SQL Skills Demonstrated

Data Exploration

  • Aggregate Functions
  • GROUP BY
  • ORDER BY

Data Cleaning

  • TRIM()
  • Handling NULL Values

Advanced SQL

  • STRING_SPLIT()
  • CROSS APPLY
  • Common Table Expressions (CTEs)

Window Functions

  • DENSE_RANK()
  • ROW_NUMBER()
  • COUNT() OVER()

Analytical Techniques

  • Ranking Analysis
  • Trend Analysis
  • Running Totals
  • Audience Segmentation
  • Top-N Analysis

πŸ“Š Key Findings

1️⃣ Content Distribution

Insight

  • Movies account for approximately 69.6% of Netflix content.
  • TV Shows account for approximately 30.4%.
  • Netflix's catalog is heavily movie-focused.

Content Distribution


2️⃣ Top Content Producing Countries

Insight

Top contributing countries:

  1. United States
  2. India
  3. United Kingdom
  4. Canada
  5. France

The United States dominates Netflix's catalog, while India has emerged as Netflix's second-largest content contributor.

Top Countries


3️⃣ Genre Analysis

Insight

Most popular genres:

  • International Movies
  • Dramas
  • Comedies
  • International TV Shows
  • Documentaries

Netflix places a strong emphasis on international and drama-based content.

Top Genres


4️⃣ Audience & Rating Analysis

Insight

  • TV-MA and TV-14 dominate the platform.
  • Adult-oriented content forms the largest segment.
  • Teen content is the second-largest category.
  • Kids-focused content represents a relatively small share of the catalog.

Rating Distribution


5️⃣ Director Analysis

Insight

Top Director:

Rajiv Chilaka β€” 22 Titles

A small group of directors repeatedly contribute content to Netflix's catalog.

Top Directors


6️⃣ Actor Analysis

Insight

Top Actor:

Anupam Kher β€” 43 Titles

Indian actors dominate the list of most frequently appearing actors.

Top Actors


7️⃣ Netflix Growth Analysis

Insight

Netflix experienced rapid content expansion after 2015.

Peak Growth Year: 2019

Approximately 1,999 titles were added during this year.

This period represents Netflix's most aggressive content acquisition phase.

Netflix Growth


8️⃣ Genre Dominance by Year

Insight

International Movies dominated most release years, while Dramas frequently emerged as the leading genre in other years.

This highlights Netflix's continued investment in globally distributed content.

Genre By Year


πŸ“Œ Business Conclusions

  • Netflix is primarily a movie-focused streaming platform.
  • The United States remains the largest content producer.
  • India is Netflix's second-largest content market.
  • International Movies and Dramas dominate the catalog.
  • Adult-oriented content forms the largest audience segment.
  • Netflix experienced rapid expansion between 2015 and 2019.
  • Modern content from the 2010s dominates the platform.
  • A small group of directors and actors appear repeatedly across the catalog.

πŸ“‚ Repository Structure

Netflix-SQL-Analysis/
β”‚
β”œβ”€β”€ README.md
β”‚
β”œβ”€β”€ dataset/
β”‚   └── netflix_titles.csv
β”‚
β”œβ”€β”€ sql/
β”‚   └── netflix_analysis.sql
β”‚
β”œβ”€β”€ screenshots/
β”‚   β”œβ”€β”€ 01_content_distribution.png
β”‚   β”œβ”€β”€ 02_top_countries.png
β”‚   β”œβ”€β”€ 03_top_genres.png
β”‚   β”œβ”€β”€ 04_rating_distribution.png
β”‚   β”œβ”€β”€ 05_top_directors.png
β”‚   β”œβ”€β”€ 06_top_actors.png
β”‚   β”œβ”€β”€ 07_content_growth.png
β”‚   └── 08_genre_by_year.png
β”‚
└── LICENSE (Optional)

πŸš€ Author

Siddharth Gupta

B.Tech in Geoinformatics
Netaji Subhas University of Technology (NSUT)

Skills Demonstrated

  • SQL Server
  • Data Analysis
  • Data Cleaning
  • Exploratory Data Analysis (EDA)
  • Window Functions
  • Data Storytelling
  • Business Intelligence Thinking

⭐ Project Outcome

This project demonstrates how SQL can be used to transform raw data into meaningful business insights through data cleaning, exploration, trend analysis, ranking techniques, and window functions.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors