# Netflix Data Analysis
## Objective
This notebook analyzes a dataset of Netflix titles, focusing on content trends, genre distribution, and ratings.

In [None]:
# Importing necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
sns.set(style='whitegrid')

# Load the dataset
df = pd.read_csv('netflix.csv')

# Display the first 5 rows
df.head()

## Data Overview
- The dataset contains information about Netflix's content, including movies and TV shows.
- We will start by inspecting the dataset for missing values and other basic information.
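
Raw missing-value counts are easier to judge next to the share of rows they represent. The sketch below is one possible way to summarize both. It runs on a small made-up frame, and `sample` and `missing_summary` are hypothetical names that are not part of the original notebook.

```python
import pandas as pd

# Toy stand-in for the Netflix data; the values are invented for illustration.
sample = pd.DataFrame({
    'title': ['A', 'B', 'C', 'D'],
    'director': ['Jane Doe', None, None, 'John Roe'],
    'country': ['India', 'United States', None, 'Japan'],
})


def missing_summary(frame):
    """Return per-column missing counts and percentages, most affected first."""
    counts = frame.isnull().sum()
    summary = pd.DataFrame({
        'missing': counts,
        'percent': (counts / len(frame) * 100).round(1),
    })
    return summary.sort_values('missing', ascending=False)


summary = missing_summary(sample)
print(summary)
```

Calling `missing_summary(df)` on the loaded dataset would give the same kind of table for the real columns.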

In [None]:
# Display dataset shape (rows, columns)
print(df.shape)

# Display column data types
print(df.dtypes)

# Count missing values per column (the last expression is shown as the cell output)
df.isnull().sum()

## Data Cleaning
We will now clean the data by handling missing values and ensuring consistency in data types.
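
As a rough illustration of what making data types consistent could look like, the sketch below parses a text date column and stores a low-cardinality column as a categorical. The `date_added` column, its `Month day, year` format, and the `sample` frame are assumptions made for this example; check them against the real dataset before reusing the code.

```python
import pandas as pd

# Toy stand-in; 'date_added' and its text format are assumed, not confirmed.
sample = pd.DataFrame({
    'type': ['Movie', 'TV Show', 'Movie'],
    'date_added': ['September 25, 2021', ' August 1, 2020', None],
})

# Strip stray whitespace, then parse; entries that cannot be parsed become NaT.
sample['date_added'] = pd.to_datetime(
    sample['date_added'].str.strip(),
    format='%B %d, %Y',
    errors='coerce',
)

# A text column with only a few distinct values can be stored as a categorical.
sample['type'] = sample['type'].astype('category')

# Derive the year each title was added (missing where the date is missing).
sample['year_added'] = sample['date_added'].dt.year

print(sample.dtypes)
```

Using `errors='coerce'` keeps the conversion from failing on malformed dates, at the cost of silently turning them into missing values, so it is worth re-checking the missing-value counts afterwards.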

In [None]:
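# Fill missing director and country values with a placeholder.
# Assigning the result back avoids chained inplace calls, which recent pandas versions warn about.
df['director'] = df['director'].fillna('Unknown')
df['country'] = df['country'].fillna('Unknown')

# Drop rows that have no rating
df.dropna(subset=['rating'], inplace=True)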

# Checking again for missing values
df.isnull().sum()

## Data Analysis
We will now analyze different aspects of the dataset, including the distribution of movies and TV shows, genres, and content ratings.
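
The content-rating part of this analysis can be approached with plain frequency tables before any plotting. The following is a minimal sketch on invented data, assuming the `type` and `rating` columns used elsewhere in this notebook; `sample` and the result names are hypothetical.

```python
import pandas as pd

# Toy stand-in for the Netflix data; the values are invented for illustration.
sample = pd.DataFrame({
    'type': ['Movie', 'Movie', 'TV Show', 'Movie', 'TV Show'],
    'rating': ['TV-MA', 'PG-13', 'TV-MA', 'TV-MA', 'TV-14'],
})

# Number of titles per content rating, most frequent first.
rating_counts = sample['rating'].value_counts()

# Share of movies vs. TV shows, as percentages.
type_share = sample['type'].value_counts(normalize=True).mul(100).round(1)

# Cross-tabulation: how each rating splits between movies and TV shows.
rating_by_type = pd.crosstab(sample['rating'], sample['type'])

print(rating_counts)
print(type_share)
print(rating_by_type)
```

The same three calls applied to `df` would give the rating distribution for the full catalog, and `rating_by_type.plot(kind='bar')` is one way to visualize the split.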

In [None]:
# Plotting number of Movies vs TV Shows
plt.figure(figsize=(10,5))
sns.countplot(data=df, x='type', palette='Set1')
plt.title('Movies vs TV Shows on Netflix')
plt.show()

### Top Genres
Now let's analyze the top genres available on Netflix.

In [None]:
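# Plot top genres
# Assumes 'listed_in' holds several comma-separated genres per title, so split and
# explode the values to count each genre on its own rather than each combination.
genres = df['listed_in'].str.split(',').explode().str.strip()
genres.value_counts().head(10).plot(kind='bar', figsize=(12,6), color='skyblue')
plt.title('Top 10 Genres on Netflix')
plt.show()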

## Conclusion and Recommendations
- Movies outnumber TV shows in the Netflix catalog.
- International movies, dramas, and comedies are the most common genres in the catalog.
- Based on this analysis, we recommend increasing content for younger audiences and expanding regional content.