# 18 – Seaborn Real Dataset Projects
**Author:** Hamna Munir  
**Repository:** Python-Libraries-for-AI-ML  
**Topic:** Real-world Data Visualization using Seaborn

---
## 🎯 Goal
Apply Seaborn to **real datasets** to draw meaningful visual insights:
- Titanic survival analysis
- Restaurant tips behavior
- Airline passenger trends
- Custom project-style visualizations


In [None]:
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np

sns.set_theme(style='whitegrid')

---
## 📌 Project 1: Titanic Dataset Analysis
### Question:
**Who survived the Titanic and why?**

We analyze survival patterns based on:
- Gender
- Passenger class
- Age


In [None]:
titanic = sns.load_dataset('titanic')

In [None]:
plt.figure(figsize=(7,4))
sns.countplot(data=titanic, x='sex', hue='survived')
plt.title('Survival Count by Gender')
plt.tight_layout()

In [None]:
plt.figure(figsize=(7,4))
# 'survived' is 0/1, so the bar height (the mean) is the survival rate per class
sns.barplot(data=titanic, x='class', y='survived', errorbar=('ci', 95))
plt.title('Survival Rate by Passenger Class')
plt.tight_layout()
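
The factor list for this project includes age, but neither cell above looks at it. As a minimal sketch, assuming the `age` and `survived` columns of the seaborn Titanic frame, the hypothetical helper `survival_by_age_band` groups passengers into age bands and reports the survival rate per band. The bin edges are illustrative choices, not part of the original analysis.

```python
import pandas as pd


def survival_by_age_band(df, bins=(0, 12, 18, 40, 60, 120)):
    """Return survival rate and passenger count per age band.

    Assumes `df` has a numeric 'age' column and a 0/1 'survived' column.
    The default bin edges are illustrative, not taken from the notebook.
    """
    # Rows without a recorded age cannot be assigned to a band, so skip them.
    known_age = df.dropna(subset=['age'])
    # Left-closed bands: [0, 12), [12, 18), ...
    bands = pd.cut(known_age['age'], bins=list(bins), right=False)
    return (
        known_age.groupby(bands, observed=True)['survived']
        .agg(rate='mean', passengers='size')
    )
```

Calling `survival_by_age_band(titanic)` on the frame loaded earlier returns a small table to read alongside the plots. The `passengers` column matters here: a rate computed from a handful of passengers deserves less weight than one computed from hundreds.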

### ✅ Insight
- Female passengers survived at a much higher rate than male passengers
- First-class passengers were the most likely to survive, third-class passengers the least
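
The two insights above can be cross-checked numerically. The following is a minimal sketch, assuming the `sex`, `class`, and `survived` columns used in the plots; `survival_rate_table` is a hypothetical helper name, not part of seaborn or pandas.

```python
import pandas as pd


def survival_rate_table(df, row='sex', col='class'):
    """Cross-tabulate the survival rate for every row/column combination.

    Assumes 'survived' is coded 0/1, so its mean within a group
    is the share of that group that survived.
    """
    return df.pivot_table(
        values='survived',
        index=row,
        columns=col,
        aggfunc='mean',
        observed=True,  # only keep category combinations that occur in the data
    )
```

`survival_rate_table(titanic)` shows gender and class together, which the two separate plots cannot: it makes visible whether the class effect holds within each gender.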


---
## 📌 Project 2: Tips Dataset Analysis
### Question:
**How do bills and tips vary by day, gender, and time?**

In [None]:
tips = sns.load_dataset('tips')

In [None]:
plt.figure(figsize=(7,4))
sns.boxplot(data=tips, x='day', y='total_bill', hue='day', legend=False)
plt.title('Total Bill Distribution by Day')
plt.tight_layout()

In [None]:
plt.figure(figsize=(7,4))
sns.scatterplot(data=tips, x='total_bill', y='tip', hue='time')
plt.title('Total Bill vs Tip')
plt.tight_layout()

### ✅ Insight
- Tips tend to increase with the total bill
- Dinner bills tend to be larger than lunch bills
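
The project question also asks about gender, which the plots above do not break out. As a rough sketch, assuming the `total_bill` and `tip` columns of the tips frame, the hypothetical helper `tip_summary` aggregates bills and tips by any grouping column, such as `'sex'`, `'time'`, or `'day'`. The tip percentage is a derived column introduced here for illustration.

```python
import pandas as pd


def tip_summary(df, by):
    """Summarise bills and tips for each value of the `by` column."""
    # Tip as a percentage of the bill makes small and large parties comparable.
    with_pct = df.assign(tip_pct=100 * df['tip'] / df['total_bill'])
    return with_pct.groupby(by, observed=True).agg(
        median_bill=('total_bill', 'median'),
        mean_tip=('tip', 'mean'),
        mean_tip_pct=('tip_pct', 'mean'),
        parties=('tip', 'size'),
    )
```

For example, `tip_summary(tips, 'time')` puts a number on the lunch versus dinner comparison, and `tip_summary(tips, 'sex')` covers the gender part of the question. The strength of the bill-tip relationship itself can be read from `tips['total_bill'].corr(tips['tip'])`.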


---
## 📌 Project 3: Flights Dataset (Time Series)
### Question:
**How did airline passengers grow over time?**

In [None]:
flights = sns.load_dataset('flights')
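# 'month' is categorical with abbreviated names ('Jan', 'Feb', ...); convert it to text
flights['month'] = flights['month'].astype(str)
# An explicit format avoids relying on pandas guessing how to parse strings like '1949-Jan'
flights['date'] = pd.to_datetime(flights['year'].astype(str) + '-' + flights['month'], format='%Y-%b')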

In [None]:
plt.figure(figsize=(9,4))
sns.lineplot(data=flights, x='date', y='passengers', errorbar=None)
plt.title('Airline Passengers Over Time')
plt.tight_layout()

### ✅ Insight
- Passenger numbers rise steadily from 1949 to 1960
- A strong seasonal pattern repeats every year, with peaks in the summer months
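
Trend and seasonality can also be separated with a few group operations. This is a simplified sketch rather than a formal time-series decomposition, assuming the `year`, `month`, and `passengers` columns of the flights frame; `trend_and_seasonality` is a hypothetical helper name.

```python
import pandas as pd


def trend_and_seasonality(df):
    """Split monthly counts into a yearly trend and a seasonal profile.

    Returns (yearly_total, seasonal_index). A seasonal index above 1 means
    the month is typically busier than the average month of its year.
    """
    yearly_total = df.groupby('year')['passengers'].sum()

    # Divide each month by its own year's mean so that long-term growth
    # does not mask the within-year pattern.
    year_mean = df.groupby('year')['passengers'].transform('mean')
    seasonal_index = (
        (df['passengers'] / year_mean)
        # sort=False keeps months in order of appearance (Jan, Feb, ...)
        # instead of sorting the month strings alphabetically.
        .groupby(df['month'], observed=True, sort=False)
        .mean()
    )
    return yearly_total, seasonal_index
```

`yearly_total, seasonal_index = trend_and_seasonality(flights)` gives one series per insight: the first for the upward trend, the second for the shape of a typical year.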


---
## 📌 Project 4: Custom Visualization
### Question:
**Which numeric variables are correlated in the Tips dataset?**

In [None]:
# numeric_only=True skips the categorical columns (sex, smoker, day, time)
corr = tips.corr(numeric_only=True)

plt.figure(figsize=(6,4))
sns.heatmap(corr, annot=True, cmap='coolwarm')
plt.title('Correlation Heatmap (Tips Dataset)')
plt.tight_layout()
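
A heatmap shows every pair twice, plus a diagonal of ones. To answer the project question directly, the pairs can be ranked instead. The sketch below assumes a square correlation matrix such as `corr` above; `strongest_pairs` is a hypothetical helper name.

```python
from itertools import combinations

import pandas as pd


def strongest_pairs(corr):
    """List each variable pair once, ordered by absolute correlation."""
    # combinations() yields every unordered pair exactly once,
    # which skips the diagonal and the mirrored lower triangle.
    pairs = {
        (a, b): corr.loc[a, b]
        for a, b in combinations(corr.columns, 2)
    }
    ranked = pd.Series(pairs, dtype=float)
    # Sort by magnitude so strong negative correlations rank high too.
    return ranked.sort_values(key=lambda s: s.abs(), ascending=False)
```

For the tips data the numeric columns are `total_bill`, `tip`, and `size`, so `strongest_pairs(corr)` returns three rows, strongest relationship first.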

---
## ✅ Final Summary
- Real datasets improve **data intuition**
- Seaborn simplifies **EDA & storytelling**
- These projects mirror **real analytics tasks**

📌 Next step: convert insights into **reports or dashboards**