# 🏙️ Exploring NYC Airbnb Market Trends

**Dataset:** Simulated NYC Airbnb 2024 data
**Author:** Fr. Nduka Anthony

This notebook explores the dynamics of the NYC Airbnb market using a simulated dataset. We’ll look at price patterns, neighborhood trends, and room availability insights.

In [None]:
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np

# set_theme is the preferred interface; sns.set is an alias for it
sns.set_theme(style='whitegrid')
df = pd.read_csv('simulated_airbnb_nyc.csv')
df.head()

## 📌 Dataset Overview

In [None]:
df.info()

## 🧼 Data Cleaning

In [None]:
# Check for missing values
df.isnull().sum()
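The check above only counts missing values. Below is a minimal sketch of one way to act on that count. It assumes that `reviews_per_month` is missing only for listings with no reviews and that rows without a `price` are unusable; neither assumption is confirmed by the dataset. The `demo` frame and the `clean_listings` helper are hypothetical stand-ins so the cell runs on its own.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for df; the values are illustrative only.
demo = pd.DataFrame({
    'price': [120.0, 85.0, np.nan, 200.0],
    'number_of_reviews': [10, 0, 3, 0],
    'reviews_per_month': [0.8, np.nan, 0.2, np.nan],
})

def clean_listings(frame):
    """Return a cleaned copy: fill missing review rates, drop rows without a price."""
    out = frame.copy()
    # Assumption: a missing review rate means the listing has no reviews yet.
    out['reviews_per_month'] = out['reviews_per_month'].fillna(0)
    # Assumption: a listing without a price cannot be used in the price analysis.
    out = out.dropna(subset=['price'])
    return out

cleaned = clean_listings(demo)
print(cleaned.isnull().sum())
```

To apply the same steps to the real data, one could call `df = clean_listings(df)` after inspecting the missing-value counts.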

## 📊 Room Type Distribution

In [None]:
room_counts = df['room_type'].value_counts()
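# seaborn >= 0.13: a palette needs a hue variable; reuse x and hide the redundant legend
sns.countplot(data=df, x='room_type', hue='room_type', order=room_counts.index, palette='Set2', legend=False)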
plt.title('Room Type Distribution')
plt.xticks(rotation=15)
plt.show()

## 💰 Average Price by Neighbourhood Group

In [None]:
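# seaborn >= 0.13: a palette needs a hue variable; reuse x and hide the redundant legend
sns.barplot(data=df, x='neighbourhood_group', y='price', hue='neighbourhood_group', estimator=np.mean, palette='viridis', legend=False)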
plt.title('Average Price by Neighbourhood Group')
plt.xticks(rotation=30)
plt.ylabel('Average Price ($)')
plt.show()
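The bar chart reports means, which a handful of very expensive listings can pull upward. As a sketch, the table below puts the mean next to the median and the listing count for each group. The `demo` frame is a hypothetical stand-in with illustrative prices, not values from the dataset.

```python
import pandas as pd

# Hypothetical stand-in for df; prices are illustrative only.
demo = pd.DataFrame({
    'neighbourhood_group': ['Manhattan', 'Manhattan', 'Manhattan',
                            'Brooklyn', 'Brooklyn'],
    'price': [100, 150, 1000, 80, 120],
})

# Mean, median, and listing count per group, sorted by mean price
summary = (
    demo.groupby('neighbourhood_group')['price']
        .agg(['mean', 'median', 'count'])
        .sort_values('mean', ascending=False)
)
print(summary)
```

A large gap between `mean` and `median` within a group hints that outliers are driving the bar height.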

## 🔍 Correlation Heatmap

In [None]:
# Numeric columns to include in the correlation matrix
num_cols = [
    'price', 'minimum_nights', 'number_of_reviews',
    'reviews_per_month', 'calculated_host_listings_count', 'availability_365',
]
corr = df[num_cols].corr()
sns.heatmap(corr, annot=True, cmap='coolwarm', fmt='.2f')
plt.title('Correlation Matrix')
plt.show()
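A heatmap can be hard to scan for the strongest relationships. One possible approach, sketched here, is to keep the upper triangle of the correlation matrix and rank the column pairs by absolute correlation. The `demo` frame uses column names from this notebook with made-up values, so the numbers it prints say nothing about the real data.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for df; the values are illustrative only.
demo = pd.DataFrame({
    'calculated_host_listings_count': [1, 2, 3, 4, 5],
    'reviews_per_month': [0.2, 0.4, 0.6, 0.8, 1.1],
    'price': [250, 150, 200, 50, 100],
})

corr = demo.corr()

# Keep each pair once: mask out the diagonal and the lower triangle
upper = np.triu(np.ones(corr.shape, dtype=bool), k=1)
pairs = (
    corr.where(upper)
        .stack()
        .dropna()
        .sort_values(key=lambda s: s.abs(), ascending=False)
)
print(pairs)
```

The result is a Series indexed by column pairs, with the strongest positive or negative correlation first.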

## 📆 Availability Distribution

In [None]:
sns.histplot(data=df, x='availability_365', bins=30, color='skyblue')
plt.title('Availability Distribution (0–365 days)')
plt.xlabel('Days Available Per Year')
plt.show()

## 🌍 Geospatial Room Type Distribution

In [None]:
plt.figure(figsize=(8,6))
sns.scatterplot(data=df, x='longitude', y='latitude', hue='room_type', alpha=0.6)
plt.title('Room Types Across NYC (Simulated)')
plt.xlabel('Longitude')
plt.ylabel('Latitude')
plt.legend(loc='upper right')
plt.show()

## 🧠 Key Discoveries
- Entire homes tend to dominate in Brooklyn and Manhattan.
- Shared rooms are less common than the other room types and are concentrated in fewer areas.
- Average prices are higher in Manhattan than elsewhere; Staten Island is the most affordable.
- Availability is spread widely across the 0–365 day range; some hosts list year-round.
- The number of listings per host correlates positively with reviews per month, which suggests that hosts with more listings are more active.
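
The scatterplot gives only a visual impression of which room type dominates each borough. A minimal sketch of how to check this numerically is a row-normalized cross-tabulation. The `demo` frame is a hypothetical stand-in, and the room type labels (`'Entire home/apt'`, `'Private room'`, `'Shared room'`) are assumed rather than taken from the dataset.

```python
import pandas as pd

# Hypothetical stand-in for df; labels and counts are illustrative only.
demo = pd.DataFrame({
    'neighbourhood_group': ['Manhattan', 'Manhattan', 'Manhattan',
                            'Brooklyn', 'Brooklyn', 'Brooklyn',
                            'Queens', 'Queens', 'Queens'],
    'room_type': ['Entire home/apt', 'Entire home/apt', 'Private room',
                  'Entire home/apt', 'Entire home/apt', 'Private room',
                  'Private room', 'Private room', 'Shared room'],
})

# Share of each room type within each neighbourhood group (rows sum to 1)
shares = pd.crosstab(demo['neighbourhood_group'], demo['room_type'],
                     normalize='index')

# Most common room type per neighbourhood group
dominant = shares.idxmax(axis=1)
print(shares.round(2))
print(dominant)
```

Running the same two steps on `df` would show whether entire homes really hold the largest share in Brooklyn and Manhattan.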