# Exploratory Data Analysis (EDA)

This notebook performs an exploratory data analysis (EDA) of the housing dataset loaded from `../data/raw/housing.csv`. The steps are:

1. Viewing the first few rows, the column types and non-null counts, and summary statistics.
2. Visualizing pairwise correlations between numeric columns using a heatmap.
3. Creating pairwise scatterplots to explore relationships between selected variables.

In [None]:
# Import necessary libraries
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Load the dataset
data = pd.read_csv('../data/raw/housing.csv')

# Display the first few rows of the dataset
data.head()

In [None]:
# Display dataset information
data.info()

In [None]:
# Display summary statistics
data.describe()

In [None]:
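# Generate a correlation heatmap of the numeric columns.
# numeric_only=True (pandas >= 1.5) skips non-numeric columns; since pandas 2.0,
# corr() raises an error if any are present.
plt.figure(figsize=(10, 8))
sns.heatmap(data.corr(numeric_only=True), annot=True, cmap='coolwarm')
plt.title('Correlation Heatmap')
plt.show()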
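
Each cell of the heatmap is a Pearson correlation coefficient, which is the default method of `DataFrame.corr`. As a rough illustration of what those numbers mean, the sketch below computes the coefficient by hand in plain Python. The helper name `pearson_r` and the toy values are assumptions made for this example; they are not taken from `housing.csv`.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (covariance, up to a constant factor)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (variances, up to the same factor)
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy values for illustration only
rooms = [2, 3, 4, 5, 6]
price = [100, 150, 200, 250, 300]  # increases linearly with rooms
age = [50, 40, 30, 20, 10]         # decreases linearly with rooms

r_up = pearson_r(rooms, price)     # close to +1: strong positive linear relationship
r_down = pearson_r(rooms, age)     # close to -1: strong negative linear relationship
```

Values near +1 or -1 appear at the two ends of the `coolwarm` color scale in the heatmap, while values near 0 indicate little linear relationship.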

In [None]:
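# Create scatterplots for selected variable pairs.
# 'Variable1', 'Variable2', 'Variable3' are placeholders: replace them with
# column names that exist in the dataset (see data.columns), or pairplot will fail.
sns.pairplot(data, vars=['Variable1', 'Variable2', 'Variable3'], diag_kind='kde')
plt.show()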
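
The variable names passed to `pairplot` have to be chosen by hand. One possible way to choose them, sketched here under assumptions, is to pick the numeric columns most strongly correlated with a column of interest. The helper `top_correlated`, the toy frame, and its column names are all hypothetical and stand in for the real dataset. `numeric_only=True` requires pandas 1.5 or later.

```python
import pandas as pd

def top_correlated(df, target, k=3):
    """Return the k numeric columns most correlated with `target`,
    ranked by absolute Pearson correlation."""
    corr = df.corr(numeric_only=True)[target].drop(target)
    return corr.abs().sort_values(ascending=False).head(k).index.tolist()

# Hypothetical toy data; the real column names come from data.columns
toy = pd.DataFrame({
    "price": [100, 150, 200, 250, 300],
    "rooms": [2, 3, 4, 5, 6],
    "age": [50, 41, 29, 22, 10],
    "noise": [3, 1, 4, 1, 5],
    "region": ["a", "b", "a", "b", "a"],  # non-numeric, ignored by corr
})

selected = top_correlated(toy, "price", k=2)
```

The resulting list could then be passed on, for example as `sns.pairplot(toy, vars=["price"] + selected, diag_kind='kde')`. Ranking by absolute correlation only captures linear relationships, so it is a starting point rather than a substitute for looking at the plots.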