# 🛒 Day 1 - Superstore Sales Data Analysis
## ✅ Step 1: Load and Explore the Dataset
Let's begin by importing the necessary libraries and loading the dataset to understand its structure.

In [None]:
import pandas as pd

# Load the dataset
df = pd.read_csv('../data/SuperstoreSales.csv', encoding='ISO-8859-1')
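
The `encoding='ISO-8859-1'` argument matters when a file contains accented characters stored as single bytes. A minimal sketch of that behavior, using a made-up two-row sample (the `raw` bytes and the names in it are hypothetical, not taken from the Superstore file):

```python
import io

import pandas as pd

# Hypothetical two-row sample standing in for the real CSV file.
raw = "Order ID,Customer Name\nCA-1,José\nCA-2,Zoë\n".encode("ISO-8859-1")

# Reading with the matching encoding decodes the accented characters correctly.
sample = pd.read_csv(io.BytesIO(raw), encoding="ISO-8859-1")

print(sample.shape)
print(sample["Customer Name"].tolist())
```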

## 🔍 Step 2: View Basic Information
`df.info()` reports the number of rows and columns, each column's data type and non-null count, and the DataFrame's memory usage.

In [None]:
# Dataset info
df.info()
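
If the default memory figure looks too small for a frame with many text columns, `info` can be asked for a deeper estimate. The sketch below runs on a small hypothetical `toy` frame rather than the real dataset:

```python
import pandas as pd

# Hypothetical toy frame; column names and values are illustrative only.
toy = pd.DataFrame(
    {
        "category": ["Furniture", "Technology", "Furniture"],
        "sales": [261.96, 731.94, 14.62],
    }
)

# Row and column counts without printing the full report.
n_rows, n_cols = toy.shape

# memory_usage="deep" also measures the contents of text columns.
toy.info(memory_usage="deep")

# The same deep estimate as a single number of bytes.
deep_bytes = int(toy.memory_usage(deep=True).sum())
print(n_rows, n_cols, deep_bytes)
```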

## 🧾 Step 3: Preview the First Few Rows
This gives us an idea of how the data is structured and what kind of values are present.

In [None]:
# Preview the data
df.head()

## 🧹 Step 4: Check for Duplicates
Let's count the rows that exactly repeat an earlier row in every column, since those may need to be removed.

In [None]:
# Check for duplicates
df.duplicated().sum()
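
If the count is non-zero, a natural follow-up is to look at the repeated rows and then drop them. A minimal sketch on a hypothetical `toy` frame (the real dataset may contain no duplicates at all):

```python
import pandas as pd

# Hypothetical toy frame with one repeated row.
toy = pd.DataFrame(
    {
        "order_id": ["A1", "A2", "A1"],
        "sales": [10.0, 20.0, 10.0],
    }
)

# Rows that repeat an earlier row (the first occurrence is not counted).
n_dupes = int(toy.duplicated().sum())

# keep=False marks every copy, which is handy for inspecting them side by side.
all_copies = toy[toy.duplicated(keep=False)]

# Keep the first occurrence of each row and renumber the index.
deduped = toy.drop_duplicates().reset_index(drop=True)

print(n_dupes, len(all_copies), len(deduped))
```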

## 📏 Step 5: Check for Missing Values
We need to identify missing data so we can decide how to handle it in the cleaning phase.

In [None]:
# Check for missing values
df.isnull().sum()
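
Raw counts are easier to judge next to the share of rows they represent. The sketch below builds a small report of only the columns with gaps; the `toy` frame and its values are hypothetical:

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame with gaps in one column.
toy = pd.DataFrame(
    {
        "postal_code": [42420.0, np.nan, 90036.0, np.nan],
        "city": ["Henderson", "Burlington", "Los Angeles", "Burlington"],
    }
)

# Count of missing values and share of rows affected, per column.
report = pd.DataFrame(
    {
        "missing": toy.isnull().sum(),
        "percent": toy.isnull().mean() * 100,
    }
)

# Keep only the columns that actually have missing values.
report = report[report["missing"] > 0]
print(report)
```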

## ✍️ Step 6: Rename Columns
We'll standardize the column names by trimming surrounding whitespace, replacing spaces with underscores, and lowercasing everything, so columns are easier to reference in later steps.

In [None]:
# Normalize column names: strip whitespace, spaces -> underscores, lowercase
df.columns = df.columns.str.strip().str.replace(' ', '_').str.lower()
df.columns
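
Note that the transformation above only replaces spaces, so a name containing a hyphen keeps it. The sketch below compares it with a regex-based variant that maps any run of non-alphanumeric characters to one underscore. The column names here are hypothetical examples, and the variant is an alternative to consider, not what this notebook uses:

```python
import pandas as pd

# Hypothetical column names, including padding and a hyphen.
toy = pd.DataFrame(columns=[" Order ID", "Sub-Category", "Sales "])

# Same transformation as in the notebook: the hyphen survives.
original = toy.columns.str.strip().str.replace(" ", "_").str.lower()

# Variant: lowercase, collapse every non-alphanumeric run into "_",
# then trim any underscores left at the ends.
extended = (
    toy.columns.str.strip()
    .str.lower()
    .str.replace(r"[^0-9a-z]+", "_", regex=True)
    .str.strip("_")
)

print(list(original))
print(list(extended))
```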

## 📊 Step 7: Understand Unique Values in Each Column
We'll count the distinct entries in each text (object-dtype) column, which shows which fields are categorical and helps with EDA and feature engineering.

In [None]:
# Check unique values for object columns
for col in df.select_dtypes(include='object').columns:
    print(f"{col}: {df[col].nunique()} unique values")
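
The number of unique values says how many categories exist, not how rows are spread across them. A possible next step is `value_counts`, sketched here on a hypothetical `toy` frame. The sketch selects columns with `exclude="number"` instead of `include='object'`, an assumption made so it does not depend on which dtype pandas uses for text:

```python
import pandas as pd

# Hypothetical toy frame; values are illustrative only.
toy = pd.DataFrame(
    {
        "segment": ["Consumer", "Corporate", "Consumer", "Home Office"],
        "region": ["West", "East", "West", "West"],
        "sales": [1.0, 2.0, 3.0, 4.0],
    }
)

# All non-numeric columns, in their original order.
text_cols = toy.select_dtypes(exclude="number").columns

# Frequency of each distinct entry, most common first.
counts = {col: toy[col].value_counts() for col in text_cols}

for col, freq in counts.items():
    print(f"{col}: {freq.to_dict()}")
```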

## 📦 Step 8: Save Cleaned Dataset for Future Use
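We'll save this version, which differs from the raw file only in its standardized column names, so we don't need to repeat that step in upcoming days. Duplicates and missing values have been inspected but not yet removed.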

In [None]:
# Save cleaned dataset
df.to_csv('../data/superstore_cleaned.csv', index=False)
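
A quick way to confirm a save worked is to read the file back and compare it with the frame in memory. The sketch below does this in a temporary directory with a hypothetical `toy` frame and file name. `to_csv` writes UTF-8 by default, so the reload needs no `encoding` argument:

```python
import os
import tempfile

import pandas as pd

# Hypothetical toy frame standing in for the renamed dataset.
toy = pd.DataFrame({"order_id": ["A1", "A2"], "sales": [10.5, 20.0]})

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "toy_cleaned.csv")  # hypothetical file name

    # index=False keeps the row index out of the file.
    toy.to_csv(path, index=False)
    reloaded = pd.read_csv(path)

# The round trip should preserve column names and shape.
same_columns = list(reloaded.columns) == list(toy.columns)
same_shape = reloaded.shape == toy.shape
print(same_columns, same_shape)
```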

## ✅ Summary of Day 1
- Imported and loaded the Superstore dataset
- Inspected structure, null values, and duplicates
- Renamed columns for ease of use
- Saved the renamed dataset as `superstore_cleaned.csv` for future analysis

🚀 Ready to move on to Day 2: Exploratory Data Analysis!