# STEP 1 — Title & Objective

# Video Game Sales Data Cleaning & Analysis

This notebook cleans, explores, and prepares video game sales data for Tableau dashboards.

### Goals
- Ensure data quality (missing values, duplicate rows, consistent column names)
- Perform exploratory data analysis (EDA)
- Create insight-ready features
- Export a Tableau-ready dataset



## STEP 2 — Import Libraries

In [None]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

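# Show every column when a DataFrame is displayed
pd.set_option("display.max_columns", None)
# set_theme is the current name for the older sns.set alias (seaborn >= 0.11)
sns.set_theme(style="whitegrid")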


## STEP 3 — Dataset Path

In [None]:
data_path = "../data/video_game_sales.csv"


## STEP 4 — Load Dataset

In [None]:
df = pd.read_csv(data_path)
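
Because `data_path` is relative, the load depends on the notebook's working directory. A minimal sketch of a guarded load follows, assuming a hypothetical helper named `load_csv_checked` (it is not part of pandas or of this notebook) that reports the resolved absolute path when the file is missing:

```python
from pathlib import Path

import pandas as pd


def load_csv_checked(path):
    """Hypothetical helper: read a CSV, failing early with the resolved path."""
    csv_path = Path(path)
    if not csv_path.is_file():
        # Show the absolute path so a wrong working directory is easy to spot.
        raise FileNotFoundError(f"Dataset not found: {csv_path.resolve()}")
    return pd.read_csv(csv_path)


# In this notebook the call would look like: df = load_csv_checked(data_path)
```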

## STEP 5 — Preview Rows

In [None]:
df.head()


## STEP 6 — Tail Rows

In [None]:
df.tail()


## STEP 7 — Dataset Shape

In [None]:
df.shape


## STEP 8 — Column Names

In [None]:
df.columns


## STEP 9 — Dataset Info

In [None]:
df.info()


## STEP 10 — Summary Statistics

In [None]:
df.describe()


## STEP 11 — Missing Values Count

In [None]:
df.isnull().sum()


## STEP 12 — Missing Values Percentage

In [None]:
(df.isnull().mean() * 100).round(2)
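
The counts from Step 11 and the percentages from Step 12 can also be read side by side. The sketch below runs on a toy frame; the column names `Name`, `Year`, and `Publisher` are illustrative assumptions, not taken from the actual dataset.

```python
import numpy as np
import pandas as pd

# Toy frame standing in for the sales data (column names are assumptions).
toy = pd.DataFrame({
    "Name": ["A", "B", "C", "D"],
    "Year": [2006.0, np.nan, 2008.0, np.nan],
    "Publisher": ["X", "Y", None, "Z"],
})

# One row per column: how many values are missing and what share that is.
missing_summary = pd.DataFrame({
    "missing_count": toy.isnull().sum(),
    "missing_pct": (toy.isnull().mean() * 100).round(2),
}).sort_values("missing_pct", ascending=False)

print(missing_summary)
```

Sorting by `missing_pct` puts the columns that would cost the most rows under `dropna()` at the top.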


## STEP 13 — Drop Missing Rows

In [None]:
df = df.dropna()  # removes every row that has at least one missing value
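
Calling `dropna()` with no arguments discards a row if any single column is missing. Where only some columns are essential, the `subset` parameter keeps more data. The comparison below is a sketch on a toy frame with assumed column names; which columns count as essential for the real dataset is a judgment call this notebook does not state.

```python
import numpy as np
import pandas as pd

# Toy frame (column names are assumptions for illustration).
toy = pd.DataFrame({
    "Name": ["A", "B", "C"],
    "Year": [2006.0, np.nan, 2008.0],
    "Publisher": ["X", "Y", None],
})

# Default behaviour: a row is dropped if ANY column is missing.
dropped_any = toy.dropna()

# Alternative: only require that Year is present.
dropped_subset = toy.dropna(subset=["Year"])

print(len(toy), len(dropped_any), len(dropped_subset))
```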


## STEP 14 — Confirm Missing Values Removed

In [None]:
df.isnull().sum()  # every count should now be 0


## STEP 15 — Duplicate Records Count

In [None]:
df.duplicated().sum()

## STEP 16 — Remove Duplicates

In [None]:
df = df.drop_duplicates()
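
`duplicated()` and `drop_duplicates()` compare entire rows by default, so two records for the same game that differ in a single value are not flagged. The sketch below contrasts full-row duplicates with duplicates on a key; the key columns `Name` and `Platform` are assumptions for illustration only.

```python
import pandas as pd

# Toy frame (column names are assumptions for illustration).
toy = pd.DataFrame({
    "Name": ["A", "A", "A", "B"],
    "Platform": ["Wii", "Wii", "Wii", "Wii"],
    "Global_Sales": [1.0, 1.0, 1.2, 2.0],
})

# Full-row duplicates: only the second row repeats the first exactly.
full_dupes = toy.duplicated().sum()

# Key-based duplicates: the second and third rows repeat the (Name, Platform) key.
key_dupes = toy.duplicated(subset=["Name", "Platform"]).sum()

# Same call as in the notebook; the first occurrence of each row is kept.
deduped = toy.drop_duplicates()

print(full_dupes, key_dupes, len(deduped))
```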


## STEP 17 — Standardize Column Names

In [None]:
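df.columns = (
    df.columns
    .str.lower()                          # consistent casing
    .str.strip()                          # remove leading/trailing whitespace
    .str.replace(" ", "_", regex=False)   # literal space -> underscore
)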
df.columns
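
The order of the string operations matters: stripping has to happen before spaces are replaced, otherwise padding spaces become underscores. A small sketch on made-up column names (assumptions, not the dataset's real headers):

```python
import pandas as pd

# Made-up headers with padding and an inner space.
raw_columns = pd.Index([" Name ", "Global Sales", "NA_Sales"])

# Same chain as in the notebook: lower, strip, then replace.
clean = (
    raw_columns
    .str.lower()
    .str.strip()
    .str.replace(" ", "_", regex=False)
)

# Replacing before stripping turns the padding spaces into underscores.
wrong_order = (
    raw_columns
    .str.lower()
    .str.replace(" ", "_", regex=False)
    .str.strip()
)

print(list(clean))        # ['name', 'global_sales', 'na_sales']
print(list(wrong_order))  # ['_name_', 'global_sales', 'na_sales']
```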


## STEP 18 — Inspect Data Types

In [None]:
df.dtypes
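
One thing worth checking in the `dtypes` output: a missing value in an otherwise integer column makes pandas store it as `float64`, and dropping the missing rows does not change the dtype back. The sketch below shows this on a toy column named `Year` (an assumed name) and converts it explicitly.

```python
import numpy as np
import pandas as pd

# Toy column with one gap (the name "Year" is an assumption).
toy = pd.DataFrame({"Year": [2006, np.nan, 2008]})
dtype_with_gap = toy["Year"].dtype  # float64, because NaN is a float

# Dropping the gap keeps float64; the cast to integer has to be explicit.
dropped = toy.dropna()
dtype_after_drop = dropped["Year"].dtype
converted = dropped.astype({"Year": "int64"})

print(dtype_with_gap, dtype_after_drop, converted["Year"].dtype)
```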
