# 💾 ETL Phase 3: Load

## 🎯 Objective
In this notebook, we load the transformed datasets into **SQLite** databases so they can be queried with SQL and consumed by analytics tools.

We:
- Import the transformed CSV files (full and incremental)
- Write each one to its own SQLite database as an `orders` table
- Preview the first five rows of each table to verify the load
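
A five-row preview confirms that a table exists and looks sensible, but not that every row arrived. As a minimal sketch of a stricter check, the helper below compares a table's row count with an expected value. The name `verify_row_count` and the demo rows are assumptions made for illustration; they are not part of the original notebook.

```python
import sqlite3


def verify_row_count(conn: sqlite3.Connection, table: str, expected_rows: int) -> int:
    """Return the table's row count, raising if it differs from expected_rows."""
    # Table names cannot be bound as SQL parameters, so quote the identifier.
    quoted = '"' + table.replace('"', '""') + '"'
    (actual_rows,) = conn.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()
    if actual_rows != expected_rows:
        raise ValueError(f"{table}: expected {expected_rows} rows, found {actual_rows}")
    return actual_rows


# Demo on an in-memory database with made-up rows (not the notebook's data).
demo_conn = sqlite3.connect(":memory:")
demo_conn.execute("CREATE TABLE orders (order_id INTEGER, quantity INTEGER)")
demo_conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 2), (2, 1), (3, 5)])
demo_count = verify_row_count(demo_conn, "orders", expected_rows=3)
demo_conn.close()
```

In this notebook, the same check could be run as `verify_row_count(full_conn, "orders", len(full_df))` after the `to_sql` call and before the connection is closed.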


In [5]:
import pandas as pd
import sqlite3
import os

# Ensure loaded folder exists
os.makedirs("loaded", exist_ok=True)

# Load transformed data
full_df = pd.read_csv("transformed/transformed_full.csv")
incremental_df = pd.read_csv("transformed/transformed_incremental.csv")

# Connect to SQLite databases
full_conn = sqlite3.connect("loaded/full_data.db")
incremental_conn = sqlite3.connect("loaded/incremental_data.db")

# Load DataFrames into SQLite
full_df.to_sql("orders", full_conn, if_exists="replace", index=False)
incremental_df.to_sql("orders", incremental_conn, if_exists="replace", index=False)

# Preview 5 rows using SQL
print(" Preview from full_data.db:")
print(pd.read_sql("SELECT * FROM orders LIMIT 5", full_conn))

print("\n Preview from incremental_data.db:")
print(pd.read_sql("SELECT * FROM orders LIMIT 5", incremental_conn))

# Close connections
full_conn.close()
incremental_conn.close()

 Preview from full_data.db:
   order_id customer_name   region   product  quantity  unit_price  \
0         1         Diana    South    Tablet         1       500.0   
1         2           Eve    North  Computer         1       250.0   
2         3       Charlie  Unknown  Computer         2       250.0   
3         4           Eve     West  Computer         2       750.0   
4         5           Eve    South    Tablet         3       750.0   

   total_price price_category  order_date  
0        500.0      Mid-Range  2024-01-20  
1        250.0         Budget  2024-04-29  
2        500.0         Budget  2024-01-08  
3       1500.0         Budget  2024-01-07  
4       2250.0        Premium  2024-03-07  

 Preview from incremental_data.db:
   order_id customer_name   region   product  quantity  unit_price  \
0       101         Alice  Unknown  Computer         1       900.0   
1       102     Anonymous  Unknown  Computer         1       300.0   
2       103     Anonymous  Unknown  Compu