This project performs Exploratory Data Analysis (EDA) on a Brazilian e-commerce dataset using SQL queries.
The dataset consists of multiple interconnected tables (customers, orders, payments, products, sellers, etc.), and the objective is to uncover business insights and customer behavior patterns.
The project uses tables from the SQL_Target
database:
- customers β customer demographics (city, state, ID mapping).
- geolocation β geographical details (lat/long, cities, states).
- order_items β details about purchased products, prices, freight values.
- order_reviews β customer reviews and ratings.
- orders_1 β orders placed, timestamps, delivery details.
- payments β payment methods and amounts.
- products β product catalog with attributes.
- sellers β seller details and locations.
Key insights generated through SQL queries include:
-
Data Profiling
- Data types of all columns in the
customers
table. - Order purchase time range.
- Data types of all columns in the
-
Customer Insights
- Cities and states of customers ordering within specific periods.
- Customer distribution across Brazilian states.
-
Order Trends
- Yearly and monthly order volume.
- Growth trend of e-commerce orders.
- Orders placed by time of day (dawn, morning, afternoon, night).
-
Payments & Revenue
- % increase in order cost (2017 vs. 2018).
- Orders by payment type (monthly breakdown).
- Orders by number of installments.
-
Delivery & Logistics
- Average delivery time across states.
- Top 5 states with highest & lowest freight values.
- Gap between actual vs. estimated delivery times.
-
Price & Freight Analysis
- Mean and total values of product prices and freight charges by customer state.
- SQL (Google BigQuery / any SQL engine)
- Dataset: Brazilian e-commerce data
- Environment: Jupyter Notebook, BigQuery console, or other SQL IDE