Transforming raw customer data into strategic business intelligence β this project mirrors the complete workflow of a professional data analyst, from messy raw data to boardroom-ready insights.
This end-to-end portfolio project demonstrates mastery across the full analytics stack: data wrangling in Python, business querying in SQL, and executive dashboards in Power BI β capped with a professional report and stakeholder presentation.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PROJECT PIPELINE β
ββββββββββββββββ¬βββββββββββββββ¬βββββββββββββββ¬βββββββββββββββββββββ€
β π PYTHON β ποΈ SQL β π POWER BI β π REPORTING β
β β β β β
β Data Import β DB Simulate β Dashboard β Insight Report β
β Exploration β Segments β KPI Cards β Presentation β
β Cleaning β Loyalty β Trend Lines β Recommendations β
β Modeling β Purchase β Filters β β
β EDA β Drivers β Drill-down β β
ββββββββββββββββ΄βββββββββββββββ΄βββββββββββββββ΄βββββββββββββββββββββ
Before you begin, ensure you have the following installed:
Python 3.8+ | MySQL / PostgreSQL / MS SQL Server | Power BI Desktop | Jupyter Notebookgit clone https://github.com/sorol25/SQLPythonPowerBI-customer-trendAnalysis.git
cd SQLPythonPowerBI-customer-trendAnalysisOpen the Jupyter Notebook:
π Customer_Shopping_Behavior_Analysis.ipynb
This notebook covers:
- Data Import β Load raw CSV/Excel data
- Data Exploration β Shape, types, null checks, distributions
- Data Cleaning β Handle missing values, outliers, encoding
- SQL Connection β Bridge Python β Database via SQLAlchemy / pyodbc
-- Create your database
CREATE DATABASE customer_behavior_db;
-- Then run the provided query file:
-- π customer_behavior_sql_queries.sqlQueries answer key business questions around:
- π― Customer Segmentation (RFM Analysis)
- π Loyalty Tier Breakdowns
- π Top Purchase Drivers & Product Categories
- π Seasonal & Temporal Trends
π customer_behavior_dashboard.pbix
- Open the
.pbixfile in Power BI Desktop - Update your SQL Server connection string
- Refresh data and explore:
- KPI Summary Cards
- Revenue by Segment Visuals
- Customer Cohort Heatmaps
- Dynamic Slicers & Drill-throughs
- π Write your Project Report β findings, methodology, and strategic recommendations
- π¨ Build your Presentation Deck using Gamma AI for a polished stakeholder pitch
SQLPythonPowerBI-customer-trendAnalysis/
β
βββ π Customer_Shopping_Behavior_Analysis.ipynb # Python EDA & Data Prep
βββ ποΈ customer_behavior_sql_queries.sql # SQL Business Queries
βββ π customer_behavior_dashboard.pbix # Power BI Dashboard
βββ π data/
β βββ raw_customer_data.csv # Source Dataset
βββ π project_report.pdf # Final Report
βββ π README.md
| # | Business Question | Tool Used |
|---|---|---|
| 1 | Which customer segments drive the most revenue? | SQL + Power BI |
| 2 | What factors most influence repeat purchases? | Python + SQL |
| 3 | How does customer loyalty correlate with spend? | SQL |
| 4 | What are the peak buying seasons and categories? | SQL + Power BI |
| 5 | Which demographics are most valuable long-term? | Python + Power BI |
MIT License β Copyright (c) 2026 Yeamine Alam Sorol
Permission is hereby granted, free of chargeβ fork it, star it.
Yeamine Alam Sorol Data Analyst & Web Developer
I break down complex data topics into simple, practical content that actually helps you land a job. I regularly share around SQL, analytics workflows, portfolio projects, and career growth.