This repository highlights a complete end-to-end implementation of a modern analytics solution using PostgreSQL — from raw data ingestion to refined insights in Tableau, all built through a structured, layered Medallion-Style architecture.
Final Dashboard (
The project is designed using the Medallion Architecture model, featuring three distinct layers for scalable data transformation:
Stores raw, unprocessed data directly from CSV files. Data is loaded as-is into PostgreSQL for traceability.
In this layer, data undergoes essential transformations: cleaning, validation, and standardization to prep for analysis.
This layer contains business-ready, analysis-optimized data. Data is modeled using a Star Schema to support robust analytics and reporting.
This project demonstrates practical skills in:
- PostgreSQL-based Data Engineering
- ETL Pipelines (Extraction, Transformation, Load)
- Dimensional Modeling (Star Schema)
- SQL Reporting and dashboard support
- Data Visualization
Key workflows include:
- Designing scalable data pipelines
- Building analytical data models
- Creating SQL reports for business insights
- Delivering insights through Tableau (In Progress)
All resources and tools used are open and accessible:
- PostgreSQL – Open-source database platform for storage & transformations
- PGAdmin - Interface to work with Postgre Databases
- CSV Datasets – ERP & CRM data used as source systems
- pgAdmin / Terminal – GUI clients for PostgreSQL
- Draw.io – Data architecture and modeling diagrams
- GitHub – Version control and collaboration
- Notion – Project task management and documentation | SQL Data Warehouse Project
- Tableau - Data visualization to derive insights
Goal: Create a PostgreSQL data warehouse for consolidated sales analytics
- Import ERP & CRM data from CSV files
- Clean, normalize, and unify data into one cohesive model
- Focus on current data only (no historical tracking required)
- Model data using dimensions and facts for efficient querying
Goal: Deliver meaningful insights using SQL-based queries
Analyze business-critical metrics like:
- Customer behavior trends
- Product-level performance
- Sales growth and regional analysis
Goal: Summarize insights using KPIs and Graphs in an Interactive Dashboard using Tableau
Visualize Metrics liks:
- Total Sales
- Top Products
- Top Customers
- Top Regions
data-warehouse-project/
│
├── Datasets/ # Raw datasets used for the project (ERP and CRM data)
│
├── Documents/ # Project documentation and architecture details
│ ├── etl.drawio # Draw.io file shows all different techniquies and methods of ETL
│ ├── data_architecture.drawio # Draw.io file shows the project's architecture
│ ├── data_catalog.md # Catalog of datasets, including field descriptions and metadata
│ ├── data_flow.drawio # Draw.io file for the data flow diagram
│ ├── data_models.drawio # Draw.io file for data models (star schema)
│ ├── naming-conventions.md # Consistent naming guidelines for tables, columns, and files
│
├── Scripts/ # SQL scripts for ETL and transformations
│ ├── bronze/ # Scripts for extracting and loading raw data
│ ├── silver/ # Scripts for cleaning and transforming data
│ ├── gold/ # Scripts for creating analytical models
│
├── Tests/ # Test scripts and quality files
│
├── README.md # Project overview and instructions
├── LICENSE # License information for the repository
├── .gitignore # Files and directories to be ignored by Git
I am a data enthusiast with a strong interest in business intelligence, data analytics, and visualization. I enjoy transforming raw data into actionable insights using tools like SQL, Tableau, and Python. With hands-on experience in building data pipelines and dashboards, I aim to bridge the gap between data and decision-making.
- 📞 Phone: +1 (437) 661-3674
- 💻 GitHub
- 📊 Tableau Public
A special thanks to @DataWithBaraa for inspiring me to work on this project :)