Welcome to my SQL Server Data Warehouse Project Repository!
This Portfolio project is my first attempt of Building a modern data warehouse with SQL Sever, including ETL processes, data modeling and analytics.
As the lines of Data professionals continue to be blurred, I'm Very exited to demonstrate the industry best practices and explore the world of Data Engeneering.
This project involes:
- Data Architecture: Designing A Modern Data Warehouse Useing the Medallion Architecture Bronze, Silver, and Gold layers.
- ETL Pipelines: Etracting, Transforming and Loading data from source system into warehose.
Develop a modern data warehouse using SQL Server to consolidate sales data, enabling analytical reporting and informed decision-making.
Specifications:
. Data Sources: Import data from two source systems (ERP and CRM) provided as CSV files.
. Data Quality: Cleanse and resolve data quality issues prior to analysis.
. Integration: Combine both sources into a single, user-friendly data model designed for analytical queries.
. Scope: Focus on the latest dataset only; historization of data is not required.
. Documentation: Provide clear documentation of the data model to support both business stakeholders and analytics teams.
The Structure of the the project follows the Medallion Architecture Bronze, Silver, and Gold layers.
- Bronze Layer: Stores raw data as-is from the source systems. Data is ingested from CSV Files into SQL Server Database.
- Silver Layer: This layer includes data cleansing, standardization, and normalization processes to prepare data for analysis.
- Gold Layer: Houses business-ready data modeled into a star schema required for reporting and analytics.
.drawio.png)