Data Integration Pipelines for NYC Payroll Data Analytics

Project Overview

The City of New York would like to develop a Data Analytics platform on Azure Synapse Analytics to accomplish two primary objectives:

  1. Analyze how the City's financial resources are allocated and how much of the City's budget is being devoted to overtime.

  2. Make the data available to the interested public to show how the City's budget is being spent on salary and overtime pay for all municipal employees.

In this project, as a Data Engineer, I created high-quality data pipelines that are dynamic, can be automated, and can be monitored for efficient operation. My team also includes the City's Quality Assurance experts, who test the pipelines to find errors and improve overall data quality.

The source data resides in Azure Data Lake and needs to be processed in an NYC data warehouse in Azure Synapse Analytics. The source datasets consist of CSV files with employee master data and monthly payroll data entered by various City agencies.
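
To make the first objective concrete, here is a minimal local sketch of the kind of calculation the warehouse is meant to support: joining employee master data to payroll records and computing the share of pay that goes to overtime per agency. It is not the pipeline itself, which runs in Azure. The column names (`EmployeeID`, `AgencyName`, `RegularGrossPaid`, `TotalOTPaid`), the sample rows, and the function `overtime_share_by_agency` are all assumptions made for illustration; the real files may use a different layout.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data standing in for the CSV files in the data lake.
# Column names and values are illustrative assumptions, not the real schema.
EMPLOYEES_CSV = """EmployeeID,AgencyName
1,Parks
2,Parks
3,Sanitation
"""

PAYROLL_CSV = """EmployeeID,RegularGrossPaid,TotalOTPaid
1,4000.00,500.00
2,3000.00,0.00
3,5000.00,1500.00
"""


def overtime_share_by_agency(employees_csv, payroll_csv):
    """Return {agency: overtime / (regular + overtime)} for each agency."""
    # Build a lookup from employee ID to agency using the master data.
    agency_of = {
        row["EmployeeID"]: row["AgencyName"]
        for row in csv.DictReader(io.StringIO(employees_csv))
    }

    regular = defaultdict(float)
    overtime = defaultdict(float)
    for row in csv.DictReader(io.StringIO(payroll_csv)):
        # Payroll rows without a matching employee are grouped as UNKNOWN.
        agency = agency_of.get(row["EmployeeID"], "UNKNOWN")
        regular[agency] += float(row["RegularGrossPaid"])
        overtime[agency] += float(row["TotalOTPaid"])

    shares = {}
    for agency in regular:
        total = regular[agency] + overtime[agency]
        # Guard against agencies with no recorded pay.
        shares[agency] = overtime[agency] / total if total else 0.0
    return shares


shares = overtime_share_by_agency(EMPLOYEES_CSV, PAYROLL_CSV)
for agency, share in sorted(shares.items()):
    print(f"{agency}: {share:.1%} of pay is overtime")
```

In the actual platform the same aggregation would more likely be expressed as a query over the warehouse tables once the pipelines have loaded them.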
