You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
A simple pipeline infrastructure with ETL pipeline contained in a Docker environment on Apache Airflow for orchestration and Postgres for data warehousing
A Data Warehousing project for retail sales using dimension modelling best practices with SCD type 2 on AWS Redshift. Utilizing AWS Lambda, Glue Workflows and Python Shell jobs to create and automate an ELT pipeline where batch data coming into S3 is loaded onto Redshift and necessary transformations are performed to meet requirements.
This repository contains code for building a Data Warehouse from scratch. I started with the elicitation process, then used functional dependencies for conversion to GOM4DW schema, followed by conversion to Star Schema to find out different facts and dimensions and lastly I implemented the ETL process. I have used HTML and flask to provide a use…
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
This project was done at the University of Pisa in collaboration with my colleagues Federica Guiducci and Valentina Olivotto under supervision of professors Anna Monreale and Roberto Pellungrini. It is a Data Warehousing project featuring the use of Analytic SQL and various other tools such as Visual Studio in MS Azure, Microstrategy and Power BI..
Effortlessly analyze YouTube data using Python, SQL, MongoDB, and Streamlit. This repository provides a user-friendly application for retrieving, saving, and querying YouTube channel and video data. Enhance your analytics with Google API integration and uphold ethical data scraping practices.