You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the Project Workspace, I'll find a data set containing real messages that were sent during disaster events. I will be creating a machine learning pipeline to categorize these events so that you can send the messages to an appropriate disaster relief agency. This project will include a web app where an emergency worker can input a new message …
ETL pipeline that extracts their data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to. Then we will test the database and ETL pipeline by running queries given to us by the analytics team from Sparkify and compa…
This projects showcases a Dimensional Data Warehouse model, utilizing random data from Public APIs to populate the DW. The fact table is transactional based. Data endpoints are available through an Flask based API.
Developed a Data pipeline leveraging Google Cloud services to automate data extraction from Google Drive, upload to GCS, and manage BigQuery tables with Dataform. Includes Airflow DAGs for efficient orchestration of ETL processes.