- Arlington - VA
- https://github.com/Kareem1990
- @Z2Kareem
Pinned Loading
-
sparkify-etl-airflow-docker
sparkify-etl-airflow-docker PublicEnd-to-end data engineering project: ETL pipeline for Sparkify using Apache Airflow, Docker, Terraform, and AWS Redshift Serverless. Automates infrastructure, orchestration, and data loading from S…
Python
-
sparkify-redshift-etl-pipeline
sparkify-redshift-etl-pipeline PublicThis project builds an end-to-end cloud data warehouse on AWS Redshift for Sparkify, a fictional music app. It extracts raw song and log data from Amazon S3, loads it into staging tables, and trans…
Python
-
stedi-datalake-full-terraform
stedi-datalake-full-terraform PublicA fully Terraform-managed AWS data lake pipeline for IoT sensor data analytics. Converts raw landing data into trusted and curated layers for machine learning using Glue, S3, and Athena. Originally…
Python
-
stedi-datalake-glue-visual-etl
stedi-datalake-glue-visual-etl PublicEnd-to-end data lakehouse pipeline on AWS using Glue, S3, Athena, and Terraform to curate human balance data for machine learning.
Python
-
Sparkify-Cassandra
Sparkify-Cassandra PublicDesigned a NoSQL data model using Apache Cassandra for Sparkify, a music streaming startup. Cleaned CSV logs with Python, modeled denormalized tables based on analytical queries, and built ETL scri…
Python
-
Bank_Scoop
Bank_Scoop PublicBankScoop is a Python-based ETL pipeline that scrapes global bank rankings, converts market caps to multiple currencies, stores the results in a SQLite database, logs every step, and generates a cl…
Python
If the problem persists, check the GitHub status page or contact support.