- 📫 How to reach me: LinkedIn
- ⚡ Fun fact: Two of my favorite books are A Billion Wicked Thoughts by Ogi Ogas & Sai Gaddam, and How Will You Measure Your Life by Clayton Christensen!
- 📚 I'm currently reading Streaming Data by Andrew Psaltis, Designing Cloud Data Platforms by Danil Zburivsky & Lynda Partner, and Building the Data Lakehouse by Bill Inmon
- Lagos
- medium.com/@ofili
- @ofililewis
Pinned
- pyspark-template (Public, Python): Structured Streaming app that reads files from a local folder as stream data as new files are added, applies all the operations to the new data and, finally, writes the re…
- data_pipeline_with_airflow (Public, Python): This project builds a data pipeline that ingests Sparkify's music data into an AWS Redshift data warehouse. The ETL pipeline runs hourly, scheduled with Airflow.
- data-pipeline-with-gcp (Public, Python): This project implements a data ingestion and processing pipeline to collect, store and process time-series data. The pipeline consists of a publisher, a message queue (Pub/Sub), a consumer, a data …
- nyc-taxi-data (Public, Jupyter Notebook): This ETL pipeline extracts NYC Taxi Trip Data and integrates it with Taxi Zone Lookup Data to create a dataset that can be used for descriptive and predictive analysis. For example, to predict the num…
- data-lake (Public, Python): This project builds an ETL pipeline for a data lake hosted on S3. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. We will deploy this Spar…