In this project, I had the opportunity to build an end-to-end Uber data engineering pipeline. The work proceeded in the following stages:
I extracted the TLC Trip Record Data, which consisted of Yellow and Green taxi trip records. The dataset contained essential information such as pick-up and drop-off dates/times, locations, distances, fares, rate types, payment types, and passenger counts. The extracted data was then loaded into Google Cloud Storage (GCS), a reliable and scalable object storage service on Google Cloud Platform (GCP).
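The extract-and-load step above can be sketched as follows. This is a minimal illustration, not the project's actual code: the bucket name, local file path, and the `raw/<taxi_type>/<year>/` object layout are all hypothetical, and the upload itself requires the `google-cloud-storage` package plus GCP credentials.

```python
def gcs_object_path(taxi_type: str, year: int, month: int) -> str:
    """Build a partitioned object path for one monthly TLC trip file,
    e.g. 'raw/yellow/2023/yellow_tripdata_2023-03.parquet' (layout is a
    hypothetical convention, not dictated by the dataset)."""
    return f"raw/{taxi_type}/{year}/{taxi_type}_tripdata_{year}-{month:02d}.parquet"


def upload_trip_file(bucket_name: str, local_path: str,
                     taxi_type: str, year: int, month: int) -> None:
    """Upload one downloaded trip file into GCS under the partitioned path."""
    # Imported lazily so the pure path helper works without GCP libraries.
    from google.cloud import storage
    client = storage.Client()  # picks up default GCP credentials
    blob = client.bucket(bucket_name).blob(gcs_object_path(taxi_type, year, month))
    blob.upload_from_filename(local_path)
```

Keeping the path logic in its own small function makes the bucket layout easy to test and change independently of the upload call.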
Using Jupyter Notebook and Python, I performed extensive data transformation and modeling tasks. This involved designing a fact and dimension (star schema) model for the dataset. I cleaned, organized, and structured the data to create meaningful relationships between the fact table and its dimensions, ensuring its suitability for analysis and further processing.
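The fact/dimension split can be illustrated with a toy pandas example. The column subset and sample rows below are invented for illustration; the `payment_type` code meanings (1 = credit card, 2 = cash) do come from the TLC data dictionary.

```python
import pandas as pd

# Toy sample of trip records (hypothetical subset of the TLC schema).
trips = pd.DataFrame({
    "tpep_pickup_datetime": pd.to_datetime(["2023-03-01 08:15", "2023-03-01 09:40"]),
    "payment_type": [1, 2],
    "fare_amount": [12.5, 8.0],
})

# Dimension table: one row per distinct payment type, with a surrogate key.
payment_dim = trips[["payment_type"]].drop_duplicates().reset_index(drop=True)
payment_dim["payment_type_id"] = payment_dim.index
payment_dim["payment_type_name"] = payment_dim["payment_type"].map(
    {1: "Credit card", 2: "Cash"}  # codes per the TLC data dictionary
)

# Fact table: measures plus a foreign key into the dimension.
fact = trips.merge(payment_dim, on="payment_type")[
    ["tpep_pickup_datetime", "payment_type_id", "fare_amount"]
]
```

The same pattern repeats for each dimension (pickup datetime, rate code, location), leaving the fact table with only keys and numeric measures.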
To streamline the data processing workflow, I implemented an Extract, Transform, Load (ETL) process using a data pipeline tool called Mage. Mage is a modern data pipeline tool that facilitates the efficient extraction, transformation, and loading of data from various sources. The transformed data was then loaded into Google BigQuery, a powerful, fully managed data warehouse solution offered by GCP that enables fast and scalable analysis of large datasets. Finally, I developed a dashboard using Looker on top of the BigQuery tables to visualize the processed data.
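The transform and load stages can be sketched as two small functions. This is an assumption-laden sketch rather than the project's pipeline: the cleaning rule is illustrative, the table ID is hypothetical, and in Mage the transform body would live inside an `@transformer`-decorated block while the load would typically be a data exporter block.

```python
import pandas as pd

def clean_trips(df: pd.DataFrame) -> pd.DataFrame:
    """Transform step: deduplicate and drop rows with non-positive fares.
    (In Mage, this logic would sit inside an @transformer block.)"""
    out = df.drop_duplicates()
    out = out[out["fare_amount"] > 0]
    return out.reset_index(drop=True)


def load_to_bigquery(df: pd.DataFrame, table_id: str) -> None:
    """Load step: push the cleaned frame into a BigQuery table.
    Requires google-cloud-bigquery and GCP credentials; table_id like
    'my-project.uber_dataset.fact_trips' is a placeholder."""
    # Imported lazily so clean_trips runs without GCP libraries installed.
    from google.cloud import bigquery
    client = bigquery.Client()
    client.load_table_from_dataframe(df, table_id).result()  # block until the load job finishes
```

Separating the transform from the load keeps the cleaning logic unit-testable without any cloud access.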
This project was an exciting and valuable learning experience for me, as it was my first time working with Google Cloud and Mage. I gained substantial hands-on experience in data engineering, and it has motivated me to pursue more ambitious data engineering projects in the future.