The goal of this project is to perform data analysis on NYC Uber Taxi data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.
Programming Language:
- Python
- SQL Google Cloud Platform:
- Google Storage
- Compute Instance
- BigQuery
- Looker Studio Modern Data Pipeine Tool:
- Mage: https://www.mage.ai/
Contribute to this open-source project - https://github.com/mage-ai/mage-ai
TLC Trip Record Data Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.
Dataset used: https://github.com/PreetKothari/Uber_etl_pipeline_data_analytics_project/tree/main/Data
Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
Data Dictionary - https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf