Home
This project is an analytics demonstration using Python with the Google Cloud Platform. An R example is in development but not yet available. The goal is to provide a reproducible example using publicly available data and open source analytics tools.
For those who aren't familiar with BigQuery (the primary cloud service used in these examples), it is a 'serverless' database system that is fully managed and always available. Instead of charging per environment, instance, or hour like many cloud database systems, Google charges based on the amount of data a query processes. For those familiar with Amazon Web Services, BigQuery's pricing model is more like DynamoDB's than that of Aurora or Redshift instances, where you pay per hour.
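Because billing follows the bytes a query processes, it can help to estimate a query's cost before running it. The sketch below is illustrative and not part of this project's code: `estimate_query_cost` and `dry_run_bytes` are hypothetical helper names, and the rate passed in is a placeholder you would replace with the current figure from Google's pricing page. The dry-run helper assumes the `google-cloud-bigquery` package is installed and credentials are configured.

```python
TIB = 2 ** 40  # bytes in one tebibyte


def estimate_query_cost(bytes_processed: int, price_per_tib: float) -> float:
    """Estimate the on-demand cost of a query that processes `bytes_processed` bytes.

    `price_per_tib` is supplied by the caller; no rate is assumed here.
    """
    if bytes_processed < 0:
        raise ValueError("bytes_processed must be non-negative")
    return bytes_processed / TIB * price_per_tib


def dry_run_bytes(sql: str) -> int:
    """Ask BigQuery how many bytes `sql` would process, without running it.

    Needs the google-cloud-bigquery package and valid credentials.
    """
    # Imported lazily so estimate_query_cost works without the client library.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(dry_run=True, use_query_cache=False)
    job = client.query(sql, job_config=job_config)
    return job.total_bytes_processed


if __name__ == "__main__":
    # Placeholder rate for illustration only; check current pricing.
    example_rate = 5.0
    print(estimate_query_cost(TIB // 2, example_rate))
```

With credentials in place, `estimate_query_cost(dry_run_bytes(sql), rate)` would give a rough figure for a specific query; selecting fewer columns generally lowers the bytes processed.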
Start with the setup steps below, then take a look at the code.
Setup:
- Python Environment setup
- Data setup
Source Code:
- Jupyter Notebook (Python)
- Data Exploration