Skip to content
Seth Russell edited this page Sep 29, 2017 · 7 revisions

This project is an analytics demonstration using Python in conjunction with the Google Cloud Platform. An R example is currently in development, but not yet available. The goal is to provide a reproducible example using publicly available data and open source analytics tools.

For those that aren't familiar BigQuery (the primary cloud service used in these examples), it is a 'serverless' database system that is fully managed and always available. Instead of charging per environment/instance/hour of time like many cloud database systems, Google charges based on the amount of data a query processes. For those familiar with Amazon Web Services, BigQuery is more like DynamoDB rather than an Aurora/Redshift/etc instances where you pay per hour.

Start with the setup steps below, then take a look at the code.

Setup:

Source Code:

Clone this wiki locally