Skip to content
Code for EARL 2018 Boston demonstration by Randy Zwitch
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.gitignore
LICENSE
README.md
earlboston2018.Rproj
earlboston_odbc.Rmd
earlboston_odbc.nb.html
environment.txt
flexdashboard.gif
flexdashboard_reticulate.Rmd
omniscidash.png

README.md

EARL2018

Materials from the talk 'GPU-Accelerated visualisation: A primer on exploratory data analysis at scale' by Randy Zwitch at EARL 2018 Boston.

Data

The data are ~720MM records representing hourly pollution measurements for power generators in the United States from the EPA Air Markets Program. The data can be retrieved from their public FTP: ftp://newftp.epa.gov/DMDnLoad

Summary

This talk demonstrated the current state of R support for working with OmniSci (formerly, MapD):

  • ODBC and reticulate were used to show how to access OmniSci
  • Using dplyr to work with large volumes of data with sub-second speed
  • Creating a flexdashboard as a view against the entire 720MM dataset
You can’t perform that action at this time.