Skip to content
Branch: master
Find file History
Pull request Compare This branch is 159 commits ahead, 52 commits behind BlackrockDigital:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
..
Failed to load latest commit information.
README.md

README.md

July 2019: Reproducible Data Workflows with drake

Presenter: Garrick Aden-Buie

Meeting location

Date/time: Friday, July 19th @ 2pm

Location: Moffitt Stabile Research Building (SRB), David Murphey Conference Room (1st floor)

Meeting details

drake is an R package that provides a powerful, flexible workflow management tool for reproducible data analysis pipelines. drake alleviates the pain of managing large (and even small) data analyses, speeding up iteration and development while providing reproducibility guarantees that are essential for modern research.

https://ropensci.github.io/drake/

In this session, we'll learn how to use drake to manage a data analysis workflow by writing functions that define the steps of the analysis. We'll then learn how drake can keep track of all of these steps, from start to finish, and intelligently update only the outdated steps when your data or code change.

Meeting prerequisites

We'll work through a few examples together, so please bring a laptop with the drake and visNetwork packages installed. (If you don't have a laptop you can share with someone who does at the session.) You would also benefit from installing the tidyverse package for the session.

# drake and dependencies
install.packages(c("drake", "visNetwork"))

# Install the tidyverse (if not currently installed)
# install.packages("tidyverse")

Note: if you've used drake before, please ensure that you have version 7.0.0 or later installed.

Meeting materials

The slides from this talk are available online at https://pkg.garrickadenbuie.com/drake-intro/ and the drake source code and RStudio project are in available on GitHub at https://github.com/gadenbuie/drake-intro. There is also an RStudio Cloud project containing the drake project with all of the required dependencies pre-installed that you can use to explore and run the code from the talk.

You can’t perform that action at this time.