posit-conf-2023/ds-workflows-r

Data Science Workflows with Posit Tools --- R Focus

posit::conf 2023

by Ryan Johnson and Katie Masiello


🗓️ September 17, 2023
⏰ 09:00 - 17:00
🏨 Plaza A
✍️ https://posit-conf-2023.github.io/ds-workflows-r/


Overview

In this R-focused workshop, we will discuss ways to improve your data science workflows! During the course, we will review packages for data validation, alerting, modeling, and more. We'll use Posit's open source and professional tools to string all the pieces together for an efficient workflow. We'll discuss environments, managing deployed content, working with databases, and interoperability across data products.

This course is for you if you:

  • Build finished data products starting from raw data and are looking to improve your workflow
  • Are looking to expand your knowledge of Posit open source and professional tools
  • Want to improve interoperability between data products in your work or on your team
  • Have experience developing in R (an analogous course with a Python focus is also offered)

Slides and Resources from the Workshop

Presentation materials and a summary of resources covered in the workshop are available at https://katie.quarto.pub/ds-workflows-r/

Prework

This workshop requires that you bring your own laptop.

So that we can hit the ground running, please watch the video below before the workshop:

👉 https://rstudio.wistia.com/medias/uaettrtu1j

Written instructions for registering with Posit Connect are also below:

  1. Visit https://connect.conf23workflows.training.posit.co.
  2. Click the “Sign Up” button at the top right.
  3. Sign up with your personal email.
  4. Make your username the prefix of your personal email (the part before the “@”).
  5. Check your email to confirm your account. The email will be from “conf23workflows@training.rstudio.com”. If it does not arrive, check your junk folder.

We will be using Discord as our main communication method! To make the process go smoothly:

  • Please sign up for an account if you don’t already have one.
  • Make sure your display name is the one you used to register for the conference.
  • In your “About Me,” put the name of your workshop: “Data Science Workflows with Posit Tools — R Focus”

Closer to the start of the conference, we will invite you to the posit::conf Discord server. Once you’ve accepted the invite, we will add you to the channel(s) for your conf workshop(s).

If you have questions in advance of the workshop, please reach out to Ryan (ryan@posit.co) or Katie (katie.masiello@posit.co).

Schedule

Time            Activity
9:00 - 10:30    Workshop Introduction; Reading, Cleaning, Writing and Validating Data
10:30 - 11:00   Coffee break
11:00 - 12:30   Creating, Delivering, and Monitoring a Tidymodel
12:30 - 1:30    Lunch break
1:30 - 3:00     Reporting
3:00 - 3:30     Coffee break
3:30 - 5:00     Advancing your Workflow

Instructors

Ryan Johnson, Data Science Advisor, Posit

Ryan Johnson is a Data Science Advisor at Posit with a background in Microbiology and Bioinformatics. He obtained his PhD from the Uniformed Services University in Maryland and did his postdoctoral training at the National Human Genome Research Institute, NIH. The only thing that rivals his love for infectious diseases is generating 'super cool' visualizations from large data sets using R and RStudio. In his free time, you can find Ryan running marathons/ultramarathons in the DC area or hiking miles along the Appalachian Trail. Ryan resides in Gaithersburg with his wife and two feline co-workers.

Katie Masiello, Solutions Engineer, Posit

Katie Masiello is a Solutions Engineer at Posit. A mechanical engineer by training, she found her calling in data science while doing statistical analysis in the aerospace industry. A good cup of coffee, reproducibility, and making life easier for the next user are the three things she loves most. Katie is an avid knitter and knitr, and she can often be found trying to tame her ridiculously overgrown garden, collecting pebbles, or thinking about taking up running as a hobby.

Notice

The sample data science project used for this workshop provides applications using data that has been modified for use from its original source, www.cityofchicago.org, the official website of the City of Chicago. The City of Chicago makes no claims as to the content, accuracy, timeliness, or completeness of any of the data provided at this site. The data provided at this site is subject to change at any time. It is understood that the data provided at this site is being used at one’s own risk.

This work is licensed under a Creative Commons Attribution 4.0 International License.