Skip to content

Content of the Tutorial "Working with larger than memory data in R with Arrow and DuckDB" taught on 2024-11-19

Notifications You must be signed in to change notification settings

fmichonneau/2024-latinr-duckdb-arrow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

LatinR 2024 Tutorial -- Working with larger than memory data in R with Arrow and DuckDB

This tutorial holds the dataset used for the tutorial. The dataset is small sample of the American Community Survey Public Use Microdata Sample (PUMS) files that has been prepared by Nic Crane, Jonathan Keane, and Neal Richardson for their book "Scaling Up With R and Arrow". Many of the code examples used in this tutorial have been inspired by the ones used in their book.

Content of the repository

  • code.R is the file you can use to follow along with the talk
  • solutions.R is the file for the code that used during the tutorial. It also includes a few extra practice exercises and answers to questions that were asked during the tutorial.

The slides for the tutorial are available.

When the recording for the tutorial becomes available, I'll add the link here.

About

Content of the Tutorial "Working with larger than memory data in R with Arrow and DuckDB" taught on 2024-11-19

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages