The goal of ppmf is to convert Census Privacy Protected Microdata Files into somewhat wider data aggregated to a geographic level.
You can install the development version from GitHub with:
# install.packages("devtools")
devtools::install_github("christopherkenny/ppmf")
Load the package:
library(ppmf)
Download and read data with:
path <- download_ppmf(dsn = 'filename.csv', dir = 'some/directory', version = '19')
al <- read_ppmf(state = 'AL', path = path)
Version ‘19’ reflects the 19.61 finalized parameters used again for the 2020 Census.
For future use, I recommend storing the path to the data for future sessions using:
add_pmmf19_path(path)
Then the path can be recovered with:
path19 <- Sys.getenv('ppmf19')
Once you’ve read in what you want, you can aggregate it to the right level:
al <- al %>% add_geoid()
blocks <- agg(al)
And aggregated data can use the GEOID to merge with shapefiles:
library(dplyr) # to clean up the data
shp <- tigris::blocks('AL', year = 2010) %>%
select(GEOID10, geometry) %>% rename(GEOID = GEOID10)
shp <- shp %>% left_join(blocks, by = 'GEOID')
# always clean shp!
shp[is.na(shp)] <- 0
For users with the newest package version, there is an added dependency
on censable
, which
allows for an easier workflow. If you’ve used the add_pmmf*_path()
workflow suggested, you don’t even need to supply the paths!
This will not just read the ppmf
data, it will merge it with 2010
Census populations (by major race/ethnicity grouping) and add the
corresponding geometries.
al <- read_merge_ppmf('AL', level = 'block', versions = '19')