Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert Tableau Prep process to R #10

Open
1 of 10 tasks
jonathanleek opened this issue Sep 30, 2019 · 1 comment
Open
1 of 10 tasks

Convert Tableau Prep process to R #10

jonathanleek opened this issue Sep 30, 2019 · 1 comment
Labels
Pre-Game Things to hopefully get done before event

Comments

@jonathanleek
Copy link
Collaborator

jonathanleek commented Sep 30, 2019

We ended up creating our dataset using Tableau Prep due to inexperience with R. Would still like to have the process replicated in an R script that can be run along side the rest of our code.

  • Download Parcel.mdb from City Open Data Portal

  • Join Prcl Table to BldgCom table on Handle, keeping all Handles that appear in BldgCom

  • Join Prcl Table to BldgRes table on Handle, keeping all Handles that appear in BldgRes

  • Join Prcl Table to BldgSect table on Handle, keeping all Handles that appear in BldgSect

  • Union Results of above joins together. The resulting table should contain all Handles that appear in BldgCom, BldgRes, or BldgSect, but not all Handles that appear in Prcl. We'll call the resulting table 'Buildings' for now.

  • Join Prcl to Buildings, keeping only Handles that appear in Prcl but not Buildings. The resulting dataset should be all lots without buildings in Saint Louis; we'll call this 'Lots'.

  • Union Buildings and Lots

  • Download and unzip nearby vacant parcels summary.zip. This csv dataset contains the number of vacant properties within set ranges of every handle in Saint Louis.

  • Within that dataset, pivot rows to columns so that each range is a column containing the count of vacant properties within that range.

  • Join the resulting dataset with the 'Buildings and Lots' dataset, keeping all parcels that are contained in 'Buildings and Lots'. The result should be our final working dataset.

@jonathanleek jonathanleek added the Pre-Game Things to hopefully get done before event label Sep 30, 2019
@chris-prener
Copy link

When do you want the reproducible workflow in place?

@bransonf bransonf added this to To do in Data Creation Oct 4, 2019
@bransonf bransonf moved this from To do to In progress in Data Creation Oct 4, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Pre-Game Things to hopefully get done before event
Projects
Data Creation
  
In progress
Development

No branches or pull requests

2 participants