Impute county_name for VIP datasets #55

Closed
pratheekrebala opened this issue Oct 1, 2020 · 3 comments

@pratheekrebala
Member

pratheekrebala commented Oct 1, 2020

The data we received from the Voting Information Project (VIP) often does not contain county information. I have been experimenting with filling it in from the geocodes, but I've noticed a few issues with this approach:

  • Sometimes VIP data doesn't contain last_line (city, state, zip) information, which makes it harder to trust the geocode output.
  • county_name is one of the fields used to validate the geocode output, so when it is missing we lose that check on the very geocodes we would impute it from.
  • I've also seen a couple of cases in VA where polling place geocodes that were close to the county line got placed in the wrong county.
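
To make the county-line problem concrete, here is a minimal sketch of imputing a county from a geocode and flagging points that land close to a boundary. It is not what the pipeline does today. The function names, the `tolerance` parameter, and the two square "counties" are all made up for illustration, and it assumes planar coordinates; real county shapes would come from a boundary file and need a proper projection.

```python
import math

def point_in_polygon(x, y, polygon):
    """Ray-casting test; polygon is a list of (x, y) vertices."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside

def distance_to_boundary(x, y, polygon):
    """Shortest planar distance from (x, y) to any edge of the polygon."""
    best = math.inf
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        dx, dy = x2 - x1, y2 - y1
        length_sq = dx * dx + dy * dy
        # Project the point onto the edge and clamp to the segment.
        t = 0.0 if length_sq == 0 else ((x - x1) * dx + (y - y1) * dy) / length_sq
        t = max(0.0, min(1.0, t))
        best = min(best, math.hypot(x - (x1 + t * dx), y - (y1 + t * dy)))
    return best

def county_from_geocode(x, y, counties, tolerance):
    """Return (county_name, confidence); confidence is 'low' near a county line."""
    for name, polygon in counties.items():
        if point_in_polygon(x, y, polygon):
            near_line = distance_to_boundary(x, y, polygon) < tolerance
            return name, ("low" if near_line else "high")
    return None, None

# Hypothetical counties: two adjacent squares sharing the line x = 10.
counties = {
    "A": [(0, 0), (10, 0), (10, 10), (0, 10)],
    "B": [(10, 0), (20, 0), (20, 10), (10, 10)],
}

print(county_from_geocode(5, 5, counties, tolerance=0.5))
print(county_from_geocode(10.1, 5, counties, tolerance=0.5))
```

The second point sits just over the shared line, so it gets a county but with low confidence, which is the kind of case that would need a second source before we trust it.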

Another, easier approach could be to join the precinct_id columns with the OpenElections dataset, but we'd need to be sure that a precinct name is truly unique, which isn't always the case.
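
A rough sketch of that join, filling county_name only where the precinct maps to exactly one county in the reference data. The `state` key, the normalization (strip and uppercase), and the sample rows are assumptions for illustration, not the actual schema of either dataset.

```python
from collections import defaultdict

def normalize(precinct_id):
    # Assumed normalization; real precinct names likely need more cleanup.
    return precinct_id.strip().upper()

def build_precinct_lookup(reference_rows):
    """Map (state, precinct_id) -> county_name, keeping only unambiguous keys."""
    counties_by_key = defaultdict(set)
    for row in reference_rows:
        key = (row["state"], normalize(row["precinct_id"]))
        counties_by_key[key].add(row["county_name"])
    # Drop any precinct that appears in more than one county.
    return {k: next(iter(v)) for k, v in counties_by_key.items() if len(v) == 1}

def impute_from_precinct(vip_rows, lookup):
    """Return copies of vip_rows with missing county_name filled where possible."""
    out = []
    for row in vip_rows:
        filled = dict(row)
        if not filled.get("county_name"):
            key = (row["state"], normalize(row["precinct_id"]))
            filled["county_name"] = lookup.get(key)
        out.append(filled)
    return out

# Made-up rows: "WARD 1" exists in two counties, "101" in only one.
reference = [
    {"state": "VA", "precinct_id": "101", "county_name": "County A"},
    {"state": "VA", "precinct_id": "Ward 1", "county_name": "County A"},
    {"state": "VA", "precinct_id": "WARD 1", "county_name": "County B"},
]
vip = [
    {"state": "VA", "precinct_id": "101", "county_name": None},
    {"state": "VA", "precinct_id": "ward 1", "county_name": None},
]

lookup = build_precinct_lookup(reference)
imputed = impute_from_precinct(vip, lookup)
print(imputed)
```

The ambiguous precinct stays empty rather than getting a guess, so it can fall through to another method.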

Maybe a mix of the two.

@pratheekrebala
Member Author

Maybe we can look up the polling place in the next available non-VIP dataset.
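
Something like the sketch below, which reads "next available" as the nearest later election in the same state that did not come from VIP. That reading, the match on a normalized address, and every field name here (`election_date`, `is_vip`, `rows`, `address`) are assumptions for illustration.

```python
from datetime import date

def county_from_later_dataset(vip_row, datasets):
    """Return (county_name, election_date) from the nearest later non-VIP dataset."""
    key = (vip_row["state"], vip_row["address"].strip().upper())
    later = sorted(
        (
            d for d in datasets
            if not d["is_vip"] and d["election_date"] > vip_row["election_date"]
        ),
        key=lambda d: d["election_date"],
    )
    for dataset in later:
        for row in dataset["rows"]:
            row_key = (row["state"], row["address"].strip().upper())
            if row_key == key and row.get("county_name"):
                return row["county_name"], dataset["election_date"]
    return None, None

# Made-up data: one VIP row and two later datasets, only one of them non-VIP.
vip_row = {
    "state": "VA",
    "address": "1 Main St",
    "election_date": date(2016, 11, 8),
    "county_name": None,
}
datasets = [
    {"election_date": date(2018, 11, 6), "is_vip": True, "rows": []},
    {
        "election_date": date(2020, 11, 3),
        "is_vip": False,
        "rows": [{"state": "VA", "address": "1 MAIN ST ", "county_name": "County A"}],
    },
]

result = county_from_later_dataset(vip_row, datasets)
print(result)
```

Returning the election date alongside the county makes it easy to see how stale the match is, since polling places can move between elections.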

@chriszs
Contributor

chriszs commented Oct 1, 2020

My experience doing this with geocodes for VIP data in Texas tells me it’s always going to be a little fuzzy.

@chriszs
Contributor

chriszs commented Oct 1, 2020

I wonder if one way to handle this would be to add a county_name_source or county_name_precision column.
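
As a minimal sketch of the county_name_source idea: try each method in a fixed order, take the first one that yields a value, and record which one it was. The source labels ("reported", "precinct_join", "geocode") and their order are placeholders, not an agreed convention.

```python
def pick_county(candidates):
    """candidates: ordered (source, county_name) pairs; the first non-empty one wins."""
    for source, county in candidates:
        if county:
            return {"county_name": county, "county_name_source": source}
    return {"county_name": None, "county_name_source": None}

# Hypothetical row where the file had no county and the precinct join was ambiguous.
picked = pick_county([
    ("reported", None),
    ("precinct_join", None),
    ("geocode", "County A"),
])
print(picked)
```

Downstream users could then filter on county_name_source and decide for themselves how much fuzziness they can tolerate.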

pratheekrebala self-assigned this Oct 2, 2020

2 participants