-
Notifications
You must be signed in to change notification settings - Fork 302
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Good sources of messy addresses #16
Comments
@waldoj, could you send us a sample of the addresses in the lobbying data you were working with? |
@jernsthausen, can you send us a sample of the addresses you were looking to parse? |
@fgregg give me a moment to find a dataset with some completely unparsed addresses. |
I have even more useful messy address data on hand, also from the state of Virginia. You can find it in most of the files at Virginia Businesses. I've taken all of the limited partnership addresses from their CSV file and posted it in a gist, which I hope you'll find helpful. |
Ooooh! Thanks! On Wed, Sep 3, 2014 at 11:13 AM, Waldo Jaquith notifications@github.com
773.888.2718 |
@jernsthausen, @waldoj Thanks for the very messy addresses. I think it's at a point where you guys might want to start playing with it. Follow the instructions in the readme, and give it a spin. I think you'll get better results if you feed the data in column by column (i.e. don't concatenate address_1 and address_2 into a single string, but do each separately) |
Will do! Thank you, @fgregg! |
The text was updated successfully, but these errors were encountered: