Identify simple addresses #3

Closed
TTitcombe opened this issue Feb 24, 2020 · 1 comment
Labels
enhancement New feature or request
Milestone

Comments

@TTitcombe
Owner

The issue

We must be able to identify columns containing simple addresses. Addresses can get very complicated, and there are many edge cases that will need considering at some point. For this first implementation, however, we should only consider "Housenumber streetname" and "post/zipcode" addresses.

British postcodes are of the format LetterLetterDigit[Character] DigitLetterLetter e.g. SW1A 2AA. Common street suffixes are street, road, way, avenue... any others?
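
As a rough illustration of the two formats described above, the patterns could be sketched with Python's `re` module. The names `POSTCODE_RE`, `STREET_RE`, `STREET_SUFFIXES` and `looks_like_simple_address` are hypothetical, the suffix list contains only the four examples named here, and making the space inside the postcode optional is an assumption rather than part of the stated format.

```python
import re

# Postcode format as described in this issue: two letters, a digit, an
# optional extra character, then digit-letter-letter (e.g. "SW1A 2AA").
# Assumption: the space between the two halves is treated as optional.
POSTCODE_RE = re.compile(
    r"\b[A-Z]{2}[0-9][A-Z0-9]? ?[0-9][A-Z]{2}\b", re.IGNORECASE
)

# Assumption: only the suffixes named in this issue; extend as needed.
STREET_SUFFIXES = ("street", "road", "way", "avenue")

# "Housenumber streetname": digits, one or more name words, then a suffix.
STREET_RE = re.compile(
    r"\b[0-9]+ +(?:[A-Z']+ +)+(?:" + "|".join(STREET_SUFFIXES) + r")\b",
    re.IGNORECASE,
)


def looks_like_simple_address(text: str) -> bool:
    """Return True if the text contains a simple street address or postcode."""
    return bool(POSTCODE_RE.search(text) or STREET_RE.search(text))
```

Under these assumptions, "SW1A 2AA" and "10 Downing Street" would be flagged, while a bare town name such as "London" would not.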

Proposed solution

There should exist a function which, when called with a dataframe, returns the names of the columns containing simple addresses.
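
A minimal sketch of what such a function might look like, not the implementation that was eventually merged. The name `find_address_columns`, the `min_fraction` threshold and both regular expressions are assumptions made for illustration. The function only relies on `.items()` yielding `(column name, values)` pairs, which a pandas `DataFrame` provides, so a plain dict of lists works as a stand-in.

```python
import re
from typing import Any, List

# Assumed patterns for the two simple formats in scope for this issue.
_POSTCODE = re.compile(r"\b[A-Z]{2}[0-9][A-Z0-9]? ?[0-9][A-Z]{2}\b", re.IGNORECASE)
_STREET = re.compile(
    r"\b[0-9]+ +(?:[A-Z']+ +)+(?:street|road|way|avenue)\b", re.IGNORECASE
)


def _is_simple_address(value: Any) -> bool:
    """Return True for strings containing a simple street address or postcode."""
    if not isinstance(value, str):
        return False
    return bool(_POSTCODE.search(value) or _STREET.search(value))


def find_address_columns(table: Any, min_fraction: float = 0.5) -> List[str]:
    """Return names of columns where enough string values look like addresses.

    `table` is anything whose .items() yields (column name, iterable of values).
    `min_fraction` is a hypothetical threshold on the share of matching strings.
    """
    flagged = []
    for name, column in table.items():
        # Ignore non-string values such as numbers or missing entries.
        values = [v for v in column if isinstance(v, str)]
        if not values:
            continue
        hits = sum(_is_simple_address(v) for v in values)
        if hits / len(values) >= min_fraction:
            flagged.append(name)
    return flagged


example = {
    "name": ["Alice", "Bob"],
    "address": ["10 Downing Street", "221 Baker Street"],
    "postcode": ["SW1A 2AA", "NW1 6XE"],
    "city": ["London", "London"],
    "age": [30, 41],
}
address_columns = find_address_columns(example)
```

Using a fraction rather than requiring every value to match is one possible design choice; it tolerates a few malformed entries in an otherwise address-like column.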

Things to consider

  • Do not consider address edge cases at the moment.
  • Do not consider town names or greater at the moment. I.e. "London" should not be considered a breach of privacy.
@TTitcombe TTitcombe added the enhancement New feature or request label Feb 24, 2020
@TTitcombe TTitcombe added this to the 0.1.0 milestone Feb 24, 2020
This was referenced Feb 24, 2020
@TTitcombe
Owner Author

Fixed by #6
