US address parsing
docs
tests
training
usaddress
.gitignore
LICENSE
README.md
requirements.txt
setup.py


usaddress

usaddress is a Python library that parses unstructured address strings into address components using advanced NLP methods.

To install:

> pip install usaddress

To build and test the development code:

> pip install -r requirements.txt
> python setup.py develop
> python training/training.py
> nosetests .

Here's how you use it:

>>> import usaddress
>>> usaddress.parse('123 Main St. Suite 100 Chicago, IL')
[('123', 'AddressNumber'), 
 ('Main', 'StreetName'), 
 ('St.', 'StreetNamePostType'), 
 ('Suite', 'OccupancyType'), 
 ('100', 'OccupancyIdentifier'), 
 ('Chicago,', 'PlaceName'), 
 ('IL', 'StateName')]
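
The parse result is a flat list of (token, label) pairs, so multi-word components arrive as separate entries. As a minimal sketch, assuming you want one string per label, the pairs can be grouped with a small helper. The name `group_by_label` is hypothetical and not part of usaddress, and the sample data is copied from the output above so the snippet runs without the library installed.

```python
from collections import OrderedDict


def group_by_label(tagged_tokens):
    """Join tokens that share a label into one string per label.

    Hypothetical helper for illustration; not part of usaddress.
    """
    grouped = OrderedDict()
    for token, label in tagged_tokens:
        # Drop trailing commas that stay attached to tokens such as 'Chicago,'
        cleaned = token.rstrip(',')
        grouped.setdefault(label, []).append(cleaned)
    # Collapse each list of tokens into a single space-separated string
    return OrderedDict(
        (label, ' '.join(parts)) for label, parts in grouped.items()
    )


# Sample data copied from the parse output shown above
tagged = [
    ('123', 'AddressNumber'),
    ('Main', 'StreetName'),
    ('St.', 'StreetNamePostType'),
    ('Suite', 'OccupancyType'),
    ('100', 'OccupancyIdentifier'),
    ('Chicago,', 'PlaceName'),
    ('IL', 'StateName'),
]

components = group_by_label(tagged)
for label, value in components.items():
    print(label, '=', value)
```

With a live install, the list returned by `usaddress.parse(...)` could be passed in place of `tagged`. Note that this sketch merges all tokens with the same label, even if they are not adjacent in the address.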

Important links