python company processer
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
tests
.gitignore Extended gitignore (using github Python gitignore template) Aug 17, 2015
.travis.yml
CHANGES.txt
LICENSE.txt
README.md
cleanco.py
setup.cfg implement #28 - support testing using 'python setup.py test' and 'tox' Dec 29, 2016
setup.py
termdata.py
tox.ini

README.md

cleanco - clean organization names

What is it / what does it do?

This is a Python package that processes company names, providing cleaned versions of the names by stripping away terms indicating organization type (such as "Ltd." or "Corp").

Using a database of organization type terms, It also provides an utility to deduce the type of organization, in terms of US/UK business entity types (ie. "limited liability company" or "non-profit").

Finally, the system uses the term information to suggest countries the organization could be established in. For example, the term "Oy" in company name suggests it is established in Finland, whereas "Ltd" in company name could mean UK, US or a number of other countries.

How do I install it?

Just use 'pip install cleanco' if you have pip installed (as most systems do). Or download the zip distribution from this site, unzip it and then:

  • Mac: cd into it, and enter sudo python setup.py install along with your system password.
  • Windows: Same thing but without sudo.

How does it work?

Let's look at some sample code. First, create an instance of the module:

>>> from cleanco import cleanco

Prepare a string of a company name that you want to process:

>>> business_name = "Some Big Pharma, LLC"

Throw it into the instance:

>>> x = cleanco(business_name)

You can now get the company types:

>>> x.type()
['Limited Liability Company']

...the possible countries...

>>> x.country()
['United States of America', 'Philippines']

...and a clean version of the company name.

>>> x.clean_name()
'Some Big Pharma'

Are there bugs?

See the issue tracker. If you find a bug or have enhancement suggestion or question, please file an issue and provide a PR if you can. For example, some of the company suffixes may be incorrect or there may be suffixes missing.

To run tests, simply install the package and run python setup.py test. To run tests on multiple Python versions, install tox and run it (see the provided tox.ini).

Special thanks to: