public
Description: Parsers and utilities for the Senate LD-1/LD-2 database.
Homepage:
Clone URL: git://github.com/dhess/lobbyists.git
dhess (author)
Sat Oct 18 15:12:52 -0700 2008
commit  bd6948cd0e30eaa3f117eee8603a6d95f2d58033
tree    c049d4f5ad38be18267c5e29d72c8c4352db4f96
parent  5289a96ef508f9d8101eedb9c78a881aa61b9f8b
name age message
file .gitignore Tue Oct 07 00:49:40 -0700 2008 .gitignore: ignore .egg files. [dhess]
file AUTHORS Sun Oct 12 18:02:40 -0700 2008 Added AUTHORS, ChangeLog and INSTALL files. Ad... [dhess]
file COPYING Loading commit data...
file INSTALL Sun Oct 12 18:02:40 -0700 2008 Added AUTHORS, ChangeLog and INSTALL files. Ad... [dhess]
file Makefile
file README
file TODO
file ez_setup.py Tue Oct 07 00:44:38 -0700 2008 setuptools packaging. Lots of changes in this ... [dhess]
directory lobbyists/
file setup.py
README
Summary
-------

This package provides a reference parser and database importer for the
United States Senate LD-1/LD-2 lobbying disclosure database. The
Senate provides the database as a series of XML documents,
downloadable here:

http://www.senate.gov/legislative/Public_Disclosure/database_download.htm

The SQL database schema used by the importer is a direct translation
of the XML schema used in the Senate documents. This isn't a
particularly useful format for analyzing lobbying data, but it is
useful for analyzing the lobbying records themselves, which often
contain errors or anomalies. In any case, it shouldn't be too
difficult to adapt the importing code in this package to a more useful
schema.

For information on how to interpret the LD-1/LD-2 database, see here:

http://watchdog.jottit.com/lobbying_database

Scripts
-------

The lobbyists-load script loads one or more XML documents into a
database.

The lobbyists-benchmark script loads one XML document into a database,
and reports the amount of time required to a) parse the document and
b) import the parsed records into the database. It's mainly
interesting for developers working on the lobbyists package itself.