public
Description: Parsers and utilities for the Senate LD-1/LD-2 database.
Homepage:
Clone URL: git://github.com/dhess/lobbyists.git
name age message
file .gitignore Tue Oct 07 00:49:40 -0700 2008 .gitignore: ignore .egg files. [dhess]
file AUTHORS Sun Oct 12 18:02:40 -0700 2008 Added AUTHORS, ChangeLog and INSTALL files. Ad... [dhess]
file COPYING Loading commit data...
file INSTALL Sun Oct 12 18:02:40 -0700 2008 Added AUTHORS, ChangeLog and INSTALL files. Ad... [dhess]
file Makefile
file README
file TODO
file ez_setup.py Tue Oct 07 00:44:38 -0700 2008 setuptools packaging. Lots of changes in this ... [dhess]
directory lobbyists/
file setup.py
README
Summary
-------

This package provides a reference parser and database importer for the
United States Senate LD-1/LD-2 lobbying disclosure database. The
Senate provides the database as a series of XML documents,
downloadable here:

http://www.senate.gov/legislative/Public_Disclosure/database_download.htm

The SQL database schema used by the importer is a direct translation
of the XML schema used in the Senate documents. This isn't a
particularly useful format for analyzing lobbying data, but it is
useful for analyzing the lobbying records themselves, which often
contain errors or anomalies. In any case, it shouldn't be too
difficult to adapt the importing code in this package to a more useful
schema.

For information on how to interpret the LD-1/LD-2 database, see here:

http://watchdog.jottit.com/lobbying_database

Scripts
-------

The lobbyists-load script loads one or more XML documents into a
database.

The lobbyists-benchmark script loads one XML document into a database,
and reports the amount of time required to a) parse the document and
b) import the parsed records into the database. It's mainly
interesting for developers working on the lobbyists package itself.