Interactive and searchable House staffer directory, based on House disbursement data.

House Staff Directory

A staff directory for the House of Representatives, using data culled from the House's quarterly disbursement records.

The House publishes each quarter's data by the end of the following quarter, so the records will generally be 3-6 months out of date.
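
The 3-6 month figure follows from the publication schedule. A rough sketch of the arithmetic, assuming each release lands exactly on the last day of the following quarter (the helper names here are illustrative and not part of this codebase):

```ruby
require "date"

# Last calendar day of quarter 1..4 in the given year.
def quarter_end(year, quarter)
  Date.new(year, quarter * 3, -1)
end

# The quarter after [year, quarter], as a [year, quarter] pair.
def next_quarter(year, quarter)
  quarter == 4 ? [year + 1, 1] : [year, quarter + 1]
end

# Whole calendar months from date a to date b (day of month ignored).
def months_between(a, b)
  (b.year - a.year) * 12 + (b.month - a.month)
end

# Age of a quarter's records, in months, when they are published and
# when the next quarter's release replaces them.
# Assumption: releases land exactly at the end of the following quarter.
def staleness_range(year, quarter)
  covered_through = quarter_end(year, quarter)
  published_by    = quarter_end(*next_quarter(year, quarter))
  superseded_by   = quarter_end(*next_quarter(*next_quarter(year, quarter)))
  [months_between(covered_through, published_by),
   months_between(covered_through, superseded_by)]
end

p staleness_range(2016, 4) # => [3, 6]
```

If the House publishes earlier than the deadline, the lower bound shrinks accordingly.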

Setup

Depends on Ruby 1.8.7; it has not been tested on anything higher. This needs to be updated.

Install dependencies with bundler:

bundle install

Copy config.ru:

cp config.ru.example config.ru

Copy config/config.yml.example:

cp config/config.yml.example config/config.yml

Run with unicorn:

bundle exec unicorn
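
The two copy steps in this section can also be done from Ruby without clobbering files that already exist. This is only a sketch: `copy_examples` is a hypothetical helper, not something shipped in this repository.

```ruby
require "fileutils"

# Copy each example file into place under root, unless the target
# already exists or the example is missing. Returns the files created.
def copy_examples(root = ".")
  # Example files named in the setup steps, mapped to the copies the app reads.
  examples = {
    "config.ru.example"         => "config.ru",
    "config/config.yml.example" => "config/config.yml"
  }
  examples.each_with_object([]) do |(example, target), created|
    src = File.join(root, example)
    dst = File.join(root, target)
    next if File.exist?(dst) || !File.exist?(src)
    FileUtils.cp(src, dst)
    created << target
  end
end

puts "Created: #{copy_examples.join(', ')}"
```

Running it a second time creates nothing, so local edits to config.ru or config/config.yml are left alone.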

Loading new staff records

It's an old, blunt, violent process.

  1. Get the CSVs generated from the sunlightlabs/disbursements process.

  2. Put them into the data/ directory and run:

rake load:all

  3. Wait for an hour or more as the data is blown away and reloaded from scratch. It's not very efficient, and the website will show incomplete data for that time.

  4. Update the 'as of' date displayed on the home page in views/index.erb to the last day of the quarter processed.
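
For the 'as of' date, the last day of a quarter can be computed rather than looked up. A minimal sketch, assuming quarters are calendar quarters numbered 1 to 4; `as_of_date` is a hypothetical helper, and the display format is a guess that should be matched to whatever views/index.erb already uses:

```ruby
require "date"

# Last calendar day of quarter 1..4 in the given year.
# A day of -1 asks Date for the final day of that month.
def as_of_date(year, quarter)
  raise ArgumentError, "quarter must be 1..4" unless (1..4).include?(quarter)
  Date.new(year, quarter * 3, -1)
end

# Assumed display format; adjust to the one the home page uses.
puts as_of_date(2017, 1).strftime("%B %-d, %Y") # prints "March 31, 2017"
```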

data/ expectations

The data/ directory expects four CSV files:

  • staffers.csv: Unique staffer names as they appear in the original disbursement data, and any standardizations or corrections to those names.
  • titles.csv: Unique titles as they appear in the original disbursement data, and any standardizations or corrections to those titles.
  • offices.csv: Unique office names as they appear in the original disbursement data, and any standardizations or corrections to those names. Expanded details for each office (committee ID, building, room, telephone number) are added by hand.
  • positions.csv: The "join table" that links staffer names, title names, and office names together as a "position" for each quarter.
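
Since a reload takes an hour or more, it may be worth confirming that all four files are in place before starting one. A small sketch of such a check; `missing_data_files` is a hypothetical helper and not part of the Rakefile:

```ruby
# Names of the four CSVs listed above that are absent from dir.
def missing_data_files(dir = "data")
  expected = %w[staffers.csv titles.csv offices.csv positions.csv]
  expected.reject { |name| File.file?(File.join(dir, name)) }
end

missing = missing_data_files
if missing.empty?
  puts "All four CSVs present; ready for rake load:all"
else
  puts "Missing from data/: #{missing.join(', ')}"
end
```

This only checks that the files exist; it does not validate their columns or contents.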

Why not the Senate?

The Senate's expenditure data is published every 6 months, can be quite out of date, and is much more difficult to parse than the House's.

Our parser for the House is here. The Senate parser is still experimental, but data can be found here.

License

Currently GPLv3.