Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Loading…

Update IL scraper for older sessions #105

Merged
merged 4 commits into from

2 participants

@JoeGermuska

I see that the ILGA has made some changes to its URL structure since I last worked on this code. I poked around and worked out how to get URLs for the four previous sessions of the assembly. I haven't exhaustively tested things, but I did run the entire bill scraper and, several days later, there were no fatal errors and the JSON I've looked at seems sound.

It does seem that there are six kinds of documents that are not currently being passed to bill.add_document. I'll see if I can find time to check those out and process them, but I figured I'd offer what works here rather than defer that indefinitely.

@jamesturk jamesturk merged commit 40728ce into from
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
This page is out of date. Refresh to see the latest.
View
18 openstates/il/__init__.py
@@ -13,9 +13,25 @@
'terms': [
{'name': '97th', 'sessions': ['97th'],
'start_year': 2011, 'end_year': 2012},
+ {'name': '96th', 'sessions': ['96th'],
+ 'start_year': 2009, 'end_year': 2010},
+ {'name': '95th', 'sessions': ['95th', 'Special_95th'],
+ 'start_year': 2007, 'end_year': 2008},
+ {'name': '94th', 'sessions': ['94th'],
+ 'start_year': 2005, 'end_year': 2006},
+ {'name': '93rd', 'sessions': ['93rd'],
+ 'start_year': 2003, 'end_year': 2004},
],
'feature_flags': [],
'session_details': {
- '97th': {'display_name': '97th Regular Session', 'session_id': '84'},
+ '97th': {'display_name': '97th Regular Session', 'params': { 'GA': '97', 'SessionId': '84' }},
+ '96th': {'display_name': '96th Regular Session', 'params': { 'GA': '96', 'SessionId': '76' }},
+ 'Special_96th': {'display_name': '96th Special Session', 'params': { 'GA': '96', 'SessionId': '82', 'SpecSess': '1' }},
+ '95th': {'display_name': '95th Regular Session', 'params': { 'GA': '95', 'SessionId': '51' }},
+ 'Special_95th': {'display_name': '95th Special Session', 'params': { 'GA': '95', 'SessionId': '52', 'SpecSess': '1' }},
+ '94th': {'display_name': '94th Regular Session', 'params': { 'GA': '94', 'SessionId': '50' }},
+ '93rd': {'display_name': '93rd Regular Session', 'params': { 'GA': '93', 'SessionId': '3' }},
+ 'Special_93rd': {'display_name': '93rd Special Session', 'params': { 'GA': '93', 'SessionID': '14', 'SpecSess': '1' }},
}
}
+
View
46 openstates/il/bills.py
@@ -3,6 +3,7 @@
import os
import datetime
import lxml.html
+from urllib import urlencode
from billy.scrape.bills import BillScraper, Bill
from billy.scrape.votes import Vote
@@ -16,10 +17,6 @@ def group(lst, n):
yield tuple(val)
-# chamber prefix, doc id, session_id
-LEGISLATION_URL = ('http://ilga.gov/legislation/grplist.asp?num1=1&num2=10000&'
- 'DocTypeID=%s%s&SessionID=%s')
-
TITLE_REMOVING_PATTERN = re.compile(".*(Rep|Sen). (.+)$")
SPONSOR_PATTERN = re.compile("^(Added |Removed )?(.+Sponsor) (Rep|Sen). (.+)$")
@@ -61,29 +58,38 @@ def _categorize_action(action):
return atype
return 'other'
+LEGISLATION_URL = ('http://ilga.gov/legislation/grplist.asp')
-class ILBillScraper(BillScraper):
+def build_url_for_legislation_list(metadata, chamber, session, doc_type):
+ base_params = metadata['session_details'][session].get('params',{})
+ base_params['num1'] = '1'
+ base_params['num2'] = '10000'
+ params = dict(base_params)
+ params['DocTypeID'] = '%s%s' % (chamber_slug(chamber),doc_type)
+ return '?'.join([LEGISLATION_URL,urlencode(params)])
- state = 'il'
+def chamber_slug(chamber):
+ if chamber == 'lower':
+ return 'H'
+ return 'S'
+class ILBillScraper(BillScraper):
+ state = 'il'
+ def get_bill_urls(self, chamber, session, doc_type):
+ url = build_url_for_legislation_list(self.metadata, chamber, session, doc_type)
+ html = self.urlopen(url)
+ doc = lxml.html.fromstring(html)
+ doc.make_links_absolute(url)
+ for bill_url in doc.xpath('//li/a/@href'):
+ yield bill_url
+
def scrape(self, chamber, session):
- session_id = self.metadata['session_details'][session]['session_id']
- chamber_slug = 'H' if chamber == 'lower' else 'S'
-
-
for doc_type in DOC_TYPES:
- url = LEGISLATION_URL % (chamber_slug, doc_type, session_id)
- html = self.urlopen(url)
- doc = lxml.html.fromstring(html)
- doc.make_links_absolute(url)
-
- for bill_url in doc.xpath('//li/a/@href'):
- self.scrape_bill(chamber, session, chamber_slug+doc_type,
- bill_url)
-
-
+ for bill_url in self.get_bill_urls(chamber, session, doc_type):
+ self.scrape_bill(chamber, session, chamber_slug(chamber)+doc_type, bill_url)
+
def scrape_bill(self, chamber, session, doc_type, url):
html = self.urlopen(url)
doc = lxml.html.fromstring(html)
View
33 openstates/il/tests/test_bill_metadata.py
@@ -0,0 +1,33 @@
+#!/usr/bin/env python
+
+import unittest
+from openstates.il import metadata
+from openstates.il.bills import DOC_TYPES, ILBillScraper
+import logging
+
+log = logging.getLogger('openstates.il.tests.test_bill_metadata')
+
+class TestBillMetadata(unittest.TestCase):
+ """Run a basic sanity check to ensure that something would get scraped for each session in the metadata"""
+
+ def setUp(self):
+ self.scraper = ILBillScraper(metadata)
+
+ def test_lists(self):
+ chambers = ['H','S']
+ sessions = []
+ for term in metadata['terms']:
+ sessions.extend(term['sessions'])
+ self.assertTrue(len(sessions) > 0, "Expected non-zero list of sessions")
+
+ for session in sessions:
+ for chamber in chambers:
+ session_chamber_count = 0
+ for doc_type in DOC_TYPES:
+ count = len(list(self.scraper.get_bill_urls(chamber, session, doc_type)))
+ log.info("Session: %s Chamber: %s Doc Type: %s Count: %i" % (session, chamber, doc_type, count))
+ session_chamber_count += count
+ self.assertTrue(session_chamber_count > 0, "Expected non-zero bill count for Session %s, Chamber %s" % (session, chamber))
+if __name__ == '__main__':
+ unittest.main()
+
Something went wrong with that request. Please try again.