jwheare / bizparse

A scraper for parsing the House of Commons Future Business pages

This URL has Read+Write access

README.txt
bizparse.py

A scraper for parsing the House of Commons Future Business pages
http://www.publications.parliament.uk/pa/cm/cmfbusi/fbusi.htm

Usage:
./bizparse.py

Writes an XML file to bizparseYYYY-MM-DD.xml for the period ending date.
Ouputs human readable debug logging for the data extracted to stdout

Uses BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/