Skip to content


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
branch: master
Fetching contributors…

Cannot retrieve contributors at this time

26 lines (15 sloc) 0.56 kb

Scraper Collections


Use the pdftotext tools of xpdf package to convert pdf to text.

pdftotext -layout pdf_file.pdf



    this split question and answer into different files, work in progress. it only handle question and answer, but there is more in the hansard, but the question and answer block is more sane to parse


    This is to parse the order paper in the parliament, but not much user I think

Jump to Line
Something went wrong with that request. Please try again.