Skip to content

Commit

Permalink
Update scraper.py
Browse files Browse the repository at this point in the history
  • Loading branch information
paulbradshaw committed Jan 15, 2017
1 parent 62ef8d0 commit 35d5eca
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion scraper.py
Expand Up @@ -13,7 +13,7 @@

xmldata = scraperwiki.pdftoxml(pdfdata)
print "After converting to xml it has %d bytes" % len(xmldata)
print "The first 2000 characters are: ", xmldata[:2000]
print "The first 5000 characters are: ", xmldata[:5000]

root = lxml.etree.fromstring(xmldata)
pages = list(root)
Expand Down

0 comments on commit 35d5eca

Please sign in to comment.