Fails on postgres pdfs #10

Closed
ralphbean opened this Issue Feb 20, 2014 · 2 comments

Contributor

ralphbean commented Feb 20, 2014

Saw this in the logs:

[2014-02-20 10:49:57][summershum    INFO] Ingesting u'postgresql-9.2.7-US.pdf'
[2014-02-20 10:49:58][summershum WARNING] No files extracted from u'/tmp/tmpZy0Uoe/postgresql-9.2.7-US.pdf'
[2014-02-20 10:49:58][summershum    INFO] Done ingesting u'postgresql-9.2.7-US.pdf'
Owner

pypingou commented Feb 21, 2014

Do we want to check whether the source is a PDF file using something like PyPDF2 (https://pypi.python.org/pypi/PyPDF2/1.20)?

Or do we just use something like local_filename.endswith('.pdf')?
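
For comparison, here is a minimal sketch of the two approaches, not the fix that actually landed. It assumes a plain stdlib check is acceptable: the content check looks for the `%PDF-` marker that PDF files conventionally start with, rather than calling PyPDF2. The function names are hypothetical; only `local_filename` comes from the comment above.

```python
# Hypothetical helpers for illustration; not part of summershum.

PDF_MAGIC = b"%PDF-"  # PDF files conventionally begin with these bytes


def looks_like_pdf_by_name(local_filename):
    """Cheap check: trust the file extension (case-insensitive)."""
    return local_filename.lower().endswith(".pdf")


def looks_like_pdf_by_content(local_filename):
    """Content check: compare the first bytes of the file to the PDF marker."""
    with open(local_filename, "rb") as handle:
        return handle.read(len(PDF_MAGIC)) == PDF_MAGIC
```

The extension check never opens the file, but it is fooled by a misnamed file. The content check costs one small read and also catches PDFs that lack the `.pdf` suffix. Either one could be used to skip the file before extraction is attempted.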

ralphbean closed this in ed61a77 Feb 21, 2014
