Failure to parse a PDF file from https://www.broadcom.com/collateral/pg/5756M-PG101-R.pdf #118
Comments
Note that on Linux using:
creates a small single-page PDF that has the same issue as the full document.
@euske Any hint on where I could start to help?
This reverts commit 7c31351.
@pombredanne This works now in the current version of pdfminer.
@chid It does not work for me on Ubuntu LTS 14.04 with Python 2.7.6.
I'm on Windows on Python 2.7.10. You could try removing the PDF protection with qpdf first. edit: I just tried it on raspbian and it works fine,
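For anyone who wants to try the qpdf suggestion, here is a minimal sketch of how it could be scripted. It assumes the `qpdf` command-line tool is installed and on the PATH; the helper names `build_qpdf_decrypt_command` and `strip_pdf_protection` are hypothetical, and this has not been checked against the Broadcom file itself.

```python
import subprocess


def build_qpdf_decrypt_command(src, dst, qpdf="qpdf"):
    """Return the argument list for `qpdf --decrypt SRC DST`."""
    return [qpdf, "--decrypt", src, dst]


def strip_pdf_protection(src, dst, qpdf="qpdf"):
    """Ask qpdf to write an unprotected copy of `src` to `dst`.

    Returns True if qpdf reported success, False if qpdf could not be
    started or reported an error.
    """
    try:
        code = subprocess.call(build_qpdf_decrypt_command(src, dst, qpdf))
    except OSError:
        # Raised when the qpdf executable cannot be found or executed.
        return False
    # qpdf exits with 0 on success and 3 on success with warnings.
    return code in (0, 3)
```

If `strip_pdf_protection("5756M-PG101-R.pdf", "unprotected.pdf")` returns True, the unprotected copy can then be passed to pdfminer in place of the original. The sketch uses `subprocess.call` so that it runs on both Python 2.7 and Python 3.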
@chid Thanks, but that's very weird. For https://github.com/nexB/scancode-toolkit I cannot afford to mandate a special version of Python 2.7 on Ubuntu (it comes built in), and I support Windows, Linux, and Mac.
I might have a go at it on Ubuntu with the default Python and see if it works.
The file at https://www.broadcom.com/collateral/pg/5756M-PG101-R.pdf fails to parse.
I verified this with both the latest PyPI version and the HEAD version.
This small snippet reproduces the error: