New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pdftotext.Error: Poppler error creating document #17
Comments
Can you attach the PDF in question, |
Its a research paper from arxiv.org |
Okay, I found it. Works fine here:
What version of poppler do you have? |
I'm using Version: 3:4.8.5-1, I had 1 million pdf files and I got this error for many files. I'll try to do this again. I'm not sure if I'm doing something wrong in my function. Please let me know if you think I'm doing something wrong while reading pdf files. Thanks for the help. |
Your code looks fine to me, but if you're using an older version of poppler, errors like this are more common. |
how do I install poppler on windows? After I download a zip, which directory it needs to be placed at. |
I'm getting this error when running using pdftotext==2.2.2, poppler-utils==0.1.0 |
I'm getting this error too when running Edit: nevermind. I found one of the read files was actually not a pdf file but a png image of the document... |
Same issue with same file, Arch Linux, poppler 23.12.0-1, and pdftotext 2.2.2-4 |
while using pdftotext with multiprocessing module on ec2
My code:
Link : arxiv paper
The text was updated successfully, but these errors were encountered: