Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trailing spaces and NUL characters in PDF cause failure identifying EOF #20

Closed
wants to merge 2 commits into from
Closed

Conversation

freakboy3742
Copy link

I have a collection of PDFs that contain a line of NUL and space characters on the line after the %%EOF marker. The current technique for identifying the %%EOF fails on these PDFs because the 'while not line' check on line 704 of pdf.py (the start of the read() method on PdfFileReader) isn't sufficient to identify this line of NUL and spaces as something worth ignoring.

@jimr
Copy link

jimr commented Dec 6, 2012

Works for me, would be great to see this merged.

@jobo3208
Copy link

I agree. This fixes a major shortcoming of the library IMO. Can't tell you how many PDF's I've encountered with this problem.

This pull request was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants