Failure with some files: pypdf2: multiple definitions in dictionary #7
Comments
When you look at the traceback you see at the bottom that it explodes in the PyPDF2 package, so it's not inherently a stapler bug. So I suggest you file a bug there. While I understand that you don't want to share the file, do note that this makes it a lot harder to analyze. Maybe you could figure out what page this is happening on and extract that page and only share that with the authors?
Thanks @fwenzel. While it said it was I think it's clear now, so I'll close this. I'll see if I can find some reproducible way to create the bug. For now, I was able to just use
Re-opening after searching around again... I ran into this issue on pypdf2 last night, but it didn't dawn on me to try applying it to your files. I grepped for
Applying the change referenced in the
That spits out a big list of errors about multiple dictionary definitions for Is this still a
Aha! Thanks for pointing that out. I don't see why we'd do strict=True. Stapler should handle as many files as possible, even if there are some issues buried in them. For instance, within reason, stapler should probably be able to merge two malformed pages into one two-page document with the same errors inside the pages present.
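For readers unfamiliar with the flag, here is a toy sketch of the difference between strict and lenient handling of a repeated dictionary key. It is not PyPDF2's actual code: `build_dictionary` and the stand-in `PdfReadError` class are hypothetical names made up for this illustration, and keeping the first definition of a key is an assumption of the sketch.

```python
import warnings


class PdfReadError(Exception):
    """Stand-in error class for this sketch only (hypothetical)."""


def build_dictionary(pairs, strict=True):
    """Build a dict from (key, value) pairs, keeping the first definition of each key.

    With strict=True a repeated key raises; with strict=False it only warns.
    """
    result = {}
    for key, value in pairs:
        if key not in result:
            result[key] = value
        elif strict:
            raise PdfReadError("Multiple definitions in dictionary for key %s" % key)
        else:
            warnings.warn("Multiple definitions in dictionary for key %s" % key)
    return result


# A dictionary that defines /Type twice, as a malformed file might.
pairs = [("/Type", "/Page"), ("/Rotate", 0), ("/Type", "/Annot")]

# Lenient mode: the duplicate is reported as a warning and parsing continues.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    lenient = build_dictionary(pairs, strict=False)

# Strict mode: the same input aborts with an error.
try:
    build_dictionary(pairs, strict=True)
    strict_failed = False
except PdfReadError:
    strict_failed = True
```

In lenient mode the caller still gets a usable dictionary plus a warning per duplicate, which matches the goal of handling as many files as possible.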
No problem, and thanks for taking a look! I left a comment on
I'm getting this error with some html pages I printed as pdfs and want to concatenate:
I can successfully `cat` other test pdfs together, so I'm not sure what's going on. Unfortunately, it's a backup of an old blog I'd rather not share, but I'd be happy to hunt around if you know what to look for. I took a look at the binary `pdf` file in `nano` and see a bunch of `/Type`s: `/XObject`, `/Catalog`, `/Pages`, `/Annot`, etc. Is there an issue with the file having many of these?
Perhaps this is unique to `pypdf2`, but I wanted to mention it so you're aware. If you can confirm, I'll report there instead. I get this both when manually pulling this and when installing the Arch AUR package.