This repository has been archived by the owner on Apr 15, 2024. It is now read-only.
With the PDFs I'm working with, PDFDocument is really too slow. Is there any way to speed it up?
I often see the suggestion of "giving -n option which turns off automatic layout analysis.", but I have no idea how to do that from Python.
I "only" need to read xrefs and objects/streams; I don't care about pages or how the PDF would be rendered.
Can anyone help please? Thanks.
Sadly I can't share those PDFs.
The thing is that once PDFDocument is initialized, everything else is really fast.
Maybe decompressing the PDF is what's slowing it down.
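One way to check the decompression guess rather than assume it is to run the slow call under the standard-library profiler and see which functions dominate the cumulative time. This is only a rough sketch: `profile_call` is a hypothetical helper, not part of pdfminer, and the commented usage assumes `parser` is a `PDFParser` already built for the file.

```python
import cProfile
import io
import pstats


def profile_call(func, *args, top=10, **kwargs):
    """Run func(*args, **kwargs) under cProfile.

    Returns (result, report), where report is the text of the `top`
    entries sorted by cumulative time.
    """
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args, **kwargs)

    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(top)
    return result, buffer.getvalue()


# Hypothetical usage (assumes pdfminer is installed and `parser` exists):
# doc, report = profile_call(PDFDocument, parser)
# print(report)
```

If entries such as `zlib.decompress` sit at the top of the report, decompression is the likely bottleneck; if xref or object parsing dominates instead, the time is going elsewhere.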
I'm currently doing this: