Ignore text content that is positioned outside visible area of page #109

zagraves opened this Issue Mar 27, 2013 · 3 comments





Thanks for implementing the option to use the CropBox.

Another thing I've noticed in the PDFs I've been working with is that text outside the bounds of the CropBox is still included in the generated markup, just positioned off the page.

For example, I have a PDF with printer crop marks, made up of text and images, that are positioned outside the CropBox. This is shown in Acrobat here:

It would be nice to strip any text that would be positioned outside the visible area, whether using --use-cropbox or not.
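To illustrate the kind of filtering I mean, here is a minimal sketch in plain Python. It is not pdf2htmlEX or poppler code: the `Rect` type, the `visible_items` helper, and the sample coordinates are all hypothetical, and it assumes each text item comes with a bounding box in the same coordinate space as the visible box.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned box in page coordinates (hypothetical helper type)."""
    x0: float
    y0: float
    x1: float
    y1: float

    def intersects(self, other: "Rect") -> bool:
        # True when the two boxes overlap with non-zero area.
        return (self.x0 < other.x1 and other.x0 < self.x1
                and self.y0 < other.y1 and other.y0 < self.y1)


def visible_items(items, visible_box):
    """Keep only (text, bbox) pairs whose bbox overlaps the visible box."""
    return [(text, bbox) for text, bbox in items if bbox.intersects(visible_box)]


# Assumed example values: a US Letter sized visible area and two text items.
crop_box = Rect(0, 0, 612, 792)
items = [
    ("Body text", Rect(72, 700, 300, 712)),         # inside the visible area
    ("Crop mark label", Rect(-40, 800, -10, 810)),  # entirely outside it
]

kept = visible_items(items, crop_box)
```

Items that only partially overlap the visible box are kept here; whether those should be clipped or dropped is a separate decision.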

Can provide a PDF sample if needed.



Thanks for reporting.

Yes, please provide a sample PDF file.

I'm not sure it'll be easy to "physically" remove them. That parameter actually just toggles a switch in a poppler API.


I see. I didn't find an easy way, and this issue can be viewed as a simple case of #39, so I've marked it as a duplicate.

@coolwanglu coolwanglu closed this Mar 28, 2013