New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
First column of two column PDF text image skipped #248
Comments
The problem is the I do not send a (You can use I personally don't trust unpaper unsupervised (without reviewing the output) – so you're aware. It looks like I should be sending unpaper |
Thanks for your response. Sounds like I should drop the Thanks for such a useful package too---truly appreciated! |
Yes, drop If you have a lot of files and want to play with you could add |
Great, that seems to have worked. I have three more questions, one related to this issue, and two not so related (let me know if you want me to move these questions somewhere else).
Again, thanks for all the awesome! |
|
Hi again. Thanks for all your help.
Once again, thanks again for all the help and guidance and fine work. |
If this is a commercial project and you'd like support for setting up a batch processing solution that is something I can offer. |
Thanks for all your help on this issues. I will close as your help provided a solution to my problem. Thanks again. |
Hi,
I have run into a problem. I have a file (2008.pdf) that I want to run
ocrymypdf
on. Thepdf
is a photocopy of a book. The photocopy is two pages of the book perpdf
page, in landscape orientation. I guess I think of this as two columns per page (but I could be thinking about this wrong).I have run the following command:
with the output in ocrmypdfddebug.txt.
The resultant output file is 2008b.pdf. As you can see, the
ocr
layer is only on the right side (second column) of each page.Other relevant details:
Not sure if I am doing something wrong, or if this is an issue in the package. Please let me know if you need more info.
The text was updated successfully, but these errors were encountered: