-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
weird text order #64
Comments
I could not reproduce the issue:
It is possible, that the issue only occurs with some PDF readers? With reader do you use? |
I use the "Document Viewer 1.2.0" which can be launched with "atril". |
As the output format is pdf/a i am worried why some pdf readers cannot handle the text correctly. I see three possible reasons: 1) pdf/a is not well enough defined 2) The reader is really not accaptable 3) The output is not 100% pdf/a |
1 and 3 are not the reason. If I have time, I will analyze how it is possible to improve selection accuracy on basic some PDF-readers. Therefore, I reopen the ticket. |
OCRmyPDF is brilliant but sometimes i have a problem with the order of text that is underlaid. When i select the text starting from top left and go to the right end of the line and then successively down line by line, there are sometimes gaps of text which is not selected. After a few more lines these gaps are suddenly selected. Copying the selected text and pasting it into another application reveals the order, which is unfortunately wrong. I use latest stable version and have no error or warning messages.
http://www.loaditup.de/files/803343_acxm67dsue.pdf
(problem occurs in second paragraph.)
The text was updated successfully, but these errors were encountered: