Auto correct image rotation (-180, -90, 0, +90) #46

fritz-hh · 2014-01-08T22:05:16Z

No description provided.

fritz-hh · 2014-01-12T19:21:54Z

It seems that orientation detection will be supported in the next version of the tesseract command-line interface:
http://code.google.com/p/tesseract-ocr/issues/detail?id=955

eloops · 2014-12-10T12:53:52Z

I have been testing with v3.04 (compiled from git source). With -psm 0 it reports the orientation along with a confidence value and an integer, but that means running tesseract-ocr over the page twice (first for orientation, then for OCR).
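To act on the -psm 0 result from a script, the report has to be turned into numbers first. The snippet below is only a sketch: it assumes the report consists of "Name: number" lines, and the key names in the usage example are made up for illustration, so check them against what your tesseract build actually prints.

```python
import re

# Assumption: each interesting line looks like "Some name: 123" or "Some name: 1.23".
# Lines that do not match this shape are ignored.
_OSD_LINE = re.compile(r"^\s*([A-Za-z ]+?)\s*:\s*(-?\d+(?:\.\d+)?)\s*$")


def parse_osd(report):
    """Collect 'Name: number' lines into a dict keyed by the lowercased name."""
    values = {}
    for line in report.splitlines():
        match = _OSD_LINE.match(line)
        if match:
            values[match.group(1).strip().lower()] = float(match.group(2))
    return values
```

With a hypothetical report such as "Orientation in degrees: 270" followed by "Orientation confidence: 9.30", parse_osd returns a dict with the keys "orientation in degrees" and "orientation confidence", which a wrapper script could use to decide whether to rotate before the OCR pass.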

In -psm 1 mode it adds a 'textangle ###' attribute to the tags in the hocr file, so at the moment I am using the following to detect the rotation and correct it, after hocrTransform.py is called:

# Code removed

$curOCRedPDFRotated translates to a *.ocred.rotated.pdf file so should still be caught by the gs concatenation.
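For anyone wanting to read the textangle values without the removed code, here is a minimal sketch (not the code that was removed). It assumes the angle appears as "textangle N" inside a title attribute of the hocr tags, as described above; the function name is hypothetical and a regex is used instead of a full HTML parser to keep it short.

```python
import re

# Matches title="..." or title='...' and captures the attribute value.
_TITLE = re.compile(r"""title=(["'])(.*?)\1""")
# Assumption: the angle is written as "textangle <number>" inside the title value.
_TEXTANGLE = re.compile(r"\btextangle\s+(-?\d+(?:\.\d+)?)")


def textangles(hocr):
    """Return every textangle value found in the hocr markup, in document order."""
    angles = []
    for _quote, title in _TITLE.findall(hocr):
        match = _TEXTANGLE.search(title)
        if match:
            angles.append(float(match.group(1)))
    return angles
```

Tags whose title carries no textangle are skipped, so an empty list means no angle was reported for the page.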

Unfortunately this doesn't work. If I rotate the image after OCR (and orientation detection) but before calling hocrTransform.py, the image is not rotated correctly (it retains its original dimensions) and the OCR'ed text is overlaid sideways.

If I rotate the image after the PDF is generated, either it doesn't rotate correctly, or the OCR'ed text is correct but not laid out correctly, or both.

So it looks like the only way to do it properly is to call tesseract-ocr twice: once to determine the orientation (rotating the image if necessary), and a second time to perform the OCR.

Edit:
Removed code. It really doesn't work. I kludged up an extra bit that runs tesseract in -psm 0 mode over the .pnm file and then gets convert to rotate the image before passing it back to tesseract for OCR'ing. (I use graphicsmagick convert; I'll test both, and also econvert, to see what speed difference there is.) I don't think the second pass added much to it, although it would be nice to only have to do one pass.

eloops · 2015-09-08T15:06:41Z

I ported this to a node library (here); part of that was implementing auto-rotation. I added a prototype to find the general rotation (by finding the most frequent textangle value in the hocr). Also, by climbing up/down the DOM to the ocr_line class <span> elements and grabbing the textangle, I could correct it when writing the words to the canvas. I'm still not sure whether it's faster to do a separate -psm 0 (OSD only) and then a -psm 6 for the OCR text, or just the -psm 1 (get everything).
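The "general rotation" idea can be sketched in a few lines of Python (this is not the node library's code). It assumes the per-line textangle values have already been collected into a list; snapping each one to the nearest multiple of 90 is an extra assumption, chosen because this issue only targets quarter-turn rotations. The function name is hypothetical.

```python
from collections import Counter


def dominant_rotation(angles, default=0):
    """Return the most frequent angle, snapped to a multiple of 90 in 0..270."""
    if not angles:
        # No angle information at all: assume the page is upright.
        return default
    # Snap to the nearest quarter turn, then normalise (so -90 becomes 270).
    snapped = [int(round(angle / 90.0)) * 90 % 360 for angle in angles]
    return Counter(snapped).most_common(1)[0][0]
```

A page where most lines report 90 and a few stray lines report 0 would come out as 90, so one outlier line does not flip the whole page.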

OCRmyPDF-issuebot mentioned this issue Sep 14, 2015

Auto correct image rotation (-180, -90, 0, +90) ocrmypdf/OCRmyPDF#4

Closed

jbarlow83 closed this as completed Dec 5, 2015
