Skip to content
This repository has been archived by the owner on Jan 13, 2023. It is now read-only.

Test LSTM OCR Engine in Tesseract #30

Closed
afolarin opened this issue Jul 8, 2017 · 4 comments
Closed

Test LSTM OCR Engine in Tesseract #30

afolarin opened this issue Jul 8, 2017 · 4 comments

Comments

@afolarin
Copy link
Contributor

afolarin commented Jul 8, 2017

Need to setup some basic benchmarks to test this. Keen to know if it helps with rotations.

https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
tesseract-ocr/tesseract#40
tesseract-ocr/tesseract#465

@jstuczyn
Copy link
Contributor

I've just updated the Tesseract version on our dev environment to 4.0.0alpha and rerun the problematic test (the one with document with rather bad quality, where the tables are not aligned, etc) and the result is significantly better than on the older version.
For comparison:
Actual text:

(2) If only one person is ticket *(part of letter "d' is not visible and in fact it looks like a "t")* in the final two columns then they are the nearest relatives.

3.04.01:

{2} Heel}; 0ch persnn is tieket in the final twrr eeltrnms their Hwy are the nearest relatives.

4.0.0alpha:

(2) If only one person is ticket in the final twn columns then they are the nearest relatives,

However, that does not mean 4.0.0 only makes tiny mistakes like two -> twn, but there are significantly less of them and it is possible to understand the meaning of a sentence regardless of them.

@afolarin
Copy link
Contributor Author

Yeah, I was keen to see how much better the LSTM OCR engine is. Given that it's still in alpha, but sounds like it is both more accurate and faster

@afolarin
Copy link
Contributor Author

TODO set as default configuration

@afolarin afolarin added this to BACKLOG in CogStack WIP Nov 2, 2017
@afolarin afolarin moved this from BACKLOG to TODO in CogStack WIP Nov 2, 2017
@afolarin afolarin added this to To do in afolarin-TODO May 15, 2018
@afolarin afolarin moved this from To do to In progress in afolarin-TODO May 18, 2018
@lrog
Copy link
Contributor

lrog commented Nov 23, 2018

The precision and recall of the new tesseract has been significantly improved according to the official tesseract wiki. On few our tests we could also notice significant improvements running locally on MTSamples.

The openjdk v.11 base image (based on Debian) already provides in the repository the new version of tesseract. Hence, with PR #65 we added the new version of tesseract to be used by default within the upcoming version of CogStack Pipeline image. The same for TravisCI builds.

I'm closing this issue, however, creating a proper benchmark suite to gather precision/recall/performance metrics would be a good idea to have a direct metrics for future improvements.

@lrog lrog closed this as completed Nov 23, 2018
CogStack WIP automation moved this from TODO to DONE Nov 23, 2018
afolarin-TODO automation moved this from In progress to Done Nov 23, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
CogStack WIP
  
DONE
afolarin-TODO
  
Done
Development

No branches or pull requests

3 participants