This repository has been archived by the owner on Jun 8, 2022. It is now read-only.
Tesseract Tuning #12
Looking further into Tesseract's configs, there may be a way to make Tesseract do most of the work for me. It has options for limiting which characters the program can read, and it can also try to search for patterns. It can also take wordlists of words that may be found in the image.
I tried to work with this in Python-OutdatedTessExperiment, but the results were not convincing. The program ran faster, but with a horrendous loss in accuracy. In my experiments, I never saw any real difference between using and not using the wordlists or patterns. I may have been doing it wrong. If anyone reading has experience with Tesseract's wordlists, try and see if you can get it working. Currently, I'm treating it as a dead end.

If this can be implemented, it may improve or even solve issues #6 and #11.
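For anyone who wants to pick this up, here is a minimal sketch of how the three options (character whitelist, wordlist, patterns) can be passed to the `tesseract` command line. It is not the code from Python-OutdatedTessExperiment. The helper name `build_tesseract_args` and the file names in the usage comment are hypothetical; the flags `--psm`, `--user-words`, `--user-patterns` and the `tessedit_char_whitelist` variable are standard Tesseract options, though how much effect they have may depend on the Tesseract version and OCR engine in use.

```python
from typing import List, Optional


def build_tesseract_args(
    image_path: str,
    whitelist: Optional[str] = None,
    user_words: Optional[str] = None,
    user_patterns: Optional[str] = None,
    psm: Optional[int] = None,
) -> List[str]:
    """Build a tesseract command line (hypothetical helper, illustration only).

    Only builds the argument list; it does not run tesseract.
    """
    # "stdout" as the output base makes tesseract print the text
    # instead of writing it to a file.
    args = ["tesseract", image_path, "stdout"]
    if psm is not None:
        # Page segmentation mode, e.g. 7 = treat the image as one text line.
        args += ["--psm", str(psm)]
    if user_words:
        # Plain text file with one expected word per line.
        args += ["--user-words", user_words]
    if user_patterns:
        # Plain text file with one pattern per line.
        args += ["--user-patterns", user_patterns]
    if whitelist:
        # Restrict recognition to these characters.
        args += ["-c", f"tessedit_char_whitelist={whitelist}"]
    return args


# Usage (assumes tesseract is installed; file names are placeholders):
#   import subprocess
#   cmd = build_tesseract_args("crop.png", whitelist="0123456789",
#                              user_words="words.txt", psm=7)
#   text = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
```

Comparing the output with and without each option on the same set of images should show whether the wordlist or patterns change anything at all.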