How to tune the base model or when are improvements planned #85
Replies: 2 comments 2 replies
-
There is a lot that can still be done to improve the models - mainly expanding, filtering and cleaning the datasets and improving evaluation. In addition there are preprocessing tasks which will help, such as rectifying rotated words before attempting recognition. The models are trained in PyTorch and can be found in https://github.com/robertknight/ocrs-models. Model checkpoints are uploaded to Hugging Face. If you try to use the latest checkpoints with Ocrs you may need to adjust the text detection threshold.
The iOS feature that detects text in photos is very cool and I assume Android is similar? It would be great to reach a similar level of accuracy with this project, but there is still a lot of work to be done as you can see. |
Beta Was this translation helpful? Give feedback.
-
I'm finding ocrs to be far better then tesseract however I'm still getting lots of garbled results or words with letters missing.
I'm blind and my phones screen reader seems to be able to detect text in photos extremely well but I haven't been able to replicate that ability with any software I've found so far.
Is it possible to currently train the existing models further or are improvements to the current model expected soon?
Beta Was this translation helpful? Give feedback.
All reactions