-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not working #5
Comments
There is a helpful section in the wiki of the tesserect repo for improving the quality of a captured image before passing it to tesseract. You may also want to consider using a white or blacklist to improve results. Example. You can also play around with the Page Segmentation Mode, I haven't really figured out what this is, but apparently you can set it to things like "single line" or "single character" modes, so that tesserect knows what it is looking for. You can simply change Page Segmentation Mode on an instance of the TessBaseAPI class: TessBaseAPI tessBaseApi = new TessBaseAPI();
tessBaseApi.setPageSegMode(TessBaseAPI.PageSegMode.PSM_AUTO); The PageSegMode inner class has the various constants of the modes that are supported. As a last note: make sure the image you have captured has good quality, try not to shake the camera and so on... I hope I could help, I have just started getting into OCR on android. I am also currently trying to get better results. EDIT: I forgot to mention that there are also different types of trained data:
|
Hi @dennis1995 |
This Project uses the tess-two library. The tess-two library uses Tesseract 3.05. It doesn't use the newest version, which is Tesseract 4. I couldn't find an android implementation that uses Tesseract 4. You could try to create it yourself by using JNI and the andorid NDK. Does this answer your question? |
how to improve accuracy of the OCR? There is no a word captured correctly from camera.
The text was updated successfully, but these errors were encountered: