-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Try approaches to extract images using OCR #30
Labels
enhancement
New feature or request
Comments
I checked the Tesseract OCR tool. It is very good at extracting text from images, however, I cannot clearly see how it can help us to extract images or floorplans. For example. The following page was converted to JPEG and then processed with Tesseract OCR. The result contains the text in the image. Result
|
Merged
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The following tools have been suggested:
Tesseract OCR: Tesseract is a widely-used OCR engine that supports multiple languages and can be integrated with various programming languages. It has the ability to detect text within an image, and you can leverage its text extraction capabilities to identify regions of the document that do not contain text, which could be potential image regions.
OpenCV with OCR: OpenCV can be used alongside OCR libraries like Tesseract to perform more complex document analysis tasks. You can use OpenCV's image processing functions to preprocess the document, isolate text regions, and then pass those regions to an OCR engine for text extraction. The remaining regions without recognized text are likely to contain images.
Pytesseract: Pytesseract is a Python wrapper for Tesseract OCR. It provides an easy-to-use interface to integrate Tesseract into your Python code. You can combine Pytesseract with OpenCV to preprocess the image document, extract text regions, and identify image regions accordingly.
OCRopus: OCRopus is an OCR system developed by Google that includes various document analysis tools. It provides features for layout analysis, including text and image region identification. OCRopus can be used to preprocess the document and analyze its layout to differentiate between text and image regions.
The text was updated successfully, but these errors were encountered: