A small, simple CLI OCR script that automatically runs OCR on an image, or splits a PDF into page images and then runs OCR on each page.
Updated Feb 10, 2024 - Python
Performs very fast OCR on a list of images (file path, URL, base64, bytes, NumPy, PIL ...) using Tesseract and returns the recognized text, its coordinates, and line-based word grouping in a DataFrame.
Multiprocessing OCR with Tesseract
A Python-based project that extracts text from images using Optical Character Recognition (OCR) techniques, leveraging the Tesseract OCR engine.
BizCardX is a Streamlit-based tool that uses OCR to extract and manage business card data. Easily upload cards, extract information, and store it in a PostgreSQL database.
A simple desktop application that extracts text from images using OpenCV and the Pytesseract OCR module for Python 3. The GUI is implemented with the Tkinter module.
Intelligent File Delivery Tool
OCR resume
Transform images with text into a concise summary using Tesseract OCR and Google's Pegasus model
Creates a JSON file using a text recognition model and calculates the Levenshtein ratio.
Extracts relevant information from business cards using the easyOCR library and stores it in a MySQL database. Built with Python, easyOCR, MySQL, and Streamlit.
This project detects and extracts the numbers from a credit card image using OpenCV and Tesseract.
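One of the entries above mentions calculating a Levenshtein ratio, a common way to score OCR output against a reference string. The following is a minimal sketch of that idea, not the listed project's code. The function names `levenshtein_distance` and `levenshtein_ratio` are hypothetical, and the ratio is assumed to be defined as 1 minus the edit distance divided by the longer string's length; other libraries normalize differently.

```python
def levenshtein_distance(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, or substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # the distance is symmetric; keep the shorter string as b
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, char_b in enumerate(b, start=1):
            insert_cost = current[j - 1] + 1
            delete_cost = previous[j] + 1
            substitute_cost = previous[j - 1] + (char_a != char_b)
            current.append(min(insert_cost, delete_cost, substitute_cost))
        previous = current
    return previous[-1]


def levenshtein_ratio(a: str, b: str) -> float:
    """Similarity in [0, 1]. Assumed normalization: 1 - distance / max length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein_distance(a, b) / longest
```

In an OCR setting, `a` would typically be the recognized text and `b` the ground-truth text, so a ratio close to 1.0 suggests the recognition was nearly exact under this normalization.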