ocr

Here are 2,008 public repositories matching this topic...

AlDrAkU / Simple_OCR_cli

A small, simple CLI OCR script that automatically OCRs an image, or splits a PDF into page images and then OCRs each page.

machine-learning ocr tesseract-ocr ocr-recognition

Updated Feb 10, 2024
Python

Performs fast OCR on a list of images (file path, URL, base64, bytes, NumPy array, PIL image ...) using Tesseract and returns the recognized text, its coordinates, and line-based word grouping in a DataFrame.

fast files ocr position tesseract pandas multiple dataframe coordinates easyocr

Updated Nov 14, 2023
Python

casual-lab / PDF-OCRSearch

Content search for scanned PDF books

pdf ocr searching pdf-document-processor

Updated Nov 3, 2022
Python

47h4rv4-b / bankScribe

All R&D related to bank statement transaction categorization and statement analysis.

python pdf ocr

Updated Mar 21, 2023
Python

Paulraj916 / video-to-slide

A Python-based Tkinter application for converting video to slides

ocr tesseract

Updated Aug 20, 2023
Python

hansalemaos / tesseractmultiprocessing

Multiprocessing OCR with Tesseract

python ocr multiprocessing tesseract threads pytesseract

Updated Mar 10, 2023
Python

asheikho99 / image-to-text

A Python-based project that extracts text from images using Optical Character Recognition (OCR) techniques, leveraging the Tesseract OCR engine.

python ocr computer-vision image-processing text-extraction

Updated Apr 19, 2023
Python

HectorBullejos / PyOCR

A simple OCR app to extract and track text in an image.

ocr python3

Updated Nov 15, 2022
Python

RajaSoundari / BizCardX-Extracting-Business-Card-Data-with-OCR

BizCardX is a Streamlit-based tool that uses OCR to extract and manage business card data. Easily upload cards, extract information, and store it in a PostgreSQL database.

python ocr postgresql pandas psycopg2 cv2 matplotlib-pyplot streamlit

Updated Sep 6, 2023
Python

sovan580 / Pytesseract-OCR

A simple desktop application to extract text from images using OpenCV and the Pytesseract-OCR module of Python 3. The GUI is implemented using the Tkinter module of Python 3.

opencv ocr python3 tkinter ocr-python pytesseract-ocr