Pytesseract-OCR (OpenCV Tkinter)

Allows upload of an image for OCR using Tesseract and deployed using Tkinter. This uses Tkinter, a Python GUI framework based on Tcl/Tkl. OpenCV is used to reduce noise in the image for better processing by pytesseract. Below are 3 images of a job posting taken on a Pixel 2XL phone, and reduced in size using Gimp by adjusting quality.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

For Windows 10, Tesseract must be installed - you can find installer here:

Tesseract Window 10 Installers

Installing and Running

For ImageTk
$ sudo apt-get update
Install OpenCV
$ sudo apt-get install python3-pil.imagetk
$ pip install opencv-python 
$ pip install opencv-contrib-python
Install Tesseract/Pytesseract
$ sudo apt install tesseract-ocr
$ sudo apt install libtesseract-dev
$ pip install pytesseract

Built With

Python
Tkinter
Pytesseract
OpenCV

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
main_app.py		main_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pytesseract-OCR (OpenCV Tkinter)

Getting Started

Prerequisites

Installing and Running

Built With

About

Releases

Packages

Languages

sovan580/Pytesseract-OCR

Folders and files

Latest commit

History

Repository files navigation

Pytesseract-OCR (OpenCV Tkinter)

Getting Started

Prerequisites

Installing and Running

Built With

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages