Skip to content
Python desktop OCR application using Tesseract, OpenCV and Tkinter
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
assets
docs
scans
OCRUsingTesseractandTkinter.pdf
README.md
app.py

README.md

OCR Tesseract OpenCV Tkinter

Allows upload of an image for OCR using Tesseract and deployed using Tkinter. This uses Tkinter, a Python GUI framework based on Tcl/Tkl. OpenCV is used to reduce noise in the image for better processing by pytesseract. Below are 3 images of a job posting taken on a Pixel 2XL phone, and reduced in size using Gimp by adjusting quality.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

For Windows 10, Tesseract must be installed - you can find installer here:

Installing and Running

For ImageTk
$ sudo apt-get update
Install OpenCV
$ sudo apt-get install python3-pil.imagetk
$ pip install opencv-python 
$ pip install opencv-contrib-python
Install Tesseract/Pytesseract
$ sudo apt install tesseract-ocr
$ sudo apt install libtesseract-dev
$ pip install pytesseract

Also, see requirements.txt file produced using $ pip freeze > requirements.txt

You can use these images to test it - these are photos of a job posting:

Built With

Python
Tkinter
Pytesseract
OpenCV

Resources

Here are some helpful resources on the web that I used for this project.

Acknowledgments

You can’t perform that action at this time.