Assignment 5. Object detection and document recognition

Overview

In this task, we have been required to recognize the content of a small text document. A receipt from supermarket or electricity bill cheque should be fine.

Setup

Prerequisites:

Python 3.10 or higher
Libraries: OpenCV, NumPy, Torch, torchvision, EasyOCR, Matplotlib
A camera or an image file of a skewed document (e.g., receipt or bill)

Installation:

Clone the repository:

   git clone https://github.com/ADA-GWU/a5-object-detection-aliasgerovs.git
   cd a5-object-detection-aliasgerovs

Install the required Python packages:

   pip3 install -r requirements.txt

Running the Code

To run the text recognition:

Execute the script from the command line

   python3 recognition.py

Follow the on-screen instructions to select the corners of the document in the displayed window.
Press "p" to process the image after selecting the corners.

Features

Perspective Transformation: Converts the trapezoidal shape of the document to a rectangular form using manually selected corners.
Text Detection: Utilizes the MSER algorithm to detect regions likely to contain text.
Text Recognition: Applies a pre-trained CNN to recognize the detected text.
Visualization: Shows the processed images and detected text regions with recognized text.

Code Structure

text_recognition.py: Contains the main logic for image processing, text detection, and recognition.
images folder: Contains images.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
images		images
.gitignore		.gitignore
README.md		README.md
recognition.py		recognition.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Assignment 5. Object detection and document recognition

Overview

Setup

Installation:

Running the Code

Features

Code Structure

About

Releases

Packages

Contributors 2

Languages

ADA-GWU/a5-object-detection-aliasgerovs

Folders and files

Latest commit

History

Repository files navigation

Assignment 5. Object detection and document recognition

Overview

Setup

Installation:

Running the Code

Features

Code Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages