Skip to content

orielcoding/Handwriting-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Handwriting-Recognition

nasdaq_image

OCR stands for Optical Character Recognition. It is a technology used to convert different types of documents, such as scanned paper documents, PDF files or images captured by a digital camera, into editable and searchable data. OCR has a wide range of applications and is used to automate data extraction and to improve the efficiency of data processing in numerous industries.

In this repository, we present our fine-tuned TrOCR model for the text lines dataset from the IAM handwriting database. The IAM is publicly accessible and freely available. This dataset contains a general type of handwritten documents and with the fine-tuned model for it, you can use our implementation to turn documents into machine-readable format.

The purpose of this repository is to suggest a possible fine-tuning for general OCR models.

IAM Database

IAM Database Site

Repository Structure:

Medium article:

Running the app:

The TrOCR directory contains several .py files and a configuration file. To run the model:

  1. Download the TrOCR directory
  2. Install the requirements.txt file.
  3. If desired, change the settings of the training through the confing.json file.
  4. ****************Run the 'train.py'. The model will be saved to a file called 'saved_model' in the directory to which you downloaded the TrOCR directory.
  5. Run the 'predict.py' file either from the terminal, calling the 'predict' function and enter the file path to an image you would like to convert to machine-readable format.

Authors

Jonathan Schwarz

Oriel Singer

Mathias Kammoun

Tzaji Minuchin

About

Final project at ITC

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •