Handwriting-Recognition

OCR stands for Optical Character Recognition. It is a technology used to convert different types of documents, such as scanned paper documents, PDF files or images captured by a digital camera, into editable and searchable data. OCR has a wide range of applications and is used to automate data extraction and to improve the efficiency of data processing in numerous industries.

In this repository, we present our TrOCR model, fine-tuned on the text-line dataset of the IAM handwriting database. The IAM database is publicly accessible and freely available. It contains general handwritten documents, so you can use our implementation together with the fine-tuned model to convert handwritten documents into machine-readable format.

The purpose of this repository is to suggest a possible fine-tuning for general OCR models.

IAM Database

IAM Database Site

Repository Structure:

Medium article:

Running the app:

The TrOCR directory contains several .py files and a configuration file. To run the model:

1. Download the TrOCR directory.
2. Install the dependencies listed in requirements.txt.
3. If desired, change the training settings in the confing.json file.
4. Run 'train.py'. The model will be saved as 'saved_model' in the directory to which you downloaded the TrOCR directory.
5. Run 'predict.py', either from the terminal or by calling its 'predict' function, and enter the file path of the image you would like to convert to machine-readable format.
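
The prediction step can be sketched roughly as follows. This is a minimal illustration built on the Hugging Face transformers TrOCR classes, not the code in 'predict.py'. The function name recognize_line is hypothetical. The sketch also assumes that 'saved_model' is a directory written with save_pretrained, and that the processor comes from the public microsoft/trocr-base-handwritten checkpoint.

```python
from pathlib import Path


def recognize_line(image_path, model_dir="saved_model",
                   processor_name="microsoft/trocr-base-handwritten"):
    """Return the text recognized in a single handwritten text-line image.

    Hypothetical helper: model_dir and processor_name are assumptions,
    not values taken from this repository.
    """
    path = Path(image_path)
    if not path.is_file():
        raise FileNotFoundError(f"No image found at {path}")

    # Heavy dependencies are imported only once the path has been checked.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(processor_name)
    model = VisionEncoderDecoderModel.from_pretrained(model_dir)

    # TrOCR expects RGB input; the processor resizes and normalizes the image.
    image = Image.open(path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Generate token ids and decode them back into a string.
    generated_ids = model.generate(pixel_values)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling recognize_line("line.png") on a cropped text-line image would return the recognized string. IAM text lines are single lines, so a full page would likely need to be split into lines first.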

Authors

Jonathan Schwarz

Oriel Singer

Mathias Kammoun

Tzaji Minuchin

