Skip to content

Chargrid-OCR: End-to-end Trainable Optical Character Recognition through Semantic Segmentation and Object Detection

Notifications You must be signed in to change notification settings

akkshita/chargrid-ocr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Chargrid-ocr (Pytorch)

Implementation of Chargrid-OCR: End-to-end Trainable Optical Character Recognition through Semantic Segmentation and Object Detection

It is a novel approach for optical character recognition (OCR) of printed documents. The proposed method, called Chargrid-OCR, combines instance segmentation and OCR into a single end-to-end trainable neural network. The network first segments the text regions in the document using a modified version of Mask R-CNN and then recognizes the characters in each segmented region using a convolutional neural network (CNN) with a novel Chargrid representation. The Chargrid representation is a grid-based encoding scheme that encodes each character in a grid cell and is designed to be robust to variations in character size and aspect ratio.

About

Chargrid-OCR: End-to-end Trainable Optical Character Recognition through Semantic Segmentation and Object Detection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published