Chargrid-OCR (PyTorch)

A PyTorch implementation of "Chargrid-OCR: End-to-end Trainable Optical Character Recognition through Semantic Segmentation and Object Detection".

Chargrid-OCR is an approach to optical character recognition (OCR) for printed documents that combines instance segmentation and character recognition in a single end-to-end trainable neural network. The network first segments the text regions of the document using a modified version of Mask R-CNN, then recognizes the characters in each segmented region using a convolutional neural network (CNN) that operates on the Chargrid representation. The Chargrid representation is a grid-based encoding that assigns each character to a grid cell and is designed to be robust to variations in character size and aspect ratio.
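To make the grid-based encoding concrete, the sketch below rasterizes a list of character boxes into a 2D grid of class indices. It is one plausible reading of the description above, not code from this repository: the `build_chargrid` function, the `(char, x0, y0, x1, y1)` box format, and the convention that index 0 means background are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical box format: (character, x0, y0, x1, y1) in grid coordinates,
# where x1 and y1 are exclusive.
CharBox = Tuple[str, int, int, int, int]


def build_chargrid(
    boxes: Sequence[CharBox], height: int, width: int, vocab: str
) -> List[List[int]]:
    """Rasterize character boxes into a height x width grid of class indices."""
    # Index 0 is reserved for background; characters start at 1.
    index = {ch: i + 1 for i, ch in enumerate(vocab)}
    grid = [[0] * width for _ in range(height)]
    for ch, x0, y0, x1, y1 in boxes:
        label = index.get(ch, 0)  # characters outside the vocab map to background
        # Clip the box to the grid so out-of-range coordinates are ignored.
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                grid[y][x] = label
    return grid


boxes = [("A", 0, 0, 2, 2), ("B", 3, 0, 5, 2)]
grid = build_chargrid(boxes, height=2, width=6, vocab="AB")
for row in grid:
    print(row)
```

Because every cell inside a box carries the same class index, a large and a small instance of the same character differ only in how many cells they cover, which is one way to see why such an encoding can tolerate variation in character size and aspect ratio.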

Folders and files:
DataLoader
Model
Utils
Chargrid_Training.ipynb
README.md
network.png