Generate text images for training deep learning OCR models
Updated Apr 3, 2019 - Python
This repository contains a dataset of 403 images for table detection in documents.
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
Creates synthetic degraded document images that can be used to train neural networks
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
Total Text Dataset. A one-of-a-kind dataset of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved.
A repository with anonymized invoices
Scripts and results from our OCR roundup, available on Source
Ground truth line annotations for the Berliner Börsen-Zeitung
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
A synthetic data generator for text recognition
Parliamentary Bills Classification using Document-Level Embedding and Bidirectional Long Short-Term Memory
Distorted Document Images dataset (DDI-100).
Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021
A simple project for generating cropped images of characters. It supports Chinese and English characters, with optional backgrounds. Medical bill simulation is also included.
Master's thesis work as part of the M2 (Advanced Robotics) program @ Centrale Nantes