A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
-
Updated
Jun 25, 2019 - Python
A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.
Creates synthetic degraded image documents that could be used to train Neural Networks
Distorted Document Images dataset (DDI-100).
Ground truth line annotations for the Berliner Börsen-Zeitung
Code and procdures for handwriting object detection and recognition
This Web application crawls PDFs from governement websites, performs table detection and displays advanced statistics.
Generate text images for training deep learning ocr model
Dataset for scene text removal
Tools necessary to perform a multi-fold pretrained voting approach utlizing OCRopus.
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
Total Text Dataset - ICDAR 2017. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
A repository with anonymized invoices
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
This is a simple project to generate simple cropped images with characters. You can generate with Chinese or English characters. Backgrounds are also allowed. Medical bills simulation are also included.
Master's thesis work as a part of M2(Advanced Robotics) @ Centrale Nantes
Add a description, image, and links to the aniketdata topic page so that developers can more easily learn about it.
To associate your repository with the aniketdata topic, visit your repo's landing page and select "manage topics."