scripts that support hadoop processing
Language/font training files for ODW projects.
Script for combining use of olena and tesseract for word coordinates, using olena for segmentation and tesseract's hocr format for coordinates.
A collection of Supplejack parsers
Try to find regions missed by Tesseract.
Support manuals and documentation for VITA Digital Toolkit
Directory-based OCR processing using Tess4J and PDFBox
Newspaper Inventory for ODW Projects
rework-in-progress for Ember 3.x version of Supplejack client.
Create image-based PDF file with readable text.
Extract detailed glyph information and/or quick text using Tesseract API
Drupal and associated files for INK newspaper project.
Gathering point for Drupal support files.