hocr
Here are 35 public repositories matching this topic...
Read and extract text and other content from PDFs in C# (port of PDFBox)
Updated Jul 1, 2024 - C#
OCR engine for all the languages
Updated Jul 2, 2024 - Python
A Gtk/Qt front-end to tesseract-ocr.
Updated Jun 15, 2024 - C++
Text Overlay plugin for Mirador 3
Updated Jun 7, 2024 - JavaScript
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
Updated May 28, 2024 - XSLT
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Updated Apr 30, 2024 - JavaScript
Convert TIFF images into OCR XML using Tesseract
Updated Mar 9, 2024 - Python
Document Layout Analysis resources repos for development with PdfPig.
Updated Oct 1, 2023 - C#
OCR engine for all the languages
Updated Jan 6, 2023 - Python
A gem that parses positional text from hOCR output and provides convenience methods to find text.
Updated Oct 20, 2022 - Ruby