Skip to content


The OCRopus OCR System and Related Software


OCRopus is a collection of neural-network based OCR engines originally developed by Thomas Breuel, with many contributions from students, companies, and researchers. The organization collects many of the repositories.

Please see the repositories below or check out the Wiki for more information.

Consulting / Support

For commercial consulting or support, please contact

Popular repositories Loading

  1. hocr-tools hocr-tools Public

    Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

    Python 359 78

  2. ocropus4-eval ocropus4-eval Public

    Tools for evaluating OCR performance relative to ground truth.

    Jupyter Notebook 7 1

  3. Public

    6 2

  4. ocrodeg ocrodeg Public

    Forked from NVlabs/ocrodeg

    document image degradation

    Jupyter Notebook 5 2

  5. ocropus4inf ocropus4inf Public

    Jupyter Notebook 4 3

  6. ocropus4train ocropus4train Public

    Jupyter Notebook 1 1


Showing 10 of 10 repositories


This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages


Most used topics