Skip to content

Pinned Loading

  1. tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ 67k 9.9k

  2. tessdata_best Public

    Best (most accurate) trained LSTM models.

    1.3k 401

  3. tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6.9k 2.3k

  4. tessdata_fast Public

    Fast integer versions of trained LSTM models

    537 150

Repositories

Showing 10 of 14 repositories
  • tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ 66,969 Apache-2.0 9,898 417 (7 issues need help) 25 Updated May 2, 2025
  • tesstrain Public

    Train Tesseract LSTM with make

    Python 675 Apache-2.0 205 62 2 Updated Apr 18, 2025
  • tessdata_contrib Public

    User contributed (non Google) OCR models for Tesseract

    27 Apache-2.0 24 0 3 Updated Apr 18, 2025
  • langdata Public

    Source training data for Tesseract for lots of languages

    855 Apache-2.0 883 46 (1 issue needs help) 9 Updated Apr 1, 2025
  • tessdoc Public

    Tesseract documentation

    HTML 2,038 392 19 5 Updated Feb 5, 2025
  • tessdata_fast Public

    Fast integer versions of trained LSTM models

    537 Apache-2.0 150 3 0 Updated Aug 1, 2024
  • test Public

    Repository for tesseract testing

    Shell 32 Apache-2.0 31 1 0 Updated Jun 9, 2024
  • tessdata_best Public

    Best (most accurate) trained LSTM models.

    1,346 Apache-2.0 401 22 1 Updated Mar 9, 2024
  • tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6,935 Apache-2.0 2,330 51 (2 issues need help) 2 Updated Mar 9, 2024
  • langdata_lstm Public

    Data used for LSTM model training

    117 Apache-2.0 156 24 (1 issue needs help) 5 Updated Mar 9, 2024

Top languages

Loading…