Skip to content

Pinned Loading

  1. tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ 65.6k 9.8k

  2. tessdata_best Public

    Best (most accurate) trained LSTM models.

    1.3k 394

  3. tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6.8k 2.3k

  4. tessdata_fast Public

    Fast integer versions of trained LSTM models

    522 150

Repositories

Showing 10 of 14 repositories
  • tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ 65,581 Apache-2.0 9,762 412 (7 issues need help) 26 Updated Feb 12, 2025
  • tessdoc Public

    Tesseract documentation

    HTML 1,986 376 18 5 Updated Feb 5, 2025
  • tessdata_contrib Public

    User contributed (non Google) OCR models for Tesseract

    24 Apache-2.0 23 0 3 Updated Oct 22, 2024
  • tessdata_fast Public

    Fast integer versions of trained LSTM models

    522 Apache-2.0 150 2 0 Updated Aug 1, 2024
  • test Public

    Repository for tesseract testing

    Shell 32 Apache-2.0 31 1 0 Updated Jun 9, 2024
  • tesstrain Public

    Train Tesseract LSTM with make

    Python 660 Apache-2.0 200 61 0 Updated Jun 4, 2024
  • tessdata_best Public

    Best (most accurate) trained LSTM models.

    1,316 Apache-2.0 394 22 1 Updated Mar 9, 2024
  • tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6,783 Apache-2.0 2,291 49 (2 issues need help) 2 Updated Mar 9, 2024
  • langdata Public

    Source training data for Tesseract for lots of languages

    850 Apache-2.0 883 45 (1 issue needs help) 8 Updated Mar 9, 2024
  • langdata_lstm Public

    Data used for LSTM model training

    116 Apache-2.0 154 24 (1 issue needs help) 5 Updated Mar 9, 2024