Pinned

  1. tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ · 66.5k stars · 9.8k forks

  2. tessdata_best Public

    Best (most accurate) trained LSTM models.

    1.3k stars · 400 forks

  3. tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6.9k stars · 2.3k forks

  4. tessdata_fast Public

    Fast integer versions of trained LSTM models

    534 stars · 150 forks

Repositories

Showing 10 of 14 repositories
  • tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ · 66,493 stars · Apache-2.0 · 9,848 forks · 418 issues (7 need help) · 25 pull requests · Updated Apr 27, 2025
  • tesstrain Public

    Train Tesseract LSTM with make

    Python · 673 stars · Apache-2.0 · 203 forks · 62 issues · 2 pull requests · Updated Apr 18, 2025
  • tessdata_contrib Public

    User contributed (non Google) OCR models for Tesseract

    26 stars · Apache-2.0 · 24 forks · 0 issues · 3 pull requests · Updated Apr 18, 2025
  • langdata Public

    Source training data for Tesseract for lots of languages

    854 stars · Apache-2.0 · 883 forks · 45 issues (1 needs help) · 9 pull requests · Updated Apr 1, 2025
  • tessdoc Public

    Tesseract documentation

    HTML · 2,016 stars · 383 forks · 19 issues · 5 pull requests · Updated Feb 5, 2025
  • tessdata_fast Public

    Fast integer versions of trained LSTM models

    534 stars · Apache-2.0 · 150 forks · 3 issues · 0 pull requests · Updated Aug 1, 2024
  • test Public

    Repository for tesseract testing

    Shell · 31 stars · Apache-2.0 · 31 forks · 1 issue · 0 pull requests · Updated Jun 9, 2024
  • tessdata_best Public

    Best (most accurate) trained LSTM models.

    1,334 stars · Apache-2.0 · 400 forks · 22 issues · 1 pull request · Updated Mar 9, 2024
  • tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6,880 stars · Apache-2.0 · 2,315 forks · 51 issues (2 need help) · 2 pull requests · Updated Mar 9, 2024
  • langdata_lstm Public

    Data used for LSTM model training

    117 stars · Apache-2.0 · 156 forks · 24 issues (1 needs help) · 5 pull requests · Updated Mar 9, 2024