Releases: tesseract-ocr/tesseract

5.0.0-beta-20210815

15 Aug 15:12
Pre-release

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Modernize more code
  • More options for binarization
  • Improved support for ARM NEON
  • No longer depends on Abseil for unit tests
  • Support float for model training and text recognition (faster, requires less RAM)
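
As a rough illustration of why the float option in the last item needs less RAM, the sketch below computes the footprint of the same number of weights stored as `double` and as `float`, and runs the same dot product in either precision. The helper names `WeightBytes` and `DotProduct` are assumptions made for this example and are not taken from the Tesseract sources.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: bytes needed to store `count` weights of type Scalar.
template <typename Scalar>
constexpr std::size_t WeightBytes(std::size_t count) {
  return count * sizeof(Scalar);
}

// Hypothetical helper: dot product of two equally sized vectors, the basic
// operation of a neural network layer, in whichever precision Scalar selects.
template <typename Scalar>
Scalar DotProduct(const std::vector<Scalar>& a, const std::vector<Scalar>& b) {
  assert(a.size() == b.size());
  Scalar sum = 0;
  for (std::size_t i = 0; i < a.size(); ++i) {
    sum += a[i] * b[i];
  }
  return sum;
}
```

On typical platforms `sizeof(float)` is 4 and `sizeof(double)` is 8, so `WeightBytes<float>(n)` is half of `WeightBytes<double>(n)`. SIMD registers also hold twice as many `float` values as `double` values, which is one plausible source of the speedup.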

See also the list of all changes.

5.0.0-alpha-20210401

01 Apr 20:23
Pre-release

This is a new pre-release of Tesseract 5.0.0.

  • Replaced all remaining STRING by std::string
  • Replaced lots of GenericVector by std::vector
  • Replaced all malloc / free by C++ code
  • Modernized and formatted code
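
To show what replacements of this kind typically look like, here is a before/after sketch of the same routine. The function names `JoinLegacy` and `JoinModern` are hypothetical and the code is not taken from the Tesseract sources; it only illustrates the general move from manually managed buffers to `std::string` and `std::vector`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <string>
#include <vector>

// Before: C-style code. The caller owns the result and must call free().
char* JoinLegacy(const char* const* words, int count) {
  assert(words != nullptr || count == 0);
  std::size_t total = 1;  // room for the terminating '\0'
  for (int i = 0; i < count; ++i) {
    total += std::strlen(words[i]) + 1;  // word plus one separator
  }
  char* out = static_cast<char*>(std::malloc(total));
  if (out == nullptr) {
    return nullptr;  // allocation failed
  }
  out[0] = '\0';
  for (int i = 0; i < count; ++i) {
    if (i > 0) {
      std::strcat(out, " ");
    }
    std::strcat(out, words[i]);
  }
  return out;
}

// After: standard containers manage the memory, so there is nothing to free.
std::string JoinModern(const std::vector<std::string>& words) {
  std::string out;
  for (std::size_t i = 0; i < words.size(); ++i) {
    if (i > 0) {
      out += ' ';
    }
    out += words[i];
  }
  return out;
}
```

Both functions produce the same text, but only the legacy version can leak memory when a caller forgets to release the buffer.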

See also the list of all changes.

5.0.0-alpha-20201231

31 Dec 18:52
Pre-release

This is a new pre-release of Tesseract 5.0.0.

It has massive changes in the public API, which is a great step towards a final 5.0.0.
All unit tests pass, but because of those changes more practical experience is needed before a final release.

  • the public API no longer uses the proprietary data types GenericVector and STRING
  • pdf.ttf is no longer needed because it is now integrated into the code

See also the list of all changes.

5.0.0-alpha-20201224

24 Dec 08:37
Pre-release

This is a new pre-release of Tesseract 5.0.0.

It is considered production-ready for end users, but the API is not yet stable because more incompatible changes are planned.

  • improved performance (also on ARM / ARM64)
  • improved unit tests
  • many fixes
  • faster flat build with automake
  • support for latest macOS (including new M1 processor)

See also the list of all changes.

4.1.1 Release

26 Dec 15:43
  • Implemented sw build (cppan is deprecated)
  • Improved cmake build
  • Code cleanup and optimization
  • A lot of bug fixes...

4.1.0 Release

07 Jul 13:43
  • Added new renderers Alto, LSTMBox, WordStrBox.
  • Added character boxes in hOCR output.
  • Added Python training scripts (experimental) as an alternative to the shell scripts.
  • Better support for AVX / AVX2 / SSE.
  • Disable OpenMP support by default (see e.g. #1171, #1081).
  • Fix for bounding box problem.
  • Implemented support for whitelist/blacklist in LSTM engine.
  • Improved cmake configuration.
  • Code modernization and improvements.
  • A lot of bug fixes...
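
The new renderers and the LSTM whitelist can be combined in a single run. The sketch below assembles such an argument list, assuming the usual `tesseract IMAGE OUTPUTBASE [OPTIONS] [CONFIGFILE]` invocation, the `-c VAR=VALUE` option, the `tessedit_char_whitelist` variable, and the output configs `alto`, `lstmbox` and `wordstrbox`. The helper name `BuildTesseractArgs` and the file names in the usage note are hypothetical.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper: builds the argument list for one tesseract run.
// `whitelist` may be empty to allow all characters. `renderer` is the name
// of an output config such as "alto", "lstmbox" or "wordstrbox", or empty
// for the default text output.
std::vector<std::string> BuildTesseractArgs(const std::string& image,
                                            const std::string& output_base,
                                            const std::string& whitelist,
                                            const std::string& renderer) {
  assert(!image.empty() && !output_base.empty());
  std::vector<std::string> args = {"tesseract", image, output_base};
  if (!whitelist.empty()) {
    // -c sets a configuration variable for this run only.
    args.push_back("-c");
    args.push_back("tessedit_char_whitelist=" + whitelist);
  }
  if (!renderer.empty()) {
    // Config file names come last on the command line.
    args.push_back(renderer);
  }
  return args;
}
```

Called with `invoice.png`, `invoice`, `0123456789` and `alto`, this yields the arguments of `tesseract invoice.png invoice -c tessedit_char_whitelist=0123456789 alto`. Keeping the arguments as a list rather than one string avoids quoting problems when they are passed to a process-spawning call.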

A detailed changelog is on the wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

4.0.0 Release

29 Oct 09:00

Detailed release notes, changelog and documentation can be found in the project wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

3.05.02 Release

19 Jun 09:16

Bug fix release

3.05.01 Release

01 Jun 18:10

Bug fix release

3.05.00 Release

16 Feb 18:05
  • Fine-tuned the hOCR output.
  • Added TSV as another optional output format.
  • Fixed the ABI break introduced in 3.04.00 with the AnalyseLayout() method.
  • text2image tool: enabled all OpenType ligatures available in a font. This feature requires Pango 1.38 or newer.
  • Training tools: replaced asserts with tprintf() and exit(1).
  • Fixed Cygwin compatibility.
  • Improved multipage TIFF processing.
  • Improved the embedded PDF font (pdf.ttf).
  • Enabled selection of the OCR engine mode from the command line.
  • Changed the tesseract command line parameter '-psm' to '--psm'.
  • Added a new C API for orientation and script detection and removed the old one.
  • Increased the minimum autoconf version to 2.59.
  • Removed dead code.
  • Fixed many compiler warnings.
  • Fixed memory and resource leaks.
  • Fixed some issues with the 'Cube' OCR engine.
  • Fixed some OpenCL issues.
  • Added an option to build Tesseract with the CMake build system.
  • Implemented CPPAN support for easy Windows building.
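
The TSV output format mentioned above is easy to consume from other programs. The following is a minimal parsing sketch, assuming the 12-column layout `level, page_num, block_num, par_num, line_num, word_num, left, top, width, height, conf, text` with a header row first; the exact layout should be checked against the header of the file actually produced. The names `TsvRow`, `SplitTabs` and `ParseTsvRow` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <exception>
#include <string>
#include <vector>

// One data row of the TSV output (only the fields used in this sketch).
struct TsvRow {
  int level = 0;
  int left = 0, top = 0, width = 0, height = 0;
  double conf = 0.0;
  std::string text;
};

// Splits a line on tabs and keeps empty fields, because rows above word
// level carry an empty text column.
std::vector<std::string> SplitTabs(const std::string& line) {
  std::vector<std::string> fields;
  std::size_t start = 0;
  while (true) {
    const std::size_t tab = line.find('\t', start);
    if (tab == std::string::npos) {
      fields.push_back(line.substr(start));
      return fields;
    }
    fields.push_back(line.substr(start, tab - start));
    start = tab + 1;
  }
}

// Hypothetical helper: fills `row` and returns true for a data row; returns
// false for the header row and for malformed lines.
bool ParseTsvRow(const std::string& line, TsvRow* row) {
  assert(row != nullptr);
  const std::vector<std::string> f = SplitTabs(line);
  if (f.size() != 12 || f[0] == "level") {
    return false;
  }
  try {
    row->level = std::stoi(f[0]);
    row->left = std::stoi(f[6]);
    row->top = std::stoi(f[7]);
    row->width = std::stoi(f[8]);
    row->height = std::stoi(f[9]);
    row->conf = std::stod(f[10]);
  } catch (const std::exception&) {
    return false;  // a numeric column did not contain a number
  }
  row->text = f[11];
  return true;
}
```

A caller would typically read the file line by line, skip rows for which `ParseTsvRow` returns false, and keep only word-level rows with a confidence above some threshold of its choosing.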