Update Info

openSUSE-2023-130


Recommended update for tesseract-ocr


Type: recommended
Severity: moderate
Issued: 2023-06-15
Description:
This update for tesseract-ocr fixes the following issues:

update to 5.3.1

- Build AVX2 enabled hwcaps library for x86_64-v3

- Define TESSDATA_PREFIX during build to point at /usr/share
  (since it's the prefix) rather than package name, tessdata
  suffix is automatically added.

- Move unversioned libraries to main package 

- Update to version 5.3.0:

  * Fix memory issues in ScrollView::MessageReceiver
  * autotools: Add rule for svpaint executable
  * Replace call of exit function by return statement in main function
  * Fix the build on CodeQL/Analyze by @arseniy-sonar in #3888
  * CI: Remove Ubuntu 18.04
  * configure.ac: fix build on aarch64_be
  * SW CI: Add paths filter
  * Create .mailmap
  * Fix tesseract.pc from cmake to match autotools
  * Update README.md
  * Fixed 2 errors
  * fix issue #3940 - remove colormap before thresholding
  * Update upload-artifact action
  * Update checkout action to version 3
  * Fix Markdownlint
  * Fix broken links in CONTRIBUTING.md
  * pdfrenderer.cpp: Ignore non-text blocks
  * lstm.train: allow .box from .raw.png too
  * Fix a number of performance issues (reported by Coverity Scan)
  * Fix training tools for legacy engine (issue #3925)
  * Fix function tesseract::WriteFeature (issue #3925)
  * Modernize function ObjectCache::DeleteUnusedObjects (fix issue with s…
  * More fixes for issue #3925

- Fixed packaging to include missing shared libs:

  * libcommon_training.so
  * libunicharset_training.so

- Update to version 5.2.0:

  * Improvements and fixes for continuous integration, autoconf and cmake builds
  * Set /Os for some 32 bit MS compilers
  * Improve comments and other documentation
  * Add initial support for Intel AVX512F
  * Fix for very large PDF files on 32 bit hosts
  * Fix NEON detection on FreeBSD
  * Fix regression with UZN files
  * Fix calling delete[] for memory allocated by malloc in C API
  * Add an API function to init tesseract with traineddata from memory
  * Replace direct access to Leptonica internal data structures by function calls and
    support latest releases of Leptonica.
  * Replace std::regex by std::string functions.
  * Use compiled-in TESSDATA_PREFIX also on Windows
  * Add new parameter 'invert_threshold', change the default threshold from 0.5 to 0.7
    and mark parameter 'tessedit_do_invert' as deprecated

- Update to version 5.1.0:

  * Handle image and line regions in output formats ALTO, hOCR and text.
  * New parameter curl_timeout for curl_easy_setop.
  * Build fixes and improvements.
  * Catch nullptr in PageIterator::Orientation to improve robustness.
  * Remove unused code.

- Update to version 5.0.1:

  * Add SPDX-License-Identifier to public include files.
  * Support redirections when running OCR on a URL.
  * Lots of fixes and improvements for cmake builds.
  * Distributions should use the autoconf build.
  * Fix broken msys2 build with gcc 11.
  * Fix parameter certainty_scale (was duplicated).
  * Fix some compiler warnings and clean code.
  * Correctly detect amd64 and i386 on FreeBSD.
  * Add libarchive and libcurl in continuous integration actions.
  * Update submodule googletest to release v1.11.0.

- Update to version 5.0.0:

  * Enable fast float32 LSTM by default
  * Switch to NFC normalisation everywhere
  * Remove banner message
  * Disable music staff detection and removal
  * Add new command line option --loglevel
  * Fix regression for OCR with more than one model file
  * Optimizations
  * Improve training messages
  * Add RowAttributes getter to PageIterator
  * Limit BCER to interval [0,1]
  * Improved build process
  * Cleaned code

- Update to version 4.1.3:

  * Fix broken autoconf build

- Update to version 4.1.2:

  * Allow line images with larger width for training
  * Bugfixes
  * Build updates and fixes


              

References


No references

Packages


  • tesseract-ocr-5.3.1-bp155.3.3.1