Package Info

python-pytesseract


Python wrapper for Google's Tesseract-OCR


Development/Languages/Python

Python-tesseract is an optical character recognition (OCR) tool for Python, that is, it will recognize and "read" the text embedded in images.

Python-tesseract is a wrapper for Google's Tesseract-OCR Engine. It can be used as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including JPEG, PNG, GIF, BMP, TIFF, and others, whereas tesseract-ocr, by default, only supports TIFF and BMP. Additionally, if used as a script, python-tesseract will print the recognized text instead of writing it to a file. There is no support for confidence estimates and bounding box data is planned for future releases.


License: GPL-3.0-only
URL: https://github.com/madmaze/python-tesseract

Categories

Releases

Package Version Update ID Released Package Hub Version Platforms Subpackages
0.2.0-bp151.1.3 info GA Release 2019-07-17 15 SP1
  • AArch64
  • ppc64le
  • s390x
  • x86-64
  • python2-pytesseract
  • python3-pytesseract
0.3.0-bp152.1.9 info GA Release 2020-04-17 15 SP2
  • AArch64
  • ppc64le
  • s390x
  • x86-64
  • python2-pytesseract
  • python3-pytesseract
0.3.0-bp153.1.19 info GA Release 2021-03-06 15 SP3
  • AArch64
  • ppc64le
  • s390x
  • x86-64
  • python2-pytesseract
  • python3-pytesseract