Python-tesseract is an optical character recognition (OCR) tool for Python: it recognizes and "reads" the text embedded in images.
Python-tesseract is a wrapper for Google's Tesseract-OCR Engine. It is also useful as a stand-alone invocation script for tesseract, because it can read all image types supported by the Python Imaging Library, including JPEG, PNG, GIF, BMP, TIFF, and others, whereas tesseract-ocr, by default, supports only TIFF and BMP. Additionally, when used as a script, python-tesseract prints the recognized text instead of writing it to a file. Confidence estimates and bounding box data are not supported; support for them is planned for future releases.
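The wrapper idea described above can be sketched roughly as follows. This is a minimal illustration, not the package's own code: the helper names `build_tesseract_command` and `ocr_image` are hypothetical, the file name `scan.png` is a placeholder, and the sketch assumes that `pytesseract`, Pillow, and the `tesseract` binary are all installed.

```python
def build_tesseract_command(image_path, lang="eng"):
    """Hypothetical helper: argv for a tesseract CLI call that prints to stdout."""
    # Passing "stdout" as the output base makes tesseract print the recognized
    # text instead of writing it to a file; "-l" selects the language.
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Hypothetical helper: recognize text in any image format Pillow can open."""
    # Imported lazily so this module still loads where the packages are absent.
    from PIL import Image
    import pytesseract

    # Pillow decodes the file (JPEG, PNG, GIF, ...); pytesseract hands the
    # decoded image to the tesseract engine and returns the text as a string.
    with Image.open(image_path) as image:
        return pytesseract.image_to_string(image, lang=lang)


if __name__ == "__main__":
    # Placeholder path; replace with a real image file.
    print(build_tesseract_command("scan.png"))
    # print(ocr_image("scan.png"))  # requires tesseract and an actual image
```

Calling `ocr_image("scan.png")` would return the recognized text as a string, which mirrors the script behaviour of printing text rather than writing an output file.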
Package Version | Update ID | Released | Package Hub Version |
---|---|---|---|
0.3.10-bp156.2.1 | GA Release | 2023-07-22 | 15 SP6 |
0.3.10-bp155.1.5 | GA Release | 2023-05-22 | 15 SP5 |
0.3.0-bp154.1.43 | GA Release | 2022-05-12 | 15 SP4 |
0.3.0-bp154.1.42 | GA Release | 2022-05-09 | 15 SP4 |
0.3.0-bp153.1.19 | GA Release | 2021-03-06 | 15 SP3 |
0.3.0-bp152.1.9 | GA Release | 2020-04-17 | 15 SP2 |
0.2.0-bp151.1.3 | GA Release | 2019-07-17 | 15 SP1 |