Popularity
7.2
Growing
Activity
7.2
Declining
2,257
73
357

Description

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded in images.

Python-tesseract is a wrapper for google's Tesseract-OCR ( http://code.google.com/p/tesseract-ocr/ ). It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including jpeg, png, gif, bmp, tiff, and others, whereas tesseract-ocr by default only supports tiff and bmp. Additionally, if used as a script, Python-tesseract will print the recognized text in stead of writing it to a file. Support for confidence estimates and bounding box data is planned for future releases.

USAGE:

Code Quality Rank: L5
Programming language: Python
License: GNU General Public License v3.0 only
Tags: OCR    

pytesseract alternatives and similar packages

Based on the "OCR" category

Do you think we are missing an alternative of pytesseract or a related project?

Add another 'OCR' Package