Popularity
8.1
Stable
Activity
7.7
Growing
5,495
108
693

Description

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded in images.

Python-tesseract is a wrapper for google's Tesseract-OCR ( http://code.google.com/p/tesseract-ocr/ ). It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including jpeg, png, gif, bmp, tiff, and others, whereas tesseract-ocr by default only supports tiff and bmp. Additionally, if used as a script, Python-tesseract will print the recognized text in stead of writing it to a file. Support for confidence estimates and bounding box data is planned for future releases.

USAGE:

Code Quality Rank: L5
Programming language: Python
License: Apache License 2.0
Tags: OCR    
Latest version: v0.3.7

pytesseract alternatives and similar packages

Based on the "OCR" category.
Alternatively, view pytesseract alternatives based on common mentions on social networks and blogs.

Do you think we are missing an alternative of pytesseract or a related project?

Add another 'OCR' Package