Popularity
6.6
Growing
Activity
7.7
Growing
1,566
62
256

Description

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded in images.

Python-tesseract is a wrapper for google's Tesseract-OCR ( http://code.google.com/p/tesseract-ocr/ ). It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including jpeg, png, gif, bmp, tiff, and others, whereas tesseract-ocr by default only supports tiff and bmp. Additionally, if used as a script, Python-tesseract will print the recognized text in stead of writing it to a file. Support for confidence estimates and bounding box data is planned for future releases.

USAGE:

Code Quality Rank: L5
Programming language: Python
License: GNU General Public License v3.0 only
Tags: OCR    

pytesseract alternatives and related packages

Based on the "OCR" category

Do you think we are missing an alternative of pytesseract or a related project?

Add another 'OCR' Package

pytesseract Recommendations

There are no recommendations yet. Be the first to promote pytesseract!

Have you used pytesseract? Share your experience. Write a short recommendation and pytesseract, you and your project will be promoted on Awesome Python.
Recommend pytesseract

Recently added pytesseract resources

Do you know of a usefull tutorial, book or news relevant to pytesseract?
Be the first to add one!