Python: Install Tesseract for Windows 7

I just tried to set up pytesseract and it works ! I have windows 10 and python 2.7 installed.

all you need to do :

  1. Download Visual basic C++ from http://aka.ms/vcpython27 and install it (common installation step)
  2. Download tesseract from python via this link https://pypi.python.org/pypi/pytesseract

  3. Unizip the file.

  4. Go to the directory which contains the unizip file

  5. Run this command " python setup.py install "

  6. (Additional) to test if it's installed, go to your python shell and run this command " import pytesseract "

I hope it works !! Note pytesseract is google based OCR, it works similarly to tesseract.


Step [1] To install tesseract kindly visit

https://github.com/UB-Mannheim/tesseract/wiki

The latest installers can be downloaded from here: e.g., tesseract-ocr-setup-3.05.02-20180621.exe, tesseract-ocr-w32-setup-v4.0.0-beta.1.20180608.exe, tesseract-ocr-w64-setup-v4.0.0-beta.1.20180608.exe (64 bit)

Step [2] Download Microsoft Visual C++ Compiler for Python 2.7 from the link given below https://download.microsoft.com/download/7/9/6/796EF2E4-801B-4FC4-AB28-B59FBF6D907B/VCForPython27.msi

Step [3] Install pytesseract for binding for tesseract using pip

pip install pytesseract

Step [4] Furthermore you can install an image processing library in python, e.g., pillow:

pip install pillow

greetings!! you are done!! :)

Tags:

Python

Ocr