Skip to content

Latest commit

 

History

History
41 lines (25 loc) · 782 Bytes

README.md

File metadata and controls

41 lines (25 loc) · 782 Bytes

OCR Image Processing

Read out the captcha

Author: Lin Dong

Date: July 22nd, 2016

Prerequisites

OS: Mac

brew install imagemagick
brew install tesseract --all-languages

brew install python2
pip install Pillow
pip install pytesseract

Instructions

Just run: python main.py

  • iteration.py: iterate multiple times of original image
  • convert_to_text.py: read out the string from white-gray image,

Screenshots

Progressive Iterations:

Iteration 1:

Iteration 2:

Iteration 3:

References

  1. python 识别验证码
  2. OCR on OS X with tesseract Raw