python 圖像識別

時間 2019-11-20

標籤 python 圖像識別欄目 Python 简体版

原文原文鏈接

這是一個最簡單的圖像識別，將圖片加載後直接利用Python的一個識別引擎進行識別

將圖片中的數字經過 pytesseract.image_to_string(image) 識別後將結果存入到本地的txt文件中

 1 #-*-encoding:utf-8-*-
 2 import pytesseract
 3 from PIL import Image
 4 
 5 class GetImageDate(object):
 6     def m(self):
 7         image = Image.open(u"a.png")
 8         text = pytesseract.image_to_string(image)
 9         return text
10 
11     def SaveResultToDocument(self):
12         text = self.m()
13         f = open(u"Verification.txt","w")
14         print text
15         f.write(str(text))
16         f.close()
17 
18 g = GetImageDate()
19 g.SaveResultToDocument()

具體想要實現上面的代碼須要安裝兩個包和一個引擎

在安裝以前須要先安裝好Python，pip並配置好環境變量

全部包的安裝都是經過pip來安裝的，須要在windows PowerShell中進行，而且是在 C:\Python27\Scripts目錄下

1.第一個包： pytesseract

pip install pytesseract

如果出現安裝錯誤的狀況，安裝不了的時候，能夠將命令改成 pip.exe install pytesseract來安裝

如果將pip修改成pip.exe安裝成功後，那麼下文的全部pip都須要改成pip.exe

2.第二個包：PIL安裝

pip install PIL

如果失敗了能夠以下修改 pip install PILLOW

3.安裝識別引擎tesseract-ocr

https://github.com/tesseract-ocr/tesseract