python中文ocr方案-pytesseract

時間 2019-11-21

標籤 python 中文 ocr 方案 pytesseract 欄目 Python 简体版

原文原文鏈接

pytesseract是google維護的具備學習功能的OCR引擎，3.0之後支持中文識別。學習

安裝：google

1. 安裝tesseract-ocr組件；記得同步下載簡體中文與英文語言包。調試

2. 安裝PIL，需注意Windows64位版本code

3. pip install pytesseract圖片

使用:ip

image = Image.open("1.jpg")  # 打開圖片image.load()  # 加載一下圖片，防止報錯，此處可省略image.show()  # 調用show來展現圖片，調試用，可省略tessdata_dir_config = '--tessdata-dir "C:\\Program Files (x86)\\Tesseract-OCR\\tessdata"'vcode = pytesseract.image_to_string(image, lang='chi_sim', config=tessdata_dir_config)print vcode

1. tesseract-OCR + pytesseract安裝
2. Python 進行 OCR識別 -- pytesseract庫
3. Python 中文OCR
4. python中pytesseract的安裝
5. Python3.6 利用Tesseract進行中英文圖像識別之 PIL,pytesseract,tesseract-ocr安裝
6. mac python 配置pytesseract
7. android中ocr解決方案（tesseract）
8. python使用pytesseract識別圖片中的文字
9. [python] python3.6 安裝 pytesseract 出錯
10. Python - pytesseract 機器視覺
更多相關文章...
• SQLite - Python - SQLite教程
• R 繪圖 - 中文支持 - R 語言教程
• SpringBoot中properties文件不能自動提示解決方法
• Scala 中文亂碼解決

相關標籤/搜索