簡介:html
OCR(Optical Character Recognition):光學字符識別,是指對圖片文件中的文字進行分析識別,獲取的過程。ios
Tesseract:開源的OCR識別引擎,初期Tesseract引擎由HP實驗室研發,後來貢獻給了開源軟件業,後經由Google進行改進,消除bug,優化,從新發布。git
連接:github
xcode
Q&A:url
Q一、關於語言庫錯誤:spa
Error opening data file /usr/local/share/tessdata/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory. Failed loading language eng Tesseract couldn't load any languages! Could not initialize tesseract.
A1:找到語言包以後拖到項目就行,關鍵是拖得時候注意這一點:「Make sure you select the "Create folder references" option, when adding the tessdata folder to your project」,詳細解決方案看這裏code