環境vs 2019 .Net 4.8
新建一個wpf工程,拖放上一個button一個textbox
nuget下載tesseract,版本信息如圖所示
MainWindow.xaml.cs文件代碼以下:git
using Microsoft.Win32; using System.Windows; using Tesseract; namespace TestOCR { /// <summary> /// Interaction logic for MainWindow.xaml /// </summary> public partial class MainWindow : Window { public MainWindow() { InitializeComponent(); } private void Button_Click(object sender, RoutedEventArgs e) { OpenFileDialog ofd = new OpenFileDialog(); if (ofd.ShowDialog() == true) { var pix = Pix.LoadFromFile(ofd.FileName); var ocr = new TesseractEngine("./tessdata", "eng",EngineMode.TesseractAndCube); var page = ocr.Process(pix); textbox.Text = page.GetText(); textbox.FontSize = 30; } } } }
去下載tessdata訓練文件:
https://github.com/tesseract-ocr/tesseract/wiki/Data-Files
下載後解壓,而後一層層文件夾進去找到「tessdata」文件夾,把該文件夾放在wpf工程bin/debug目錄裏:
而後準備一個帶文字的圖片,測試:
github