PDF開發工具Spire.PDF 教程：使用C＃從PDF中的特定矩形區域中提取文本

時間 2019-11-13

標籤 pdf 開發工具 spire.pdf spire 教程使用 c# 特定矩形區域提取文本欄目 C# 简体版

原文原文鏈接

使用Spire.PDF，程序員能夠從PDF文檔中的特定矩形區域提取文本，本文演示如何使用Spire.PDF和C＃實現此功能。程序員

示例文件：ui

詳細步驟：對象

Step 1: 初始化PdfDocument類的對象並加載PDF文件。blog

PdfDocument pdf = new PdfDocument();
pdf.LoadFromFile("Stories.pdf");

Step 2: 獲取第一頁。圖片

PdfPageBase page = pdf.Pages[0];

Step 3: 從頁面中的特定矩形區域中提取文本，以後將文本保存爲.txt文件。ci

string text = page.ExtractText(new RectangleF(50, 50, 500, 100) );
StringBuilder sb = new StringBuilder();
sb.AppendLine(text);
File.WriteAllText("Extract.txt", sb.ToString());

輸出：文檔

完整代碼：get

//Initialize an object of PdfDocument class
PdfDocument pdf = new PdfDocument();
//Load the PDF file
pdf.LoadFromFile("Stories.pdf");

//Get the first page
PdfPageBase page = pdf.Pages[0];

// Extract text from a specific rectangular area within the page
string text = page.ExtractText(new RectangleF(50, 50, 500, 100) );

//Save the text to a .txt file
StringBuilder sb = new StringBuilder();
sb.AppendLine(text);
File.WriteAllText("Extract.txt", sb.ToString());

相關標籤/搜索