jsoup( HTML parser ) cookbook

時間 2019-11-09

標籤 jsoup html parser cookbook 欄目 Java 简体版

原文原文鏈接

Introduction

Parsing and traversing a Document

Input

Parse a document from a String

Parsing a body fragment

Load a Document from a URL

Load a Document from a File

Extracting data

Use DOM methods to navigate a document

Use selector-syntax to find elements

Extract attributes, text, and HTML from elements

Working with URLs

Example program: list links

Modifying data

Set attribute values

Set the HTML of an element

Setting the text content of elements

Cleaning HTML

Sanitize untrusted HTML (to prevent XSS)

Cookbook 目錄

入門

解析和遍歷一個html文檔

輸入

解析一個html字符串

解析一個body片段

從一個URL加載一個Document對象

根據一個文件加載Document對象

數據抽取

使用dom方法來遍歷一個Document對象

使用選擇器語法來查找元素

從元素集合抽取屬性、文本和html內容

URL處理

程序示例：獲取全部連接

數據修改

設置屬性值

設置元素的html內容

設置元素的文本內容

HTML清理

消除不受信任的html (來防止xss攻擊)

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。