After parsing a document, and finding some elements, you'll want to get at the data inside those elements.html
To get the value of an attribute, use the Node.attr(String key)
methodjava
For the text on an element (and its combined children), use Element.text()
node
For HTML, use Element.html()
, or Node.outerHtml()
as appropriateapi
For example:app
String html = "<p>An <a href='http://example.com/'><b>example</b></a> link.</p>"; Document doc = Jsoup.parse(html); Element link = doc.select("a").first(); String text = doc.body().text(); // "An example link" String linkHref = link.attr("href"); // "http://example.com/" String linkText = link.text(); // "example"" String linkOuterH = link.outerHtml(); // "<a href="http://example.com"><b>example</b></a>" String linkInnerH = link.html(); // "<b>example</b>"
The methods above are the core of the element data access methods. There are additional others:ide
All of these accessor methods have corresponding setter methods to change the data.orm