Getting Started with Jsoup

1. What is Jsoup?

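jsoup is an HTML parser for Java that can parse a URL or a string of HTML text directly. It provides a very convenient API for extracting and manipulating data through DOM methods, CSS selectors, and jQuery-like operations.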
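As a rough illustration of the two entry points described above, the sketch below parses an in-memory HTML string and also shows how a URL could be fetched directly. The class name `JsoupQuickStart`, the sample markup, and the 10-second timeout are assumptions made up for this example; only `Jsoup.parse`, `Jsoup.connect`, `Document.title`, and `select` are jsoup calls.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class JsoupQuickStart {

    // Hypothetical sample markup, made up for this illustration
    static final String SAMPLE_HTML =
            "<html><head><title>Sample page</title></head><body>"
          + "<div id=\"nav\"><a class=\"lnk\" href=\"/a.html\">First</a>"
          + "<a class=\"lnk\" href=\"/b.html\">Second</a></div>"
          + "</body></html>";

    // Parse HTML text that is already in memory and read the <title>
    static String titleOf(String html) {
        Document doc = Jsoup.parse(html);
        return doc.title();
    }

    // Use a CSS selector (jQuery-like) to collect the text of matching links
    static List<String> linkTexts(String html) {
        List<String> texts = new ArrayList<>();
        for (Element a : Jsoup.parse(html).select("#nav a.lnk")) {
            texts.add(a.text());
        }
        return texts;
    }

    // Fetch and parse a URL directly; needs network access, so main does not call it
    static Document fetch(String url) throws IOException {
        return Jsoup.connect(url).timeout(10000).get();
    }

    public static void main(String[] args) {
        System.out.println(titleOf(SAMPLE_HTML));
        System.out.println(linkTexts(SAMPLE_HTML));
    }
}
```

With `Jsoup.connect`, a separate HTTP client is optional; whether to use one is a design choice that depends on how much control over the request is needed.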

2. pom.xml

 

<!-- Add HttpClient support -->
  	<dependency>
	    <groupId>org.apache.httpcomponents</groupId>
	    <artifactId>httpclient</artifactId>
	    <version>4.5.2</version>
	</dependency>
	
	<!-- Add Jsoup support -->
	<dependency>
	    <groupId>org.jsoup</groupId>
	    <artifactId>jsoup</artifactId>
	    <version>1.10.3</version>
	</dependency>


3. Code

 

 

package com.rye;

import java.util.Iterator;

import org.apache.http.HttpEntity;
import org.apache.http.client.methods.CloseableHttpResponse;
import org.apache.http.client.methods.HttpGet;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.client.HttpClients;
import org.apache.http.util.EntityUtils;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class Demo4 {

	public static void main(String[] args) throws Exception{
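        String content; // page content
        // try-with-resources closes the client and the response and releases the connection
        try (CloseableHttpClient httpclient = HttpClients.createDefault()) { // create the HttpClient instance
            HttpGet httpget = new HttpGet("http://www.cnblogs.com/"); // create the GET request for cnblogs
            try (CloseableHttpResponse response = httpclient.execute(httpget)) { // execute the GET request
                HttpEntity entity = response.getEntity(); // get the response entity
                content = EntityUtils.toString(entity, "utf-8"); // read the body as a UTF-8 string
            }
        }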
        
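        Document doc = Jsoup.parse(content); // parse the page content into a Document
        Elements els = doc.getElementsByTag("title"); // get all elements with the given tag name
        Element e = els.get(0); // take the first element
        String text = e.text(); // the element's text
        String html = e.html(); // the element's inner HTML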
        System.out.println("text:"+text);
        System.out.println("html:"+html);
        System.out.println("========================");
        
        Element e2 = doc.getElementById("site_nav_top"); // get the element with the given id (null if absent)
        String text2 = e2.text(); // get its text
        System.out.println(text2);
        System.out.println("========================");
        
        Elements els3 = doc.getElementsByClass("titlelnk"); // look up elements by CSS class name
        Iterator<Element> it3 = els3.iterator(); // typed iterator, so no cast is needed
        while(it3.hasNext()) {
        	Element el3 = it3.next();
        	String text3 = el3.text();
        	System.out.println(text3);
        }
        System.out.println("========================");
        
        Elements els4 = doc.getElementsByAttributeValue("width","48"); // elements whose width attribute equals 48
        for(Element e4:els4) {
        	System.out.println(e4.toString());
        }
        System.out.println("========================");
        
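        // Look up elements with a CSS selector
        Elements els5 = doc.select(".post_item .post_item_body h3 a");
        for(Element e5:els5) {
        	System.out.println("標題:"+e5.text()); // label means "Title"
        	System.out.println("博客地址:"+e5.attr("href")); // label means "Blog URL"; attr() returns the attribute value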
        }
        System.out.println("========================");
        
        Elements els6 = doc.select("a[href]"); // <a> elements that have an href attribute
        for(Element e6:els6) {
        	System.out.println("a[href]:"+e6.toString());
        }
        System.out.println("========================");
        
        Elements els7 = doc.select("img[src$=.png]"); // <img> elements whose src attribute ends with .png
        for(Element e7:els7) {
        	System.out.println("img[src$=.png]:"+e7.toString());
        }
        System.out.println("========================");
        
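        Element linkElement = doc.select("#friend_link").first(); // first element of the result set (null if empty)
        System.out.println("純文本:"+linkElement.text()); // label means "Plain text"; text only, all HTML tags stripped
        System.out.println("HTML:"+linkElement.html()); // inner HTML, including all child elements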
	}
}
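The listing above calls `text()` on the results of `getElementById` and `first()` without checking them, and both return null when nothing matches. It also prints `href` values as written in the page, so relative links such as `/pick/` stay relative. The sketch below shows one way to handle both points. The class name `SafeLookupDemo`, the helper names, and the sample markup are hypothetical; `Jsoup.parse(html, baseUri)` and `absUrl` are existing jsoup calls.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class SafeLookupDemo {

    // Hypothetical sample markup and base URI, chosen for this illustration
    static final String SAMPLE_HTML = "<div id=\"nav\"><a href=\"/pick/\">Picks</a></div>";
    static final String BASE_URI = "http://www.cnblogs.com/";

    // Return the text of the element with the given id, or "" when it is missing
    static String textById(String html, String id) {
        Document doc = Jsoup.parse(html);
        Element el = doc.getElementById(id); // null when no element has this id
        return el == null ? "" : el.text();
    }

    // Resolve the first link against the base URI, or return "" when there is no link
    static String firstAbsoluteHref(String html, String baseUri) {
        Document doc = Jsoup.parse(html, baseUri); // the base URI is used to resolve relative links
        Element link = doc.select("a[href]").first(); // null when the selection is empty
        return link == null ? "" : link.absUrl("href");
    }

    public static void main(String[] args) {
        System.out.println(textById(SAMPLE_HTML, "nav"));
        System.out.println(textById(SAMPLE_HTML, "missing").isEmpty());
        System.out.println(firstAbsoluteHref(SAMPLE_HTML, BASE_URI));
    }
}
```

Returning an empty string for a missing element is only one option; throwing an exception or returning an `Optional` may suit a real crawler better.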

 

 

Output:

 

text:博客園 - 開發者的網上家園
html:博客園 - 開發者的網上家園
========================
代碼改變世界
========================
建立 overlay 網絡 - 天天5分鐘玩轉 Docker 容器技術(50)
Javascript跨域後臺設置攔截
Jenkins發佈MVC應用程序
一步一步學習Vue(十一)
關於「工業互聯網平臺」建設,俺有本要奏
hexo博客MathJax公式渲染問題
使用 Router 實現的模塊化,如何優雅的回到主頁面
C#多線程之旅(7)——終止線程
Android異步處理技術
Docker Machine
BFC(Block Formatting Context)基礎分析
python函數(6):內置函數和匿名函數
Spring-Framework 源碼閱讀之@Autowired和AutowiredAnnotationBeanPostProcessor
你可能須要爲你的APP適配iOS11
Mybatis框架分析
細談字符串及其格式化表達式
[Android FrameWork 6.0源碼學習] View的重繪過程之Layout
Android Binder機制詳解:手寫IPC通訊
Socket 整理以及CocoaAsyncSocket、SRWebSocket源碼解析(一)
【機器學習】反向傳播算法 BP
========================
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/u234895.jpg?id=28112946" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt="">
========================
標題:建立 overlay 網絡 - 天天5分鐘玩轉 Docker 容器技術(50)
博客地址:http://www.cnblogs.com/CloudMan6/p/7280787.html
標題:Javascript跨域後臺設置攔截
博客地址:http://www.cnblogs.com/woshimrf/p/js-cors.html
標題:Jenkins發佈MVC應用程序
博客地址:http://www.cnblogs.com/5ishare/p/7282493.html
標題:一步一步學習Vue(十一)
博客地址:http://www.cnblogs.com/Johnzhang/p/7282584.html
標題:關於「工業互聯網平臺」建設,俺有本要奏
博客地址:http://www.cnblogs.com/lsjwq/p/7282220.html
標題:hexo博客MathJax公式渲染問題
博客地址:http://www.cnblogs.com/Ai-heng/p/7282110.html
標題:使用 Router 實現的模塊化,如何優雅的回到主頁面
博客地址:http://www.cnblogs.com/plokmju/p/7282113.html
標題:C#多線程之旅(7)——終止線程
博客地址:http://www.cnblogs.com/jackson0714/p/AbortThread.html
標題:Android異步處理技術
博客地址:http://www.cnblogs.com/hustzhb/p/7278113.html
標題:Docker Machine
博客地址:http://www.cnblogs.com/shoufengwei/p/7281873.html
標題:BFC(Block Formatting Context)基礎分析
博客地址:http://www.cnblogs.com/asheng2016/p/7281784.html
標題:python函數(6):內置函數和匿名函數
博客地址:http://www.cnblogs.com/liluning/p/7280832.html
標題:Spring-Framework 源碼閱讀之@Autowired和AutowiredAnnotationBeanPostProcessor
博客地址:http://www.cnblogs.com/liferecord/p/7281655.html
標題:你可能須要爲你的APP適配iOS11
博客地址:http://www.cnblogs.com/wetest/p/7281536.html
標題:Mybatis框架分析
博客地址:http://www.cnblogs.com/lizo/p/7281441.html
標題:細談字符串及其格式化表達式
博客地址:http://www.cnblogs.com/Jeffding/p/7277637.html
標題:[Android FrameWork 6.0源碼學習] View的重繪過程之Layout
博客地址:http://www.cnblogs.com/kezhuang/p/7280746.html
標題:Android Binder機制詳解:手寫IPC通訊
博客地址:http://www.cnblogs.com/DoNetCoder/p/7280481.html
標題:Socket 整理以及CocoaAsyncSocket、SRWebSocket源碼解析(一)
博客地址:http://www.cnblogs.com/taoxu/p/7064103.html
標題:【機器學習】反向傳播算法 BP
博客地址:http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html
========================
a[href]:<a href="https://www.cnblogs.com/" title="開發者的網上家園"><img src="/images/logo_small.gif" alt="博客園Logo" width="142" height="55"></a>
a[href]:<a href="https://home.cnblogs.com/">園子</a>
a[href]:<a href="https://news.cnblogs.com">新聞</a>
a[href]:<a href="https://q.cnblogs.com/">博問</a>
a[href]:<a href="https://ing.cnblogs.com/">閃存</a>
a[href]:<a href="https://group.cnblogs.com/">小組</a>
a[href]:<a href="https://wz.cnblogs.com/">收藏</a>
a[href]:<a href="https://job.cnblogs.com/">招聘</a>
a[href]:<a href="http://kb.cnblogs.com/">知識庫</a>
a[href]:<a href="http://zzk.cnblogs.com/">找找看</a>
a[href]:<a href="/" class="current_nav">首頁</a>
a[href]:<a href="/pick/" title="編輯精選博文">精華</a>
a[href]:<a href="/candidate/" title="候選區的博文">候選</a>
a[href]:<a href="/news/" title="新聞頻道最新新聞">新聞</a>
a[href]:<a href="/following" title="我關注博客的最新博文">關注</a>
a[href]:<a href="/aggsite/mycommented" title="我評論過的博文">我評</a>
a[href]:<a href="/aggsite/mydigged" title="我推薦過的博文">我贊</a>
a[href]:<a id="posts_refresh" href="#" class="refresh" title="刷新博文列表" onclick="aggSite.loadCategoryPostList();return false">刷新</a>
a[href]:<a href="http://feed.cnblogs.com/blog/sitehome/rss"><img src="//common.cnblogs.com/images/icon_rss.gif" alt="點擊訂閱" style="position:relative;top:2px;" title="訂閱博客園文章"></a>
a[href]:<a href="http://www.cnblogs.com/liwanliangblog/p/7275462.html" id="editor_pick_lnk" target="_blank">【編輯推薦】細思極恐:被軟鏈接拯救的數據<span id="editor_pick_count"></span></a>
a[href]:<a href="/aggsite/headline" title="查看更多編輯推薦" class="right_more">»</a>
a[href]:<a href="http://www.cnblogs.com/sxs161028/p/7278219.html" target="_blank" title="閱讀1644, 評論39, 推薦28">[最多推薦]css簡單實現五角星評分、點贊收藏、展現評分(半顆星、1/3顆星)(39/1644)</a>
a[href]:<a href="/aggsite/topdigged24h" title="查看24小時推薦排行" class="right_more">»</a>
a[href]:<a href="http://www.cnblogs.com/mafly/p/allhttps.html" target="_blank" title="閱讀1587, 評論8, 推薦16">[最多評論]全站 HTTPS 沒你想象的那麼簡單(8/1587)</a>
a[href]:<a href="/aggsite/topcommented24h" title="查看24小時評論排行" class="right_more">»</a>
a[href]:<a href="http://news.cnblogs.com/n/575248/" target="_blank" title="閱讀803, 評論1, 推薦5">[新聞頭條]中國高鐵正式進入350千米時代!(1/803)</a>
a[href]:<a href="http://news.cnblogs.com/" title="查看更多新聞" class="right_more">»</a>
a[href]:<a href="http://news.cnblogs.com/n/575224/" target="_blank" title="閱讀1286, 評論11, 推薦7">[推薦新聞]Windows 10 Build 16257發佈:支持眼球交互操做(11/1286)</a>
a[href]:<a href="http://news.cnblogs.com/n/recommend" title="查看更多推薦新聞" class="right_more">»</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/CloudMan6/p/7280787.html" target="_blank">建立 overlay 網絡 - 天天5分鐘玩轉 Docker 容器技術(50)</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/" class="lightblue">CloudMan</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/p/7280787.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/p/7280787.html" class="gray">閱讀(12)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/woshimrf/p/js-cors.html" target="_blank">Javascript跨域後臺設置攔截</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/" class="lightblue">Ryan.Miao</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/p/js-cors.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/p/js-cors.html" class="gray">閱讀(37)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/5ishare/p/7282493.html" target="_blank">Jenkins發佈MVC應用程序</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/5ishare/" class="lightblue">社會主義接班人</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/p/7282493.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/p/7282493.html" class="gray">閱讀(58)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Johnzhang/p/7282584.html" target="_blank">一步一步學習Vue(十一)</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/" class="lightblue">JerremyZhang</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/p/7282584.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/p/7282584.html" class="gray">閱讀(66)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/lsjwq/p/7282220.html" target="_blank">關於「工業互聯網平臺」建設,俺有本要奏</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/" class="lightblue">惟笑志在</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/p/7282220.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/p/7282220.html" class="gray">閱讀(106)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Ai-heng/p/7282110.html" target="_blank">hexo博客MathJax公式渲染問題</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/" class="lightblue">灬習慣丶悸沫</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/p/7282110.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/p/7282110.html" class="gray">閱讀(35)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/plokmju/p/7282113.html" target="_blank">使用 Router 實現的模塊化,如何優雅的回到主頁面</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/u234895.jpg?id=28112946" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/plokmju/" class="lightblue">承香墨影</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/p/7282113.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/p/7282113.html" class="gray">閱讀(64)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/jackson0714/p/AbortThread.html" target="_blank">C#多線程之旅(7)——終止線程</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/" class="lightblue">jackson0714</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/p/AbortThread.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/p/AbortThread.html" class="gray">閱讀(101)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/hustzhb/p/7278113.html" target="_blank">Android異步處理技術</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/" class="lightblue">行歌</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/p/7278113.html#commentform" title="2017-08-03 21:51" class="gray"> 評論(1)</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/p/7278113.html" class="gray">閱讀(42)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/shoufengwei/p/7281873.html" target="_blank">Docker Machine</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/" class="lightblue">shoufengwei</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/p/7281873.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/p/7281873.html" class="gray">閱讀(40)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/asheng2016/p/7281784.html" target="_blank">BFC(Block Formatting Context)基礎分析</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/" class="lightblue">肆無忌憚sheng</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/p/7281784.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/p/7281784.html" class="gray">閱讀(48)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/liluning/p/7280832.html" target="_blank">python函數(6):內置函數和匿名函數</a>
a[href]:<a href="http://www.cnblogs.com/liluning/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/liluning/" class="lightblue">布吉島丶</a>
a[href]:<a href="http://www.cnblogs.com/liluning/p/7280832.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/liluning/p/7280832.html" class="gray">閱讀(51)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/liferecord/p/7281655.html" target="_blank">Spring-Framework 源碼閱讀之@Autowired和AutowiredAnnotationBeanPostProcessor</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/liferecord/" class="lightblue">vOoT</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/p/7281655.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/p/7281655.html" class="gray">閱讀(56)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/wetest/p/7281536.html" target="_blank">你可能須要爲你的APP適配iOS11</a>
a[href]:<a href="http://www.cnblogs.com/wetest/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/wetest/" class="lightblue">騰訊WeTest</a>
a[href]:<a href="http://www.cnblogs.com/wetest/p/7281536.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/wetest/p/7281536.html" class="gray">閱讀(97)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/lizo/p/7281441.html" target="_blank">Mybatis框架分析</a>
a[href]:<a href="http://www.cnblogs.com/lizo/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/lizo/" class="lightblue">atheva</a>
a[href]:<a href="http://www.cnblogs.com/lizo/p/7281441.html#commentform" title="2017-08-03 20:22" class="gray"> 評論(1)</a>
a[href]:<a href="http://www.cnblogs.com/lizo/p/7281441.html" class="gray">閱讀(166)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Jeffding/p/7277637.html" target="_blank">細談字符串及其格式化表達式</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/" class="lightblue">JeffD</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/p/7277637.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/p/7277637.html" class="gray">閱讀(73)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/kezhuang/p/7280746.html" target="_blank">[Android FrameWork 6.0源碼學習] View的重繪過程之Layout</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/" class="lightblue">柯壯</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/p/7280746.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/p/7280746.html" class="gray">閱讀(94)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/DoNetCoder/p/7280481.html" target="_blank">Android Binder機制詳解:手寫IPC通訊</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/" class="lightblue">zhoumy</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/p/7280481.html#commentform" title="2017-08-03 17:03" class="gray"> 評論(1)</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/p/7280481.html" class="gray">閱讀(147)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/taoxu/p/7064103.html" target="_blank">Socket 整理以及CocoaAsyncSocket、SRWebSocket源碼解析(一)</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/taoxu/" class="lightblue">Zxiao..旭</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/p/7064103.html#commentform" title="" class="gray"> 評論(0)</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/p/7064103.html" class="gray">閱讀(96)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html" target="_blank">【機器學習】反向傳播算法 BP</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/" class="lightblue">KID_XiaoYuan</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html#commentform" title="2017-08-03 16:22" class="gray"> 評論(2)</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html" class="gray">閱讀(203)</a>
a[href]:<a href="/" class="p_1 current" onclick="aggSite.loadCategoryPostList(1,20);buildPaging(1);return false;">1</a>
a[href]:<a href="/sitehome/p/2" class="p_2 middle" onclick="aggSite.loadCategoryPostList(2,20);buildPaging(2);return false;">2</a>
a[href]:<a href="/sitehome/p/3" class="p_3 middle" onclick="aggSite.loadCategoryPostList(3,20);buildPaging(3);return false;">3</a>
a[href]:<a href="/sitehome/p/4" class="p_4 middle" onclick="aggSite.loadCategoryPostList(4,20);buildPaging(4);return false;">4</a>
a[href]:<a href="/sitehome/p/5" class="p_5 middle" onclick="aggSite.loadCategoryPostList(5,20);buildPaging(5);return false;">5</a>
a[href]:<a href="/sitehome/p/6" class="p_6 middle" onclick="aggSite.loadCategoryPostList(6,20);buildPaging(6);return false;">6</a>
a[href]:<a href="/sitehome/p/7" class="p_7 middle" onclick="aggSite.loadCategoryPostList(7,20);buildPaging(7);return false;">7</a>
a[href]:<a href="/sitehome/p/8" class="p_8 middle" onclick="aggSite.loadCategoryPostList(8,20);buildPaging(8);return false;">8</a>
a[href]:<a href="/sitehome/p/9" class="p_9 middle" onclick="aggSite.loadCategoryPostList(9,20);buildPaging(9);return false;">9</a>
a[href]:<a href="/sitehome/p/10" class="p_10 middle" onclick="aggSite.loadCategoryPostList(10,20);buildPaging(10);return false;">10</a>
a[href]:<a href="/sitehome/p/11" class="p_11 middle" onclick="aggSite.loadCategoryPostList(11,20);buildPaging(11);return false;">11</a>
a[href]:<a href="/sitehome/p/200" class="p_200 last" onclick="aggSite.loadCategoryPostList(200,20);buildPaging(200);return false;">200</a>
a[href]:<a href="/sitehome/p/2" onclick="aggSite.loadCategoryPostList(2,20);buildPaging(2);return false;">Next ></a>
a[href]:<a href="/cate/108698/">.NET技術(3)</a>
a[href]:<a href="/cate/2/">編程語言(1)</a>
a[href]:<a href="/cate/108701/">軟件設計(0)</a>
a[href]:<a href="/cate/108703/">Web前端(1)</a>
a[href]:<a href="/cate/108704/">企業信息化(1)</a>
a[href]:<a href="/cate/108705/">手機開發(1)</a>
a[href]:<a href="/cate/108709/">軟件工程(0)</a>
a[href]:<a href="/cate/108712/">數據庫技術(1)</a>
a[href]:<a href="/cate/108724/">操做系統(0)</a>
a[href]:<a href="/cate/4/">其餘分類(0)</a>
a[href]:<a href="/cate/all/">全部隨筆(159)</a>
a[href]:<a href="/comment/">全部評論(9)</a>
a[href]:<a href="http://space.cnblogs.com/forum/public" target="_blank">反饋或建議</a>
a[href]:<a href="http://www.cnblogs.com/cmt/" target="_blank">官方博客</a>
a[href]:<a href="/skins.aspx" target="_blank">博客模板</a>
a[href]:<a href="http://www.blogjava.net/">Java博客</a>
a[href]:<a href="http://www.cppblog.com/">C++博客</a>
a[href]:<a href="http://www.cnblogs.com/cmt/archive/2009/09/26/1574616.html">手機版</a>
a[href]:<a href="http://click.aliyun.com/m/994/" target="_blank"><img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里雲"></a>
a[href]:<a href="http://www.hujiang.com" target="_blank">滬江網</a>
a[href]:<a href="http://www.chinaz.com/" target="_blank">站長之家</a>
a[href]:<a href="http://dev.yesky.com" target="_blank">天極網</a>
a[href]:<a href="http://down.admin5.com" target="_blank">A5源碼下載</a>
a[href]:<a href="http://www.gcpowertools.com.cn" target="_blank">葡萄城控件</a>
a[href]:<a href="https://www.upyun.com/?md=cnblogs" target="_blank">又拍雲存儲</a>
a[href]:<a href="http://zt.cnblogs.com/tingyun/" target="_blank">聽雲APP</a>
a[href]:<a href="http://www.rongcloud.cn/" target="_blank">融雲IM雲</a>
a[href]:<a href="http://www.hightopo.com/cn-index.html" target="_blank">圖撲軟件</a>
a[href]:<a href="https://yq.aliyun.com/" target="_blank">阿里雲棲社區</a>
a[href]:<a href="http://www.cnblogs.com/mipengine/" target="_blank">百度MIP博客</a>
a[href]:<a href="http://www.easemob.com" target="_blank">環信IM</a>
a[href]:<a href="http://wetest.qq.com/?from=links_cnblogs" target="_blank">騰訊WeTest</a>
a[href]:<a href="http://www.feige.ee?fm=cnblogs" target="_blank">飛鴿傳書短信平臺</a>
a[href]:<a href="https://www.mtyun.com/" target="_blank">美團雲</a>
a[href]:<a href="/AboutUS.aspx">關於博客園</a>
a[href]:<a href="/ContactUs.aspx">聯繫咱們</a>
a[href]:<a href="http://www.cnblogs.com/">博客園</a>
a[href]:<a href="http://www.miitbeian.gov.cn" target="_blank">滬ICP備09004260號</a>
a[href]:<a href="https://ss.knet.cn/verifyseal.dll?sn=e131108110100433392itm000000&ct=df&a=1&pa=0.25787803245785335" rel="nofollow" target="_blank"><img id="cnnic_img" src="//common.cnblogs.com/images/cnnic.png" alt="" width="64" height="23"></a>
a[href]:<a target="_blank" href="http://www.beian.gov.cn/portal/registerSystemInfo?recordcode=31011502001144" style="display:inline-block;text-decoration:none;height:20px;line-height:20px;"><img src="//common.cnblogs.com/images/ghs.png" alt=""><span style="float:left;height:20px;line-height:20px;margin: 0 5px 0 5px; color:#939393;">滬公網安備 31011502001144號</span></a>
========================
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt="">
img[src$=.png]:<img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里雲">
img[src$=.png]:<img id="cnnic_img" src="//common.cnblogs.com/images/cnnic.png" alt="" width="64" height="23">
img[src$=.png]:<img src="//common.cnblogs.com/images/ghs.png" alt="">
========================
純文本:友情連接:滬江網站長之家天極網A5源碼下載葡萄城控件又拍雲存儲聽雲APP融雲IM雲圖撲軟件阿里雲棲社區百度MIP博客環信IM騰訊WeTest飛鴿傳書短信平臺美團雲
HTML:友情連接:
<a href="http://click.aliyun.com/m/994/" target="_blank"><img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里雲"></a>
<a href="http://www.hujiang.com" target="_blank">滬江網</a>
<a href="http://www.chinaz.com/" target="_blank">站長之家</a>
<a href="http://dev.yesky.com" target="_blank">天極網</a>
<a href="http://down.admin5.com" target="_blank">A5源碼下載</a>
<a href="http://www.gcpowertools.com.cn" target="_blank">葡萄城控件</a>
<a href="https://www.upyun.com/?md=cnblogs" target="_blank">又拍雲存儲</a>
<a href="http://zt.cnblogs.com/tingyun/" target="_blank">聽雲APP</a>
<a href="http://www.rongcloud.cn/" target="_blank">融雲IM雲</a>
<a href="http://www.hightopo.com/cn-index.html" target="_blank">圖撲軟件</a>
<a href="https://yq.aliyun.com/" target="_blank">阿里雲棲社區</a>
<a href="http://www.cnblogs.com/mipengine/" target="_blank">百度MIP博客</a>
<a href="http://www.easemob.com" target="_blank">環信IM</a>
<a href="http://wetest.qq.com/?from=links_cnblogs" target="_blank">騰訊WeTest</a>
<a href="http://www.feige.ee?fm=cnblogs" target="_blank">飛鴿傳書短信平臺</a>
<a href="https://www.mtyun.com/" target="_blank">美團雲</a>
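In the output above, the avatar with a `.jpg` source shows up in the `width="48"` results but not in the `img[src$=.png]` results, because `$=` matches only attribute values that end with the given text. The sketch below compares the suffix, prefix, and substring operators on a small document. The class name `AttributeSelectorDemo` and the sample markup (including the `pic.example.com` host) are assumptions made up for this illustration.

```java
import org.jsoup.Jsoup;

public class AttributeSelectorDemo {

    // Hypothetical sample markup: two PNG images and one JPG with a query string
    static final String SAMPLE_HTML =
            "<img src=\"//pic.example.com/face/1.png\">"
          + "<img src=\"//pic.example.com/face/u2.jpg?id=3\">"
          + "<img src=\"/images/logo.png\">";

    // Count the elements in the sample markup that match a CSS query
    static int count(String cssQuery) {
        return Jsoup.parse(SAMPLE_HTML).select(cssQuery).size();
    }

    public static void main(String[] args) {
        System.out.println(count("img[src$=.png]"));   // src ends with ".png"
        System.out.println(count("img[src^=//pic]"));  // src starts with "//pic"
        System.out.println(count("img[src*=face]"));   // src contains "face"
        System.out.println(count("img[src]"));         // src attribute is present
    }
}
```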

 

 

References:

[1] Java1234_小鋒, Jsoup簡介, http://blog.java1234.com/blog/articles/255.html

[2] Java1234_小鋒, Jsoup HelloWorld實現, http://blog.java1234.com/blog/articles/256.html

[3] Java1234_小鋒, 使用Jsoup解析出html中的img元素, http://blog.java1234.com/blog/articles/50.html

[4] Java1234_小鋒, Jsoup查找DOM元素, http://blog.java1234.com/blog/articles/258.html

[5] Java1234_小鋒, Jsoup使用選擇器語法查找DOM元素, http://blog.java1234.com/blog/articles/260.html

[6] Java1234_小鋒, Jsoup獲取DOM元素屬性值, http://blog.java1234.com/blog/articles/262.html
