WebCollector 2.x official site and mirror:
Official site: https://github.com/CrawlScript/WebCollector
Mirror: http://git.oschina.net/webcollector/WebCollector
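WebCollector 2.x tutorials:
WebCollector 2.x tutorial 2 (BreadthCrawler tutorial in Chinese)
WebCollector 2.x automatic main-content extraction algorithm for news pages
WebCollector 2.x extractors (Extractor and MultiExtractorCrawler)
Crawling JS-generated data with WebCollector
Crawling Sogou search results with WebCollector (pagination)
Crawling JSON data with WebCollector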
Managing the crawling of multiple pages at once with SoupLang scripts
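Crawling Sina Weibo with WebCollector 2.x (no need to obtain cookies manually)
WebCollector 2.x tutorials (mirror):
WebCollector 2.x tutorial 2 (BreadthCrawler tutorial in Chinese)
WebCollector 2.x automatic main-content extraction algorithm for news pages
WebCollector 2.x extractors (Extractor and MultiExtractorCrawler)
Crawling JS-generated data with WebCollector
Crawling Sogou search results with WebCollector (pagination)
Crawling JSON data with WebCollector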