# 列出http://www.cnblogs.com/xiandedanteng/p/中的標題 from bs4 import BeautifulSoup import requests user_agent='Mozilla/4.0 (compatible;MEIE 5.5;windows NT)' headers={'User-Agent':user_agent} html=requests.get('http://www.cnblogs.com/xiandedanteng/p/',headers=headers) #print(html.text); soup= BeautifulSoup(html.text,'html.parser',from_encoding='utf-8') for titleDiv in soup.find_all(class_="postTitl2"): link=titleDiv.find('a') print(link.string)
輸出:html
C:\Users\horn1\Desktop\python\4>python titles.py C:\Users\horn1\AppData\Local\Programs\Python\Python36\lib\site-packages\bs4\__init__.py:146: UserWarning: You provided Unicode markup but also provided a value for from_encoding. Your from_encoding will be ignored. warnings.warn("You provided Unicode markup but also provided a value for from_encoding. Your from_encoding will be ignored.") 如何安裝BeautifulSoup4 在過去的二十多年的時間裏 linux CentOS6.5 yum安裝mysql 5.6(轉載&刪改) Error: Cannot find module 'express' 之 解決方案 使用Nodejs的Nodemailer經過163信箱發送郵件例程 Nodejs 天涯帖子《鹿鼎記中計》 柳成萌著 下載爬蟲 使用js的indexOf,lastIndexOf,slice三函數輕易獲得url的服務器,路徑和頁名 27270圖片批量下載爬蟲1.00 轉帖:心裏如果篤定,何懼未知風雨 求邊長爲一的正方體中,面對角線組成的正四面體體積.
基本達到要求,萬里長征又邁出了一小步python