【python】列出http://www.cnblogs.com/xiandedanteng/p/中的標題

時間 2019-11-20

標籤 python 列出 http www.cnblogs.com www cnblogs com xiandedanteng 標題欄目 Python 简体版

原文原文鏈接

# 列出http://www.cnblogs.com/xiandedanteng/p/中的標題
from bs4 import BeautifulSoup
import requests

user_agent='Mozilla/4.0 (compatible;MEIE 5.5;windows NT)'
headers={'User-Agent':user_agent}
html=requests.get('http://www.cnblogs.com/xiandedanteng/p/',headers=headers)
#print(html.text);
soup= BeautifulSoup(html.text,'html.parser',from_encoding='utf-8')

for titleDiv in soup.find_all(class_="postTitl2"):
    link=titleDiv.find('a')
    print(link.string)

輸出：html

C:\Users\horn1\Desktop\python\4>python titles.py
C:\Users\horn1\AppData\Local\Programs\Python\Python36\lib\site-packages\bs4\__init__.py:146: UserWarning: You provided Unicode markup but also provided a value for from_encoding. Your from_encoding will be ignored.
  warnings.warn("You provided Unicode markup but also provided a value for from_encoding. Your from_encoding will be ignored.")
如何安裝BeautifulSoup4
在過去的二十多年的時間裏
linux CentOS6.5 yum安裝mysql 5.6（轉載&刪改）
Error: Cannot find module 'express'  之  解決方案
使用Nodejs的Nodemailer經過163信箱發送郵件例程
Nodejs 天涯帖子《鹿鼎記中計》 柳成萌著 下載爬蟲
使用js的indexOf,lastIndexOf,slice三函數輕易獲得url的服務器，路徑和頁名
27270圖片批量下載爬蟲1.00
轉帖：心裏如果篤定，何懼未知風雨
求邊長爲一的正方體中，面對角線組成的正四面體體積.