Extracting HTML tags with BeautifulSoup in Python

Date: 2019-11-12

Tags: python, beautifulsoup, html, tag extraction. Category: Python


At first I used BeautifulSoup's get_text() to extract the string, but the extraction kept failing with the error TypeError: 'NoneType' object is not callable.

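A None value was being returned, most likely because something went wrong when extracting the content of the span tag, so I switched to name.string to extract the text, which worked.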
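The TypeError above has a plausible explanation, offered here as an assumption rather than something the original post confirms: the old BeautifulSoup 3 package (the one loaded by import BeautifulSoup) has no get_text() method, and it treats an unknown attribute on a tag as a lookup for a child tag of that name. No child tag called get_text exists, so the lookup yields None, and calling None raises the error. The toy class below mimics that behaviour in plain Python; ToyTag and its sample text are hypothetical and are not the library's code.

```python
class ToyTag:
    """Hypothetical stand-in for a parsed tag (not BeautifulSoup's real class)."""

    def __init__(self, name, text, children=()):
        self.name = name
        self.string = text
        self.children = list(children)

    def find(self, child_name):
        # Return the first direct child with this tag name, or None.
        for child in self.children:
            if child.name == child_name:
                return child
        return None

    def __getattr__(self, attr):
        # Python calls this only for attributes that do not exist.
        # Assumption: like BeautifulSoup 3, treat them as child-tag lookups.
        if attr.startswith("__"):
            raise AttributeError(attr)
        return self.find(attr)


span = ToyTag("span", "example text")

# A real attribute works as expected.
print(span.string)

# get_text is not defined, so the lookup returns None and the call fails.
try:
    span.get_text()
except TypeError as exc:
    error_message = str(exc)

print(error_message)  # 'NoneType' object is not callable
```

If this is indeed the cause, name.string works simply because string is a real attribute of the tag, so no child lookup takes place.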

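# -*- coding: utf-8 -*-
"""
Created on Wed Jan 11 17:21:54 2017

@author: PE-Monitor
"""
# Python 2 script using the BeautifulSoup 3 package.
import sys
import urllib2

import BeautifulSoup

# Python 2 workaround: make utf-8 the default encoding so that printing
# non-ASCII text does not fail.
reload(sys)
sys.setdefaultencoding('utf-8')

# Download the page and parse it.
response = urllib2.urlopen("http://www.pythonscraping.com/pages/warandpeace.html")
html = BeautifulSoup.BeautifulSoup(response)

# Find every <span class="green"> tag and print its text content.
nameList = html.findAll('span', {'class': 'green'})
for name in nameList:
    print(name.string)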
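For readers on Python 3, the same extraction can be sketched with the newer bs4 package, where get_text() does exist. This is a minimal sketch under stated assumptions, not the original author's code: it assumes beautifulsoup4 is installed, the helper names fetch_html and green_span_texts are made up for illustration, and SAMPLE_HTML is invented markup so the example runs without network access.

```python
from urllib.request import urlopen

from bs4 import BeautifulSoup  # assumption: installed via "pip install beautifulsoup4"


def fetch_html(url):
    """Download a page and return its raw bytes (needs network access)."""
    with urlopen(url) as response:
        return response.read()


def green_span_texts(html):
    """Return the text of every <span class="green"> in the given HTML."""
    soup = BeautifulSoup(html, "html.parser")
    return [span.get_text() for span in soup.find_all("span", class_="green")]


# Made-up sample markup so the sketch runs offline.
SAMPLE_HTML = """
<div>
  <span class="green">first name</span>
  <span class="red">a quote</span>
  <span class="green">second <b>name</b></span>
</div>
"""

names = green_span_texts(SAMPLE_HTML)
print(names)  # ['first name', 'second name']

# .string is None when a tag has more than one child; get_text() still
# joins all of the text inside the tag.
nested = BeautifulSoup(SAMPLE_HTML, "html.parser").find_all("span", class_="green")[1]
print(nested.string)      # None
print(nested.get_text())  # second name
```

To run it against the page used in the script above, call green_span_texts(fetch_html("http://www.pythonscraping.com/pages/warandpeace.html")). The sketch uses get_text() rather than .string because .string returns None for any span that contains nested tags.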
