解決問題:UnicodeDecodeError utf-8 codec cant decode byte 0xb5 in position 116:

時間 2020-07-14

標籤解決問題 unicodedecodeerror utf codec decode byte 0xb5 position 欄目字符編碼简体版

原文原文鏈接

爬取的中文編碼格式不是UTF-8,沒法正常顯示，查看編碼格式：html 編碼格式爲ISO-8859-1（長見識啦~）在使用urllib獲取reqest的response的時候，還要進行解碼。編碼解決方法：url txt.decode('utf8', 'ignore') 報錯是沒有了　　可是抓取的漢字　仍是亂碼code 解決辦法來了:htm ＃文字亂碼 req.encoding = 'GB2

>>阅读原文<<