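First, note that in Python 3 every string (str) is a sequence of Unicode code points. PyCharm only uses UTF-8 to interpret the file on disk; the string itself is still Unicode in memory.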
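The difference between a Unicode string and its encoded bytes can be seen in a minimal sketch like the following. It assumes a standard CPython 3 interpreter; the variable names `text` and `data` are only illustrative.

```python
import sys

text = "€20"                    # str: a sequence of Unicode code points
data = text.encode("utf-8")     # bytes: the UTF-8 serialization of that str

print(type(text).__name__, len(text))   # str 3   (three code points)
print(type(data).__name__, len(data))   # bytes 5 (the euro sign takes three bytes)
print(sys.getdefaultencoding())         # utf-8, used when no codec is named
```

The str has the same length whichever encoding is later used to write it out; only the bytes object depends on the codec.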
a = "某gbk編碼格式".encode("gbk")  # sample bytes encoded as GBK
a.decode("gbk").encode("utf-8")
# Decode the GBK bytes to Unicode (str) first, then encode the result as UTF-8
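The decode-then-encode step above can be wrapped in a small helper. This is only a sketch: `transcode` is a hypothetical name, not a standard-library function, and the sample text "中文" is an assumption chosen because it is representable in GBK.

```python
def transcode(data: bytes, source: str = "gbk", target: str = "utf-8") -> bytes:
    """Re-encode bytes from `source` to `target` via an intermediate str."""
    text = data.decode(source)    # bytes -> str (Unicode code points)
    return text.encode(target)    # str -> bytes in the target encoding


gbk_bytes = "中文".encode("gbk")
utf8_bytes = transcode(gbk_bytes)

print(gbk_bytes)    # b'\xd6\xd0\xce\xc4'
print(utf8_bytes)   # b'\xe4\xb8\xad\xe6\x96\x87'
```

Going through str in the middle is what makes the conversion work: there is no direct GBK-to-UTF-8 mapping on bytes, only a mapping from each encoding to Unicode.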
>>> '€20'.encode('utf-8')
b'\xe2\x82\xac20'
>>> b'\xe2\x82\xac20'.decode('utf-8')
'€20'
>>> '€20'.encode('iso-8859-15')
b'\xa420'
>>> b'\xa420'.decode('iso-8859-15')
'€20'
>>> b'\xa420'.decode('windows-1255')
'₪20'
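The last session shows what happens when bytes are decoded with a codec other than the one that produced them: the call succeeds, but the text is wrong. A short sketch that decodes the same bytes under several codecs makes this visible. The choice of `latin-1` as a third codec and the name `decoded` are assumptions added for illustration.

```python
raw = b"\xa420"   # the byte 0xA4 followed by ASCII "20"

# Single-byte codecs accept 0xA4 but map it to different characters.
decoded = {codec: raw.decode(codec)
           for codec in ("iso-8859-15", "windows-1255", "latin-1")}
for codec, text in decoded.items():
    print(f"{codec:>12}: {text}")

# UTF-8 rejects the bytes outright, because 0xA4 cannot start a sequence.
try:
    raw.decode("utf-8")
except UnicodeDecodeError as exc:
    print("utf-8 failed:", exc.reason)
```

Bytes carry no record of their encoding, so the codec has to be known from context (a file header, a protocol field, or convention) rather than guessed from the data.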