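1. ASCII: one byte per character. GB2312 is an ASCII-compatible extension that encodes each Chinese character in two bytes.
2. Unicode (as UCS-2/UTF-16): a common Chinese character takes two bytes.
3. UTF-8: a common Chinese character takes three bytes.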
>>> name = u'中国'
>>> name
u'\u4e2d\u56fd'    # unicode
>>> print name
中国
>>> name.encode('utf-8')
'\xe4\xb8\xad\xe5\x9b\xbd'    # utf-8
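The session above is Python 2. As a rough sketch of the same check in Python 3 (an assumption on my part, since the notes only show Python 2), the byte counts from the list can be confirmed by encoding the same string several ways. The variable names are illustrative only.

```python
# Python 3 sketch: str is already Unicode, so no u'' prefix is needed.
name = "中国"

# Code points, matching the \u4e2d\u56fd escapes from the Python 2 session.
code_points = [hex(ord(ch)) for ch in name]

utf8_bytes = name.encode("utf-8")        # three bytes per character here
utf16_bytes = name.encode("utf-16-be")   # two bytes per character, no BOM
gb2312_bytes = name.encode("gb2312")     # two bytes per character
ascii_bytes = "A".encode("ascii")        # one byte

print(code_points)
print(utf8_bytes)
print(len(utf8_bytes), len(utf16_bytes), len(gb2312_bytes), len(ascii_bytes))

# Decoding reverses the encoding step and recovers the original text.
assert utf8_bytes.decode("utf-8") == name
```

Note that `utf-16-be` is used instead of plain `utf-16` so that no byte order mark is prepended, which would otherwise add two bytes to the length.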