1 # -*- coding:gbk -*- 2 # 即使設定檔案編碼為gbk,下方定義的字串s1依舊為unicode 3 4 # 獲取默認編碼格式 5 import sys 6 print(sys.getdefaultencoding()) 7 # >>> utf-8 8 9 # 編碼轉換 10 # --------------python 2---------------- 11 # utf-8 --> decode --> unicode 12 # unicode --> encode --> gbk 13 14 s = "你好" 15 # 無decode方法,由于python3默認unicode, utf-8為原本的字符集,傳給decode識別 16 s_unicode = s.decode("utf-8") 17 # gbk為目標字符集,指定給encode進行轉換 18 s_gbk = s_unicode.encode("gbk") 19 20 # 示例 21 s1 = "大家好" 22 s1_to_utf8 = s1.encode("utf-8") 23 print(s1_to_utf8) # >>> b'\xe5\xa4\xa7\xe5\xae\xb6\xe5\xa5\xbd' 24 s1_to_gbk = s1_to_utf8.decode("utf-8").encode("gbk") 25 s1_to_gb2312 = s1_to_utf8.decode("utf-8").encode("gb2312") 26 s1_to_gb2312_chinese = s1_to_gb2312.decode("gb2312") 27 # decode轉換為unicode后可以輸出中文,所以證明默認為unicode 28 print(s1_to_gb2312_chinese) # >>> 大家好 29 print(s1_to_gb2312) # >>> b'\xb4\xf3\xbc\xd2\xba\xc3' 30 print(s1_to_gbk) # >>> b'\xb4\xf3\xbc\xd2\xba\xc3' 31 print(s1) # >>> 大家好
轉載請註明出處,本文鏈接:https://www.uj5u.com/houduan/143934.html
標籤:Python
上一篇:Python--函式&程序
下一篇:Python--字典(三級選單)
