utf8转换为utf16

data="index=索引?" print(data.encode('UTF-16LE')) def convert(s): returnCode=[] temp='' for n in s.encode('utf-16be'): if temp=='': if str.replace(hex(n),'0x','')=='0': temp='00' continue temp+=str.replace(hex(n),'0x','') else: returnCode.append(temp+str.replace(hex(n),'0x','')) temp='' return returnCode print(convert(data))

2条回答

网友

1楼 · 编辑于 2024-10-06 18:18:26

我不确定我是否理解你。

Unicode就像一种类型。在python 3中，所有字符串都是unicode，所以当您编写data = "index=索引?"时，数据已经是unicode了。如果只想获得用于显示的替代表示，可以使用：

def display_unicode(data):
    return "".join(["\\u%s" % hex(ord(l))[2:].zfill(4) for l in data])

>>> data = "index=索引?"
>>> print(display_unicode(data))
\u0069\u006e\u0064\u0065\u0078\u003d\u7d22\u5f15\u003f

请注意，字符串现在有真正的反斜杠和数字表示，而不是unicode字符。

但可能还有其他选择

>>> data.encode('ascii', 'backslashreplace')
b'index=\\u7d22\\u5f15?'
>>> data.encode('unicode_escape')
b'index=\\u7d22\\u5f15?'

网友

2楼 · 编辑于 2024-10-06 18:18:26

尝试先解码，比如：s.decode('utf-8').encode('utf-16be')？

相关问题更多 >

编程相关推荐

热门问题

热门文章