How do I handle a u-prefixed UTF-8 string (it needs decoding, but decoding fails because of the "u")?

function ajax2(options) {
  var querystring = options.params ? urllib.format({ query: options.params }) : '';
  if (options.loading) swalShowLoading();
  reqwest({
    url: options.url + querystring,
    contentType: 'application/json',
    method: options.method || 'GET',
    data: JSON.stringify(options.data)
  });
}

// getFileList
// This is a `GET` request, so no data is passed to JSON.stringify here.
// Sorry for the wrong explanation before.
ajax2({
  url: ApiUrl.list,
  params: { path: encodeURI(path) },
  success: success,
  error: error
});

2 Answers

User

#1 · edited 2024-05-18 14:51:15

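A unicode string can be encoded to a byte str.
A str can be decoded to a unicode string.

When you have a u'' string literal, or when you import unicode_literals (from __future__), those string literals are unicode strings. You can only encode them, not decode them. When you try to decode an already-decoded unicode string, the error you get stems from an implicit conversion: Python 2 first encodes the unicode string with the default ASCII codec, which fails on non-ASCII characters.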

>>> p1.encode('utf-8')
'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'

\xe5… denotes the raw bytes of the string (str).

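u'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'
This is a unicode literal whose characters are literally the code points "\xe5…", not bytes.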
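When you have a raw byte representation, you need to make sure Python treats it as a str, not as unicode:
>>> p2 = b'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'
>>> p2.decode('utf-8')
u'E:\\filemanager\\data\\c - \u526f\u672c'
The b prefix marks the literal as a str, which can be decoded to unicode.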
u'' literals, and '' literals under unicode_literals, are unicode → encode to str
b'' literals are str → decode to unicode
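
For readers on Python 3, the same rule can be sketched as follows. This is only an illustration under the assumption of Python 3, where the text type is called str and the byte type is called bytes; the answer above uses Python 2 names. The variable names are made up for the example.

```python
# Python 3 sketch: text (str) is encoded to bytes; bytes are decoded to text.
text = 'E:\\filemanager\\data\\c - \u526f\u672c'   # text string (Python 2: unicode)

raw = text.encode('utf-8')          # text -> bytes
assert raw == b'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'

roundtrip = raw.decode('utf-8')     # bytes -> text
assert roundtrip == text

# Text strings have no decode() method in Python 3, so trying to decode
# already-decoded text fails immediately instead of via an implicit conversion.
has_decode = hasattr(text, 'decode')
```

In Python 3 the direction of each call is enforced by the types, which is probably the easiest way to avoid this class of mistake.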

User
#2 · edited 2024-05-18 14:51:15

What I want is to convert
u'E:\filemanager\data\c - \xe5\x89\xaf\xe6\x9c\xac' to
either u'E:\filemanager\data\c - \u526f\u672c' or 'E:\filemanager\data\c - \xe5\x89\xaf\xe6\x9c\xac'
\xe5\x89\xaf\xe6\x9c\xac cannot be decoded due to the u prefix; that is the key problem!
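A Unicode string cannot be decoded. The key problem is that a UTF-8-encoded byte string was decoded incorrectly in the first place.
Here is how to reverse it, but what you should really fix is why it was decoded wrongly to begin with.
latin1 is the codec that converts the first 256 Unicode code points directly to bytes: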
>>> s = u'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'
>>> s.encode('latin1')
'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'
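So that gets rid of the u. Now you have a byte string that can be decoded as UTF-8:
>>> s.encode('latin1').decode('utf8')
u'E:\\filemanager\\data\\c - \u526f\u672c'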
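
The latin1 round trip described in this answer could be wrapped in a small helper. The following is a minimal sketch, assuming Python 3; repair_mojibake is a hypothetical name, not a library function. It returns the input unchanged when the text cannot be the result of such a wrong decode.

```python
def repair_mojibake(text):
    """Undo a wrong Latin-1 decode of UTF-8 bytes (hypothetical helper).

    Returns the input unchanged when it cannot be such a mis-decode.
    """
    try:
        # latin1 maps code points 0-255 one-to-one onto byte values
        raw = text.encode('latin1')
        return raw.decode('utf-8')
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Either a code point above 255, or bytes that are not valid UTF-8
        return text


broken = 'E:\\filemanager\\data\\c - \xe5\x89\xaf\xe6\x9c\xac'
fixed = repair_mojibake(broken)
```

A helper like this only treats the symptom. As the answer says, the better fix is to find the place where the bytes were decoded with the wrong codec.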
