Reading hex from a file (Python)

2 answers

User

Answer 1 · edited 2024-09-29 03:39:21

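Edit: Please use Martijn's solution. I didn't know about text.decode('string_escape'), and it is of course much faster. My original answer follows.

This regular expression unescapes every escaped hex expression in a string: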

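import re

def unescape(text):
    # Match either an escaped backslash (\\) or a \xhh hex escape.
    # group(1) is only set for \xhh; otherwise emit a single backslash.
    return re.sub(r'\\\\|\\x([0-9a-fA-F]{2})',
        lambda m: chr(int(m.group(1), 16)) if m.group(1)
                  else '\\', text)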

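If you know the input will never contain a double backslash followed by x (for example foo bar \\x41 bloh, which should probably be read as foo bar \x41 bloh rather than foo bar \A bloh), you can simplify this to: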

def unescape(text):
    return re.sub(r'\\x([0-9a-fA-F]{2})',
        lambda m: chr(int(m.group(1), 16)), text)
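The regex approach above was written for Python 2. As a minimal sketch, assuming Python 3 and input that has already been read as a str, the same idea might look like the following. The names ESCAPE_RE and replace are illustrative and do not come from the original answer.

```python
import re

# Matches either an escaped backslash (\\) or a \xhh hex escape.
# The capture group is only filled in for the \xhh alternative.
ESCAPE_RE = re.compile(r'\\\\|\\x([0-9a-fA-F]{2})')

def unescape(text):
    def replace(match):
        hex_digits = match.group(1)
        if hex_digits is None:
            # The match was a double backslash: collapse it to one.
            return '\\'
        # The match was \xhh: return the character with that code point.
        return chr(int(hex_digits, 16))
    return ESCAPE_RE.sub(replace, text)

print(unescape(r'foo bar \x41 bloh'))   # foo bar A bloh
print(unescape(r'foo bar \\x41 bloh'))  # foo bar \x41 bloh
```

Because the double-backslash alternative is tried first, an escaped backslash is consumed before the x that follows it can be mistaken for the start of a hex escape.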

User

Answer 2 · edited 2024-09-29 03:39:21

You have text containing string-literal style \xhh hex escapes. You can decode it with the string_escape codec:

text.decode('string_escape')

See the Python Specific Encodings section of the codecs module documentation:

string_escape
Produce a string that is suitable as string literal in Python source code

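Decoding reverses that encoding: the \xhh escape sequences in the text are turned back into the bytes they represent.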
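The string_escape codec exists only in Python 2. As a hedged sketch for Python 3, assuming the file holds only ASCII text with \xhh escapes, one option is to decode the raw bytes with unicode_escape and map the result back to bytes with latin-1. The helper name unescape_bytes and the file name escaped.txt are made up for this illustration.

```python
import os
import tempfile

def unescape_bytes(raw):
    # unicode_escape turns each \xhh escape into the code point 0xhh.
    # latin-1 then maps code points 0-255 one-to-one back to bytes.
    # Assumption: the input has no \u or \U escapes above 0xff, which
    # would make the latin-1 step raise UnicodeEncodeError.
    return raw.decode('unicode_escape').encode('latin-1')

# Demo: write escaped text to a temporary file, then read it in binary mode.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, 'escaped.txt')  # hypothetical file name
    with open(path, 'wb') as f:
        f.write(rb'\x69\x73\x41\x72\x72\x61\x79')
    with open(path, 'rb') as f:
        decoded = unescape_bytes(f.read())

print(decoded)  # b'isArray'
```

Reading the file in binary mode keeps the backslashes as literal bytes, so the codec sees exactly what is stored on disk.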

Because it is a built-in codec, this is much faster than using a regular expression:

>>> from timeit import timeit
>>> import re
>>> def unescape(text):
...     return re.sub(r'\\x([0-9a-fA-F]{2})',
...         lambda m: chr(int(m.group(1), 16)), text)
...
>>> value = "\\x69\\x73\\x41\\x72\\x72\\x61\\x79"
>>> timeit('unescape(value)', 'from __main__ import unescape, value')
6.254786968231201
>>> timeit('value.decode("string_escape")', 'from __main__ import value')
0.43862390518188477

That is about 14 times faster.
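The timings above come from the answerer's Python 2 session. To repeat the comparison on Python 3, a rough sketch could look like this, assuming unicode_escape as a stand-in for string_escape; the function names and the iteration count are illustrative, and the numbers will differ by machine and Python version.

```python
import re
from timeit import timeit

def unescape_regex(text):
    # Regex version: replace each \xhh with the matching character.
    return re.sub(r'\\x([0-9a-fA-F]{2})',
                  lambda m: chr(int(m.group(1), 16)), text)

def unescape_codec(text):
    # Codec version: assumes ASCII-only input containing \xhh escapes.
    return text.encode('latin-1').decode('unicode_escape')

value = '\\x69\\x73\\x41\\x72\\x72\\x61\\x79'

# Check that both approaches agree before timing them.
assert unescape_regex(value) == unescape_codec(value) == 'isArray'

regex_time = timeit(lambda: unescape_regex(value), number=100000)
codec_time = timeit(lambda: unescape_codec(value), number=100000)
print('regex: %.3fs  codec: %.3fs' % (regex_time, codec_time))
```

Passing callables to timeit avoids the setup-string imports used in the interactive session, at the cost of a small fixed call overhead on both measurements.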
