将字符串“aabbcc”>[“aa”，“bb”，“cc”]拆分为回复spli

网友

1楼 · 编辑于 2024-09-30 18:24:37

通读这些注释，我们发现真正的问题是：解析十六进制RRGGBBAA格式的颜色定义字符串的最快方法是什么。以下是一些选项：

def rgba1(s, unpack=struct.unpack):
    return unpack("BBBB", s.decode("hex"))

def rgba2(s, int=int, xrange=xrange):
    return [int(s[i:i+2], 16) for i in xrange(0, 8, 2)]

def rgba3(s, int=int, xrange=xrange):
    x = int(s, 16)
    return [(x >> i) & 255 for i in xrange(0, 32, 8)]

正如我所料，第一个版本是最快的：

^{pr2}$

网友

2楼 · 编辑于 2024-09-30 18:24:37

In [4]: ["".join(pair) for pair in zip(* 2 * [iter(s)])]
Out[4]: ['aa', 'bb', 'cc']

请参见：How does zip(*[iter(s)]*n) work in Python?以了解对相同str语法的奇怪“2-iter”的解释。在

你在评论中说你想“拥有最快的执行速度”，我不能向你保证这个实现，但是你可以使用^{}来测量的执行。当然记得what Donald Knuth said about premature optimisation。对于手头的问题（现在你已经揭示了它），我想你会发现{}很难克服。在

^{pr2}$

比较

python3.2 -m timeit -c '
s = "aabbcc"
r, g, b = s[0:2], s[2:4], s[4:6]
'
1000000 loops, best of 3: 1.2 usec per loop

网友

3楼 · 编辑于 2024-09-30 18:24:37

Numpy比单个查找的首选解决方案差：

$ python -m timeit -s 'import numpy as np; s="aabbccdd"' 'a = np.fromstring(s.decode("hex"), dtype="uint32"); a.dtype = "uint8"; list(a)'
100000 loops, best of 3: 5.14 usec per loop
$ python -m timeit -s 's="aabbcc";' '[int(s[i:i+2], 16) / 255. for i in xrange(0, len(s), 2)]'
100000 loops, best of 3: 2.41 usec per loop

但是如果你一次做几个转换，numpy要快得多：

^{pr2}$

在我的电脑上，Numpy对于大于2的批处理程序更快。您可以通过将a.shape设置为(number_of_colors, 4)来轻松地对值进行分组，尽管这会使tolist方法慢50%。在

实际上，大部分时间都花在将数组转换为列表上。根据您希望如何处理结果，您可以跳过这一步，并获得一些好处：

$ python -m timeit -s 'import numpy as np; s="aabbccdd" * 100' 'a = np.fromstring(s.decode("hex"), dtype="uint32"); a.dtype = "uint8"; a.shape = (100,4)'
100000 loops, best of 3: 6.76 usec per loop

相关问题更多 >

编程相关推荐

热门问题

热门文章