将列表中的每个元素与另一个列表中的两个元素进行比较，从而提高代码的效率

ca2=[(b>retention[0]>a) for b,a in zip(threshold[::1],threshold[1::1])] ca3=[(b>retention[1]>a) for b,a in zip(threshold[::1],threshold[1::1])] ca4=[(b>retention[2]>a) for b,a in zip(threshold[::1],threshold[1::1])] ca5=[(b>retention[3]>a) for b,a in zip(threshold[::1],threshold[1::1])] ca6=[(b>retention[4]>a) for b,a in zip(threshold[::1],threshold[1::1])]

3条回答

网友

1楼 · 编辑于 2024-09-29 02:29:00

不完全确定您想要实现什么，但是您可以使用bisect在阈值列表中进行二进制搜索，以找到刚好低于给定数字的阈值

retention = [0.19, 1, 0.57, 5, 0.09]
threshold = [0.123, 0.108, 0.102, 0.087]
threshold = [0] + sorted(threshold) # add 0 and sort
bins = {t: [] for t in threshold}
for r in retention:
    k = bisect.bisect(threshold, r) # actually, this is the next threshold
    bins[threshold[k-1]].append(r)  # thus k-1 here to get the lower one
# {0: [], 0.087: [0.09], 0.102: [], 0.108: [], 0.123: [0.19, 1, 0.57, 5]}

与另一个bisect答案（这会产生非常不同的输出）一样，对于retention中的k个元素，每个查询的复杂性是O（logn），n是阈值的数目，总共O（klogn）

网友

2楼 · 编辑于 2024-09-29 02:29:00

您可以生成一个字典，其中保留值作为键，阈值比较列表作为值。此外，如果将zip对象强制转换为列表，则不需要每次迭代都创建它

t = list(zip(threshold, threshold[1:]))
print({i: [(b > i > a) for b, a in t] for i in retention})

网友

3楼 · 编辑于 2024-09-29 02:29:00

要检查每个保留元素是否在阈值的两个元素之间，可以使用对分（即每次检查的日志（n）时间）

代码

from bisect import bisect_left

def binary_search(a, x): 
    """Index of where x would be inserted into a
       return None if x < min(a) or x > max(a)"""
    i = bisect_left(a, x)
    return i if i != len(a) and i > 0 else None

threshold = [0.123,0.108,0.102,0.087]
threshold_asc = threshold[::-1]
retention = [0.123, 0.19,1,0.57,5,0.09, 0.087]

for r in retention:
  print(f'{r} ->> {binary_search(threshold_asc, r)}')

输出

0.123 ->> 3
0.19 ->> None
1 ->> None
0.57 ->> None
5 ->> None
0.09 ->> 1
0.087 ->> None

复杂性

O(log(N)) for each check of retention. This is more efficient than walking the list of thresholds to find pairs of surrounding values which would be O(N).

相关问题更多 >

编程相关推荐

热门问题

热门文章