Python/Numpy在一个数组中快速找到最接近某个值的索引

import numpy as np import timeit t = np.arange(10,100000) # Not always uniform, but in increasing order x = np.random.uniform(10,100000) # Some value to find within t def f1(t, x): ind = np.searchsorted(t, x) # Get index to preserve order ind = min(len(t)-1, ind) # In case x > max(t) ind = max(1, ind) # In case x < min(t) if x < (t[ind-1] + t[ind]) / 2.0: # Closer to the smaller number ind = ind-1 return ind def f2(t, x): return np.abs(t-x).argmin() print t, '\n', x, '\n' print f1(t, x), '\n', f2(t, x), '\n' print t[f1(t, x)], '\n', t[f2(t, x)], '\n' runs = 1000 time = timeit.Timer('f1(t, x)', 'from __main__ import f1, t, x') print round(time.timeit(runs), 6) time = timeit.Timer('f2(t, x)', 'from __main__ import f2, t, x') print round(time.timeit(runs), 6)

3条回答

网友

1楼 · 编辑于 2024-05-18 11:16:45

np.searchsorted是二进制搜索（每次将数组分成两半）。因此，必须以返回最后一个小于x的值而不是返回0的方式实现它。

看看这个算法（从here）：

def binary_search(a, x):
    lo=0
    hi = len(a)
    while lo < hi:
        mid = (lo+hi)//2
        midval = a[mid]
        if midval < x:
            lo = mid+1
        elif midval > x: 
            hi = mid
        else:
            return mid
    return lo-1 if lo > 0 else 0

刚刚替换了最后一行（wasreturn -1）。也改变了论点。

由于循环是用Python编写的，因此它可能比第一个循环慢。。。（未设定基准）

网友

2楼 · 编辑于 2024-05-18 11:16:45

这似乎要快得多（对我来说，Python 3.2-win32，numpy 1.6.0）：

from bisect import bisect_left
def f3(t, x):
    i = bisect_left(t, x)
    if t[i] - x > 0.5:
        i-=1
    return i

输出：

[   10    11    12 ..., 99997 99998 99999]
37854.22200356027
37844
37844
37844
37854
37854
37854
f1 0.332725
f2 1.387974
f3 0.085864

网友

3楼 · 编辑于 2024-05-18 11:16:45

使用搜索排序：

t = np.arange(10,100000)         # Not always uniform, but in increasing order
x = np.random.uniform(10,100000)

print t.searchsorted(x)

编辑：

啊，是的，我知道你在f1就是这么做的。也许下面的f3比f1更容易阅读。

def f3(t, x):
    ind = t.searchsorted(x)
    if ind == len(t):
        return ind - 1 # x > max(t)
    elif ind == 0:
        return 0
    before = ind-1
    if x-t[before] < t[ind]-x:
        ind -= 1
    return ind

相关问题更多 >

编程相关推荐

热门问题

热门文章