Efficiency with very large numpy arrays

import numpy as np

# samplez is a 3 million element 1-D array
# zfit is a 10,000 x 500 2-D array

b = np.arange(len(zfit))

for x in samplez:
    a = x - zfit
    mask = np.ma.masked_array(a)
    mask[a <= 0] = np.ma.masked
    index = mask.argmin(axis=1)
    # These past 4 lines give me an index array of the smallest positive number
    # in x - zfit

    d = zfit[b, index]
    e = zfit[b, index + 1]
    f = (x - d) / (e - d)
    # f is the calculation I am after

    if x == samplez[0]:
        g = f
        index_stack = index
    else:
        g = np.vstack((g, f))
        index_stack = np.vstack((index_stack, index))

1 Answer

User

#1 · Posted 2024-09-30 01:33:36

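It helps to know that the smallest positive number never appears at the end of a row.

There are 1 million unique values in samplez, but in zfit each row can hold at most 500 unique values. The whole of zfit can have up to 50 million unique values. If the number of "find the smallest positive number" computations per element of samplez can be cut substantially, the algorithm will speed up considerably. Doing all 5e13 comparisons is probably overkill, and careful planning should eliminate a large fraction of them. How much depends heavily on your actual underlying math.

Without knowing that, there are still a few small things that can be done. First, there are not that many possible values of (e-d), so that computation can be taken out of the loop. Second, the loop can be replaced by map. On my machine these two small fixes give a speed-up of about 22%.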
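One way to reduce the number of searches, offered only as a sketch: if samplez contains repeated values, the per-value work can be done once per distinct value and then expanded back to the full array. The names `per_unique_value` and `expensive` are hypothetical and stand in for whatever per-sample computation is actually needed.

```python
import numpy as np

def per_unique_value(samplez, func):
    """Apply func once per distinct value of samplez, then expand back."""
    # uniq holds the sorted distinct values; inverse maps each original
    # element to its position in uniq
    uniq, inverse = np.unique(samplez, return_inverse=True)
    results = np.array([func(x) for x in uniq])
    # Index along the first axis to restore the original order and length
    return results[inverse]

calls = []

def expensive(x):
    # Hypothetical stand-in for the real per-sample computation
    calls.append(x)
    return x * 2.0

samplez = np.array([3.0, 1.0, 3.0, 2.0, 1.0])
out = per_unique_value(samplez, expensive)
```

Here `expensive` runs three times rather than five. The saving scales with the ratio of total to distinct values, so it only helps when duplicates are common.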

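import numpy as np

def function_map(samplez, zfit):
    # (e - d) depends only on zfit, so compute it once, outside the loop
    diff = zfit[:, 1:] - zfit[:, :-1]
    rows = np.arange(len(zfit))
    def _func1(x):
        a = x - zfit
        mask = np.ma.masked_array(a)
        mask[a <= 0] = np.ma.masked
        index = mask.argmin(axis=1)
        d = zfit[rows, index]  # one element per row
        # constraint: the smallest positive value is never in the last column
        f = (x - d) / diff[rows, index]
        return (index, f)
    # list() so the result can be iterated twice under Python 3
    result = list(map(_func1, samplez))
    return (np.array([item[1] for item in result]),
            np.array([item[0] for item in result]))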

Next: masked_array can be avoided completely, which should bring a significant improvement. samplez also needs to be sorted.
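A minimal sketch of one way to drop masked_array, not necessarily the approach the answer's own code took and not timed: replace the non-positive entries with +inf so that a plain argmin skips them. The function name `interp_fraction` and the small `zfit` below are illustrative assumptions, and the sketch does not make use of a sorted samplez.

```python
import numpy as np

def interp_fraction(x, zfit, diff, rows):
    """Return (index, f) for one sample x without using a masked array."""
    a = x - zfit
    # Non-positive differences become +inf, so argmin picks the
    # smallest strictly positive entry in each row
    index = np.where(a > 0, a, np.inf).argmin(axis=1)
    d = zfit[rows, index]
    # Assumes the smallest positive value is never in the last column
    return index, (x - d) / diff[rows, index]

# Tiny illustrative data set (made up for this example)
zfit = np.array([[0.0, 1.0, 2.0, 3.0],
                 [0.5, 1.5, 2.5, 3.5]])
diff = zfit[:, 1:] - zfit[:, :-1]
rows = np.arange(len(zfit))

index, f = interp_fraction(1.25, zfit, diff, rows)
```

For x = 1.25 this selects column 1 in the first row and column 0 in the second, giving fractions 0.25 and 0.75.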

^{pr2}$

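So, that is another 50% speed-up.

Avoiding masked_array also saves some RAM. I can't think of other ways to reduce RAM usage; you may need to process samplez in parts. Also, depending on the data and the precision required, using float16 or another reduced-precision float type instead of the default float64 can save a lot of RAM.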
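Both memory suggestions can be sketched together, assuming float32 gives enough precision for the data at hand (a 10,000 x 500 array takes 40 MB as float64 and 20 MB as float32). The generator name `process_in_chunks` and the parameter `n_chunks` are hypothetical. Each chunk's results are yielded separately so the caller can write them to disk or reduce them before the next chunk is computed.

```python
import numpy as np

def process_in_chunks(samplez, zfit, n_chunks, dtype=np.float32):
    """Yield the f values for samplez one chunk at a time."""
    zfit = zfit.astype(dtype)          # assumption: reduced precision is acceptable
    diff = zfit[:, 1:] - zfit[:, :-1]
    rows = np.arange(len(zfit))
    for chunk in np.array_split(samplez.astype(dtype), n_chunks):
        f_rows = []
        for x in chunk:
            a = x - zfit
            # Smallest strictly positive entry per row, without a masked array
            index = np.where(a > 0, a, np.inf).argmin(axis=1)
            f_rows.append((x - zfit[rows, index]) / diff[rows, index])
        yield np.array(f_rows)

# Tiny illustrative data set (made up for this example)
zfit = np.array([[0.0, 1.0, 2.0, 3.0],
                 [0.5, 1.5, 2.5, 3.5]])
samplez = np.array([1.25, 2.25, 0.75])

parts = list(process_in_chunks(samplez, zfit, n_chunks=2))
g = np.concatenate(parts)
```

Collecting every chunk into a list, as done here for demonstration, gives back the full result and therefore the full memory footprint; in practice each chunk would be consumed as it arrives.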
