Numpy：如何快速替换矩阵中的相等值？

import numpy as np a = np.array([[0,0,1],[0,2,1],[2,2,1]]) m = a.max() #"naive" in O(n*m) i,j = np.mgrid[range(a.shape[0]), range(a.shape[1])] index_i = [[] for _ in range(m+1)] index_j = [[] for _ in range(m+1)] for k in range(m+1): index_i[k] = i[a==k] index_j[k] = j[a==k] #all the zeros: print(a[index_i[0], index_j[0]]) #all the ones: print(a[index_i[1], index_j[1]]) #all the twos: print(a[index_i[2], index_j[2]]) #"sophisticated" in O(n) index_i = [[] for _ in range(m+1)] index_j = [[] for _ in range(m+1)] for i in range(a.shape[0]): for j in range(a.shape[1]): index_i[a[i,j]].append(i) index_j[a[i,j]].append(j) #all the zeros: print(a[index_i[0], index_j[0]]) #all the ones: print(a[index_i[1], index_j[1]]) #all the twos: print(a[index_i[2], index_j[2]])

1条回答

网友

1楼 · 发布于 2024-06-25 23:22:39

这是一个基于sorting的字典，其目的是在迭代以保存为字典时进行最少的工作，其中键是唯一的元素，值是索引-

shp = a.shape
idx = a.ravel().argsort()
idx_sorted = np.c_[np.unravel_index(idx,shp)]
count = np.bincount(a.ravel())
valid_idx = np.flatnonzero(count!=0)
cs = np.r_[0,count[valid_idx].cumsum()]
out = {e:idx_sorted[i:j] for (e,i,j) in zip(valid_idx,cs[:-1],cs[1:])}

样本输入，输出-

In [155]: a
Out[155]: 
array([[0, 2, 6],
       [0, 2, 6],
       [2, 2, 1]])

In [156]: out
Out[156]: 
{0: array([[0, 0],
        [1, 0]]), 1: array([[2, 2]]), 2: array([[0, 1],
        [1, 1],
        [2, 0],
        [2, 1]]), 6: array([[0, 2],
        [1, 2]])}

如果序列中的所有整数都包含在数组中，我们可以将其简化一点-

shp = a.shape
idx = a.ravel().argsort()
idx_sorted = np.c_[np.unravel_index(idx,shp)]
cs = np.r_[0,np.bincount(a.ravel()).cumsum()]
out = {iterID:idx_sorted[i:j] for iterID,(i,j) in enumerate(zip(cs[:-1],cs[1:]))}

相关问题更多 >

编程相关推荐

热门问题

热门文章