Creating an intersection from two or more 2D numpy arrays based on common values in one column

a = np.array([(1, 5.41), (2, 5.42), (3, 12.32)], dtype=[('position', '<i4'), ('score', '<f4')])
b = np.array([(3, 8.41), (6, 7.42), (4, 6.32)], dtype=[('position', '<i4'), ('score', '<f4')])
c = np.array([(3, 7.41), (7, 6.42), (1, 5.32)], dtype=[('position', '<i4'), ('score', '<f4')])

1 answer

User

#1 · Posted 2024-09-29 21:43:09

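Here is one approach that I believe should be fairly fast. I think the first thing you want to do is count the number of occurrences of each position. This function handles that: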

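import numpy as np

def count_positions(positions):
    # Sort so that equal positions sit next to each other.
    positions = np.sort(positions)
    # diff[i] is True where positions[i] is the last element of a run of equal values.
    diff = np.ones(len(positions), 'bool')
    diff[:-1] = positions[1:] != positions[:-1]
    # Indices of the run ends; differencing them gives the length of each run.
    count = diff.nonzero()[0]
    count[1:] = count[1:] - count[:-1]
    count[0] += 1
    uniqPositions = positions[diff]
    return uniqPositions, count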
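
As a cross-check, NumPy's built-in `np.unique` with `return_counts=True` should produce the same pair of arrays (sorted distinct values and their counts). A small sketch, assuming the `position` values from the question's three arrays have been pooled into one hypothetical array named `positions`:

```python
import numpy as np

# Position values from the question's arrays a, b and c, pooled together.
positions = np.array([1, 2, 3, 3, 6, 4, 3, 7, 1])

# np.unique sorts the values; return_counts=True also reports how many
# times each distinct value occurs.
uniq, count = np.unique(positions, return_counts=True)

print(uniq)   # [1 2 3 4 6 7]
print(count)  # [2 1 3 1 1 1]
```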

Now, using the function above, you just need to select the positions that occur 3 times:

[The code for this step is missing from the source page.]
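
A minimal sketch of this selection step, not the original author's code. It assumes each position appears at most once within any single array (so a count of 3 means "present in all three"), and it uses `np.unique` as a stand-in for `count_positions` so that the snippet runs on its own. The name `uinqPos` is chosen only to match the spelling used in the code that follows.

```python
import numpy as np

dtype = [('position', '<i4'), ('score', '<f4')]
a = np.array([(1, 5.41), (2, 5.42), (3, 12.32)], dtype=dtype)
b = np.array([(3, 8.41), (6, 7.42), (4, 6.32)], dtype=dtype)
c = np.array([(3, 7.41), (7, 6.42), (1, 5.32)], dtype=dtype)

# Pool the position column of all three arrays.
all_positions = np.concatenate([a['position'], b['position'], c['position']])

# Count occurrences of each distinct position
# (stand-in for count_positions(all_positions)).
uniqPositions, count = np.unique(all_positions, return_counts=True)

# Assumption: no duplicates inside a single array, so count == 3
# means the position occurs once in each of a, b and c.
uinqPos = uniqPositions[count == 3]

print(uinqPos)  # [3]
```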

We are going to use searchsorted, so a, b and c need to be sorted:

a.sort(order='position')
b.sort(order='position')
c.sort(order='position')

Now we can use searchsorted to look up each of our unique positions in each array:

new_array = np.empty((len(uinqPos), 4))
new_array[:, 0] = uinqPos
index = a['position'].searchsorted(uinqPos)
new_array[:, 1] = a['score'][index]
index = b['position'].searchsorted(uinqPos)
new_array[:, 2] = b['score'][index]
index = c['position'].searchsorted(uinqPos)
new_array[:, 3] = c['score'][index]
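
To see the lookup end to end, here is a self-contained sketch on the question's sample data. It assumes the common positions have already been found (hard-coded here as `[3]`, the only position shared by all three sample arrays) and folds the three repeated lookups into a loop. Note that `searchsorted` returns insertion points, so this is only valid because every value in `uinqPos` is known to exist in each sorted array.

```python
import numpy as np

dtype = [('position', '<i4'), ('score', '<f4')]
a = np.array([(1, 5.41), (2, 5.42), (3, 12.32)], dtype=dtype)
b = np.array([(3, 8.41), (6, 7.42), (4, 6.32)], dtype=dtype)
c = np.array([(3, 7.41), (7, 6.42), (1, 5.32)], dtype=dtype)

# searchsorted requires each array to be sorted by position.
for arr in (a, b, c):
    arr.sort(order='position')

# Assumed result of the previous step for this sample data.
uinqPos = np.array([3])

# Column 0 holds the position; columns 1..3 hold the scores from a, b, c.
new_array = np.empty((len(uinqPos), 4))
new_array[:, 0] = uinqPos
for col, arr in enumerate((a, b, c), start=1):
    index = arr['position'].searchsorted(uinqPos)
    new_array[:, col] = arr['score'][index]

print(new_array)
```

The scores are stored as 32-bit floats, so the printed values are close to, but not exactly, 12.32, 8.41 and 7.41.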

There may be a more elegant solution using dictionaries, but this is the one I thought of first, so I'll leave that to someone else.
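
One possible shape for such a dictionary-based variant, offered only as a sketch and not as the answer's method: build a position-to-score mapping per array, intersect the key sets, and assemble the rows. It assumes positions are unique within each array; the names `lookups`, `common` and `result` are illustrative.

```python
import numpy as np

dtype = [('position', '<i4'), ('score', '<f4')]
a = np.array([(1, 5.41), (2, 5.42), (3, 12.32)], dtype=dtype)
b = np.array([(3, 8.41), (6, 7.42), (4, 6.32)], dtype=dtype)
c = np.array([(3, 7.41), (7, 6.42), (1, 5.32)], dtype=dtype)
arrays = [a, b, c]

# One dict per array, mapping position -> score.
lookups = [dict(zip(arr['position'].tolist(), arr['score'].tolist()))
           for arr in arrays]

# Positions present in every array: intersect the dictionaries' key sets.
common = sorted(set.intersection(*(set(d) for d in lookups)))

# One row per common position: [position, score_a, score_b, score_c].
rows = [[pos] + [d[pos] for d in lookups] for pos in common]
result = np.array(rows)

print(result.shape)  # (1, 4)
```

This version needs no sorting and extends to any number of arrays, at the cost of building Python dictionaries, which is likely slower than the array-based approach on large inputs.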
