将矩阵划分为2x2个正方形子矩阵maxpooling fprop

for i in range(inputs.shape[0]): for j in range(inputs.shape[1]): for k in range(inputs.shape[2] // 2): for h in range(inputs.shape[3] // 2): outputs[i,j,k,h] = np.amax(np.hsplit(np.vsplit(inputs[i,j], inputs.shape[2] // 2)[k], inputs.shape[1] // 2)[h]) max_ind = np.argmax(np.hsplit(np.vsplit(inputs[i,j], inputs.shape[2] // 2)[k], inputs.shape[1] // 2)[h]) max_ind_y = max_ind // inputs.shape[2] if (max_ind_y == 0): max_ind_x = max_ind else: max_ind_x = max_ind % inputs.shape[3] self.mask[i,j,max_ind_y + 2 * k, max_ind_x + 2 * h] = outputs[i,j,k,h]

2条回答

网友

1楼 · 编辑于 2024-06-26 18:00:20

步骤1：获取max_ind_x，max_ind_y

我们需要得到每个块的max元素的行、列索引-

m,n = inputs.shape
a = inputs.reshape(m//2,2,n//2,2).swapaxes(1,2)
row, col = np.unravel_index(a.reshape(a.shape[:-2] + (4,)).argmax(-1), (2,2))

步骤2：使用argmax places从输入设置输出数组

然后，看看你的代码，你似乎在试图创建一个输出数组，其中的argmax个位置是用输入数组中的值设置的。因此，我们可以-

^{pr2}$

最后，我们可以得到输出的2D形状，这将是一个很好的验证步骤来验证原始输入inputs-

out2d = out.reshape(a.shape[:2]+(2,2)).swapaxes(1,2).reshape(m,n)

样本输入，输出-

In [291]: np.random.seed(0)
     ...: inputs = np.random.randint(11,99,(6,4))

In [292]: inputs
Out[292]: 
array([[55, 58, 75, 78],
       [78, 20, 94, 32],
       [47, 98, 81, 23],
       [69, 76, 50, 98],
       [57, 92, 48, 36],
       [88, 83, 20, 31]])

In [286]: out2d
Out[286]: 
array([[ 0,  0,  0,  0],
       [78,  0, 94,  0],
       [ 0, 98,  0,  0],
       [ 0,  0,  0, 98],
       [ 0, 92, 48,  0],
       [ 0,  0,  0,  0]])

网友

2楼 · 编辑于 2024-06-26 18:00:20

这在skimage.util中实现为^{}：

blocks = skimage.util.view_as_blocks(a,(2,2))
maxs = blocks.max((2,3))

相关问题更多 >

编程相关推荐

热门问题

热门文章