Pythorch中神经网络的前向雅可比矩阵

class Network(torch.nn.Module): def __init__(self): super(Network, self).__init__() self.h_1_1 = torch.nn.Linear(input_1, hidden_1) self.h_1_2 = torch.nn.Linear(hidden_1, hidden_2) self.out = torch.nn.Linear(hidden_2, out_1) def forward(self, x): x = F.tanh(self.h_1_1(x)) x = F.tanh(self.h_1_2(x)) x = (self.out(x)) return x def jacobian(self, x): a = self.h_1_1.weight x = F.tanh(self.h_1_1(x)) tanh_deriv_tensor = 1 - (x ** 2) expanded_deriv = tanh_deriv_tensor.unsqueeze(-1).expand(-1, -1, input_1) partials = expanded_deriv * a.expand_as(expanded_deriv) a = torch.matmul(self.h_1_2.weight, partials) x = F.tanh(self.h_1_2(x)) tanh_deriv_tensor = 1 - (x ** 2) expanded_deriv = tanh_deriv_tensor.unsqueeze(-1).expand(-1, -1, out_1) partials = expanded_deriv*a partials = torch.matmul(self.out.weight, partials) determinant = partials[:, 0, 0] * partials[:, 1, 1] - partials[:, 0, 1] * partials[:, 1, 0] return determinant

1条回答

网友

1楼 · 发布于 2024-09-26 18:06:01

第二次计算'a'在我的机器（cpu）上花费的时间最多。在

# Here you increase the size of the matrix with a factor of "input_1"
expanded_deriv = tanh_deriv_tensor.unsqueeze(-1).expand(-1, -1, input_1)
partials = expanded_deriv * a.expand_as(expanded_deriv)

# Here your torch.matmul() needs to handle "input_1" times more computations than in a normal forward call
a = torch.matmul(self.h_1_2.weight, partials)

在我的机器上，计算雅可比的时间大概就是火炬计算所需的时间

^{pr2}$

我认为不可能在计算上加快速度。除非你能从一个CPU移到另一个GPU，因为GPU在大矩阵上表现得更好。在

相关问题更多 >

编程相关推荐

热门问题

热门文章