LSTM autoencoder for anomaly detection

    from tensorflow.keras import Model, layers


    # Model 1: the encoder returns the full output sequence.
    class LSTM_Detector(Model):
        def __init__(self, flight_len, param_len, hidden_state=16):
            super(LSTM_Detector, self).__init__()
            self.input_dim = (flight_len, param_len)
            self.units = hidden_state
            self.encoder = layers.LSTM(self.units,
                                       return_state=True,
                                       return_sequences=True,
                                       activation="tanh",
                                       name='encoder',
                                       input_shape=self.input_dim)
            self.decoder = layers.LSTM(self.units,
                                       return_sequences=True,
                                       activation="tanh",
                                       name="decoder",
                                       input_shape=(self.input_dim[0], self.units))
            self.dense = layers.TimeDistributed(layers.Dense(self.input_dim[1]))

        def call(self, x):
            # output: per-step outputs; hs, cs: final hidden and cell states
            output, hs, cs = self.encoder(x)
            encoded_state = [hs, cs]  # see https://www.tensorflow.org/guide/keras/rnn
            decoded = self.decoder(output, initial_state=encoded_state)
            output_decoder = self.dense(decoded)
            return output_decoder

    # Model 2: the encoder returns only its last output, which is then repeated.
    class Seq2Seq_Detector(Model):
        def __init__(self, flight_len, param_len, hidden_state=16):
            super(Seq2Seq_Detector, self).__init__()
            self.input_dim = (flight_len, param_len)
            self.units = hidden_state
            self.encoder = layers.LSTM(self.units,
                                       return_state=True,
                                       return_sequences=False,
                                       activation="tanh",
                                       name='encoder',
                                       input_shape=self.input_dim)
            self.repeat = layers.RepeatVector(self.input_dim[0])
            self.decoder = layers.LSTM(self.units,
                                       return_sequences=True,
                                       activation="tanh",
                                       name="decoder",
                                       input_shape=(self.input_dim[0], self.units))
            self.dense = layers.TimeDistributed(layers.Dense(self.input_dim[1]))

        def call(self, x):
            output, hs, cs = self.encoder(x)
            encoded_state = [hs, cs]  # see https://www.tensorflow.org/guide/keras/rnn
            repeated_vec = self.repeat(output)
            decoded = self.decoder(repeated_vec, initial_state=encoded_state)
            output_decoder = self.dense(decoded)
            return output_decoder

1 answer

User

#1 · Posted 2024-05-08 07:33:42

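In model 1, the 77 features at each time step are compressed and decompressed as 77 -> 16 -> 16 -> 77, together with some information carried over from previous steps. In this case it seems that replacing the LSTMs with just TimeDistributed(Dense(...)) might also work, but I can't be sure because I don't know the data. The third image might get better.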

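What you see in model 2's predictions usually happens when there is no useful signal in the input, and the best the model can do (well, the best the optimizer can do) is predict the mean target value of the training set.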
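As a small illustration of why a model with no usable input signal ends up at the mean: among all constant predictions, the mean of the targets is the one that minimizes mean squared error. The sketch below assumes an MSE loss and uses made-up target values for a single feature; `mse_of_constant` is a hypothetical helper, not part of the question's code.

```python
from statistics import mean


def mse_of_constant(targets, c):
    """Mean squared error when every prediction is the constant c."""
    return sum((t - c) ** 2 for t in targets) / len(targets)


# Made-up target values for a single feature (illustrative only).
targets = [1.0, 2.0, 4.0, 9.0]
target_mean = mean(targets)

# Scan a grid of candidate constants (0.0, 0.1, ..., 10.0) and keep
# the one with the lowest MSE.
candidates = [i / 10 for i in range(0, 101)]
best_constant = min(candidates, key=lambda c: mse_of_constant(targets, c))
```

Here `best_constant` coincides with `target_mean`. With an MAE loss the best constant would be the median rather than the mean, so the exact value depends on the loss used.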

In model 2, you have:

...
    self.encoder = layers.LSTM(self.units,
                  return_state=True,
                  return_sequences=False,
...

and then

    self.repeat = layers.RepeatVector(self.input_dim[0])

So, in effect, when it does

    repeated_vec = self.repeat(output)
    decoded = self.decoder(repeated_vec, initial_state=encoded_state)

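it just takes the last output from the encoder (which in this case represents the last of the 1500 steps), copies it 1500 times (input_dim[0]), and tries to predict all 1500 values from information about the last few values. This is where the model loses most of the useful signal. It has too little information about the input, or none at all, so the best thing it can learn in order to minimize the loss function (I assume MSE or MAE in this case) is to predict the mean of each feature.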
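To make the bottleneck concrete, here is a plain-Python sketch of what `RepeatVector` does to the encoder's last output. The sizes (4 time steps, 3 units) and the values are made up for readability; the question's model uses 1500 steps and 16 units. `repeat_vector` is a hypothetical stand-in for the Keras layer, applied to a single sample rather than a batch.

```python
def repeat_vector(vector, n_steps):
    """Mimic keras.layers.RepeatVector for a single sample:
    turn a (units,) vector into an (n_steps, units) sequence."""
    return [list(vector) for _ in range(n_steps)]


# Hypothetical last encoder output for one flight (3 units instead of 16).
last_output = [0.2, -0.5, 0.9]

# The decoder input: the same vector at every time step.
decoder_input = repeat_vector(last_output, n_steps=4)

# Every time step carries identical information, so the decoder
# input alone cannot tell step 0 from step 3.
all_steps_identical = all(step == decoder_input[0] for step in decoder_input)
```

Whatever the decoder reconstructs has to come from this one vector plus the initial state, which is why the per-step detail of the original sequence is lost.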

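Also, seq-to-seq models usually pass the prediction from one decoder step as input to the next decoder step; in the current case, the decoder input is always the same value.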
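The usual feedback loop of a seq-to-seq decoder can be sketched in plain Python. This is a toy illustration, not the question's model: `decode_autoregressive` and `toy_step` are hypothetical names, and the step function is a made-up stand-in for an LSTM cell followed by an output layer.

```python
def decode_autoregressive(step_fn, state, first_input, n_steps):
    """Run a decoder where each prediction becomes the next step's input."""
    outputs = []
    current_input = first_input
    for _ in range(n_steps):
        prediction, state = step_fn(current_input, state)
        outputs.append(prediction)
        current_input = prediction  # feed the prediction back in
    return outputs


def toy_step(x, state):
    """Made-up decoder step: the prediction depends on input and state."""
    new_state = state + 1
    return x + new_state, new_state


predictions = decode_autoregressive(toy_step, state=0, first_input=0, n_steps=3)
```

In model 2, by contrast, the decoder input at every step is the same repeated vector, so only the recurrent state changes from step to step.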

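TL;DR: 1) seq-to-seq is not the best model for this case; 2) because of the bottleneck, it can't really learn to do anything better than predict the mean of each feature.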
