Keras报告的验证精度指标模型.拟合记录和Sklearn.metrics.confusion_矩阵彼此不匹配

inputs = Input((Tx, )) n_e = 30 embeddings = Embedding(n_x, n_e, input_length=Tx)(inputs) out = Bidirectional(LSTM(32, recurrent_dropout=0.5, return_sequences=True))(embeddings) out = Bidirectional(LSTM(16, recurrent_dropout=0.5, return_sequences=True))(out) out = Bidirectional(LSTM(16, recurrent_dropout=0.5))(out) out = Dense(3, activation='softmax')(out) modelo = Model(inputs=inputs, outputs=out) modelo.summary()

_________________________________________________________________ Layer (type) Output Shape Param # ================================================================= input_1 (InputLayer) (None, 100) 0 _________________________________________________________________ embedding (Embedding) (None, 100, 30) 86610 _________________________________________________________________ bidirectional (Bidirectional (None, 100, 64) 16128 _________________________________________________________________ bidirectional_1 (Bidirection (None, 100, 32) 10368 _________________________________________________________________ bidirectional_2 (Bidirection (None, 32) 6272 _________________________________________________________________ dense (Dense) (None, 3) 99 ================================================================= Total params: 119,477 Trainable params: 119,477 Non-trainable params: 0 _________________________________________________________________

Train on 505 samples, validate on 127 samples Epoch 1/30 500/505 [============================>.] - ETA: 0s - loss: 0.6135 - acc: 0.6667 [...] Epoch 10/30 500/505 [============================>.] - ETA: 0s - loss: 0.1403 - acc: 0.9633 Epoch 00010: val_acc improved from 0.77953 to 0.79528, saving model to modelo-10-melhor-modelo.hdf5 505/505 [==============================] - 21s 41ms/sample - loss: 0.1393 - acc: 0.9637 - val_loss: 0.5203 - val_acc: 0.7953 Epoch 11/30 500/505 [============================>.] - ETA: 0s - loss: 0.0865 - acc: 0.9840 Epoch 00011: val_acc did not improve from 0.79528 505/505 [==============================] - 21s 41ms/sample - loss: 0.0860 - acc: 0.9842 - val_loss: 0.5257 - val_acc: 0.7953 Epoch 12/30 500/505 [============================>.] - ETA: 0s - loss: 0.0618 - acc: 0.9900 Epoch 00012: val_acc improved from 0.79528 to 0.81102, saving model to modelo-10-melhor-modelo.hdf5 505/505 [==============================] - 21s 42ms/sample - loss: 0.0615 - acc: 0.9901 - val_loss: 0.5472 - val_acc: 0.8110 Epoch 13/30 500/505 [============================>.] - ETA: 0s - loss: 0.0415 - acc: 0.9940 Epoch 00013: val_acc improved from 0.81102 to 0.82152, saving model to modelo-10-melhor-modelo.hdf5 505/505 [==============================] - 21s 42ms/sample - loss: 0.0413 - acc: 0.9941 - val_loss: 0.5853 - val_acc: 0.8215 Epoch 14/30 500/505 [============================>.] - ETA: 0s - loss: 0.0443 - acc: 0.9933 Epoch 00014: val_acc did not improve from 0.82152 505/505 [==============================] - 21s 42ms/sample - loss: 0.0453 - acc: 0.9921 - val_loss: 0.6043 - val_acc: 0.8136 Epoch 15/30 500/505 [============================>.] - ETA: 0s - loss: 0.0360 - acc: 0.9933 Epoch 00015: val_acc improved from 0.82152 to 0.84777, saving model to modelo-10-melhor-modelo.hdf5 505/505 [==============================] - 21s 42ms/sample - loss: 0.0359 - acc: 0.9934 - val_loss: 0.5663 - val_acc: 0.8478 [...] Epoch 30/30 500/505 [============================>.] - ETA: 0s - loss: 0.0039 - acc: 1.0000 Epoch 00030: val_acc did not improve from 0.84777 505/505 [==============================] - 20s 41ms/sample - loss: 0.0039 - acc: 1.0000 - val_loss: 0.8340 - val_acc: 0.8110

1条回答

网友

1楼 · 发布于 2024-10-01 00:14:53

你的问题是不适定的；正如已经说过的，你没有计算你的scikit学习模型的实际精度，因此你似乎在比较苹果和桔子。从一个标准化的混淆矩阵计算（TP+TN）/2不能给出准确度。下面是一个简单的使用玩具数据的除臭剂，从docs改编plot_confusion_matrix：

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix

# toy data
y_true = [0, 1, 0, 1, 0, 0, 0, 1]
y_pred =  [1, 1, 1, 0, 1, 1, 0, 1]
class_names=[0,1]

# plot_confusion_matrix function

def plot_confusion_matrix(y_true, y_pred, classes,
                          normalize=False,
                          title=None,
                          cmap=plt.cm.Blues):
    """
    This function prints and plots the confusion matrix.
    Normalization can be applied by setting `normalize=True`.
    """
    if not title:
        if normalize:
            title = 'Normalized confusion matrix'
        else:
            title = 'Confusion matrix, without normalization'

    # Compute confusion matrix
    cm = confusion_matrix(y_true, y_pred)

    if normalize:
        cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
        print("Normalized confusion matrix")
    else:
        print('Confusion matrix, without normalization')

    print(cm)

    fig, ax = plt.subplots()
    im = ax.imshow(cm, interpolation='nearest', cmap=cmap)
    ax.figure.colorbar(im, ax=ax)
    # We want to show all ticks...
    ax.set(xticks=np.arange(cm.shape[1]),
           yticks=np.arange(cm.shape[0]),
           # ... and label them with the respective list entries
           xticklabels=classes, yticklabels=classes,
           title=title,
           ylabel='True label',
           xlabel='Predicted label')

    # Rotate the tick labels and set their alignment.
    plt.setp(ax.get_xticklabels(), rotation=45, ha="right",
             rotation_mode="anchor")

    # Loop over data dimensions and create text annotations.
    fmt = '.2f' if normalize else 'd'
    thresh = cm.max() / 2.
    for i in range(cm.shape[0]):
        for j in range(cm.shape[1]):
            ax.text(j, i, format(cm[i, j], fmt),
                    ha="center", va="center",
                    color="white" if cm[i, j] > thresh else "black")
    fig.tight_layout()
    return ax

计算标准化混淆矩阵得到：

^{pr2}$

根据您的错误理由，准确度应为：

(0.67 + 0.2)/2
# 0.435

（注意在标准化矩阵中，行相加为100%，这在完全混淆矩阵中是不会发生的）

但现在让我们看看非标准化混淆矩阵的实际精确度是什么：

plot_confusion_matrix(y_true, y_pred, classes=class_names) # normalize=False by default
# result
Confusion matrix, without normalization
[[1 4]
 [1 2]]

其中，根据精度定义为（TP+TN）/（TP+TN+FP+FN），我们得到：

(1+2)/(1+2+4+1)
# 0.375

当然，我们不需要混淆矩阵来获得准确度这样简单的东西；正如评论中已经建议的那样，我们可以简单地使用scikit learn的内置accuracy_score方法：

from sklearn.metrics import accuracy_score
accuracy_score(y_true, y_pred)
# 0.375

毫不奇怪，这与我们从混淆矩阵直接计算的结果是一致的。在

底线：

如果存在特定的方法（如accuracy_score），那么最好使用它们而不是特别的灵感，尤其是当某些东西看起来不对劲时（比如Keras和scikit learn报告的准确度之间的差异）
事实上，在这个例子中，实际的精确度比你自己计算的要低，这显然不能说明你所报告的具体问题
如果即使在计算了您的数据的正确准确度后，仍然存在与Keras的差异，请按照新的情况更改问题，因为这会使答案无效，尽管它突出了您方法中的一个错误点，请打开一个新问题

相关问题更多 >

编程相关推荐

热门问题

热门文章