为什么我的precisionrecall和ROC曲线不平滑？

precision, recall, _ = precision_recall_curve(y_test, y_pred) plt.step(recall, precision, color='b', alpha=0.2,where='post') plt.fill_between(recall, precision, step='post', alpha=0.2, color='b') fpr, tpr, _ = roc_curve(y_test, y_pred) roc_auc = auc(fpr, tpr) plt.plot(fpr, tpr, color='darkorange', lw=2, label='ROC curve (area = %0.2f)' % roc_auc) plt.plot([0, 1], [0, 1], color='navy', lw=2, linestyle='--')

2条回答

网友

1楼 · 编辑于 2024-09-20 07:24:00

在precision_recall_curve内，y_pred必须是目标类的probabilities，而不是实际的预测类。在

因为您使用的是RandomForestClassifier，所以使用predict_proba(X)来获得概率。在

rf = RandomForestClassifier()
probas_pred = rf.predict_proba(X_test)

precision, recall, _ = precision_recall_curve(y_true, probas_pred)
plt.step(recall, precision, color='b', alpha=0.2,where='post')
plt.fill_between(recall, precision, step='post', alpha=0.2, color='b')

网友

2楼 · 编辑于 2024-09-20 07:24:00

我怀疑你用了RandomForestClassifier.predict（）方法，根据预测的类生成0或1。在

要得到概率，即为特定类投票的树的分数，必须使用RandomForestClassifier.predict_proba（）方法。在

使用这些概率作为曲线计算的输入应该可以解决这个问题。在

编辑：scikit learn的曲线生成方法首先根据预测得分对预测结果进行排序，然后根据实际值/观察值对预测值进行排序，因此曲线具有这些“弯折”。在

相关问题更多 >

编程相关推荐

热门问题

热门文章