在python的sklearn中获取集群大小

#Apply DBSCAN (sims == my data as list of lists) db1 = DBSCAN(min_samples=1, metric='precomputed').fit(sims) db1_labels = db1.labels_ db1n_clusters_ = len(set(db1_labels)) - (1 if -1 in db1_labels else 0) #Returns the number of clusters (E.g., 10 clusters) print('Estimated number of clusters: %d' % db1n_clusters_)

2条回答

网友

1楼 · 编辑于 2024-09-30 14:27:04

你可以Bincount Function in Numpy得到标签的频率。例如，我们将使用scikit learn使用example for DBSCAN：

#Store the labels
labels = db.labels_

#Then get the frequency count of the non-negative labels
counts = np.bincount(labels[labels>=0])

print counts
#Output : [243 244 245]

然后使用argsort in numpy获得前3个值。在我们的示例中，由于只有3个簇，因此我将提取前2个值：

^{pr2}$

网友

2楼 · 编辑于 2024-09-30 14:27:04

另一个选择是使用numpy.unique：

db1_labels = db1.labels_
labels, counts = np.unique(db1_labels[db1_labels>=0], return_counts=True)
print labels[np.argsort(-counts)[:3]]

相关问题更多 >

编程相关推荐

热门问题

热门文章

在python的sklearn中获取集群大小

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >