Weighted clustering with Pycluster

import numpy as np
import Pycluster as pc

points = np.asarray([
    [1.0, 20, 30, 50],
    [1.2, 15, 34, 50],
    [1.6, 13, 20, 55],
    [0.1, 16, 40, 26],
    [0.3, 26, 30, 23],
    [1.4, 20, 28, 20],
])

# would like to specify 6 weights, one for each of the elements in `points`
weights = np.asarray([1.0, 1.0, 1.0, 1.0])

clusterid, error, nfound = pc.kcluster(
    points, nclusters=2, transpose=0, npass=10,
    method='a', dist='e', weight=weights
)
centroids, _ = pc.clustercentroids(points, clusterid=clusterid)
print(centroids)

1 answer

Community user

#1 · Posted 2024-05-18 12:04:29

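Weighting individual data points is not part of the k-means algorithm as it is defined, so it is not available in Pycluster, MLlib, or TrustedAnalytics. The `weight` argument of `kcluster` takes one weight per column (feature) for the distance calculation, not one weight per row, which is why the example above passes 4 values rather than 6.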

However, you can add duplicate data points. For example, if you want the second data point to count twice as much, change the array to:

points = np.asarray([
    [1.0, 20, 30, 50],
    [1.2, 15, 34, 50],
    [1.2, 15, 34, 50],
    [1.6, 13, 20, 55],
    [0.1, 16, 40, 26],
    [0.3, 26, 30, 23],
    [1.4, 20, 28, 20],
])
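
The duplication above can also be generated rather than typed by hand. The following is a minimal sketch, assuming the weights are small positive integers; `row_counts` and `source_row` are hypothetical names introduced here for illustration, and only NumPy is used:

```python
import numpy as np

points = np.asarray([
    [1.0, 20, 30, 50],
    [1.2, 15, 34, 50],
    [1.6, 13, 20, 55],
    [0.1, 16, 40, 26],
    [0.3, 26, 30, 23],
    [1.4, 20, 28, 20],
])

# Hypothetical integer weights, one per row of `points`:
# the second point should count twice, all others once.
row_counts = np.asarray([1, 2, 1, 1, 1, 1])

# Repeat row i of `points` row_counts[i] times along axis 0.
expanded = np.repeat(points, row_counts, axis=0)

# For each row of `expanded`, the index of the original row it came from.
source_row = np.repeat(np.arange(len(points)), row_counts)

print(expanded.shape)
```

Passing `expanded` to `pc.kcluster` in place of `points` should then give the duplicated rows more influence on the centroids, and `source_row` lets you map the returned cluster ids back to the original rows. Non-integer weights would first have to be scaled and rounded to counts, which makes this an approximation and not true weighted k-means.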
