KMeans ResultIndex在第二次运行时不同

def findClosestCentroids(X, centroids): K = centroids.shape[0] m = X.shape[0] dist = np.zeros((K,1)) idx = np.zeros((m,1), dtype=int) #number of columns defines my number of data points for i in range(m): #Every column is one data point x = X[i,:] #number of rows defines my number of centroids for j in range(K): #Every row is one centroid c = centroids[j,:] #distance of the two points c and x dist[j] = np.linalg.norm(c-x) #if last centroid is processed if (j == K-1): #the Result idx is set with the index of the centroid with minimal distance idx[i] = np.argmin(dist) return idx def runkMeans(X, initial_centroids, max_iters): #Initialize values m,n = X.shape K = initial_centroids.shape[0] centroids = initial_centroids previous_centroids = centroids for i in range(max_iters): print("K_Means iteration:",i) #For each example in X, assign it to the closest centroid idx = findClosestCentroids(X, centroids) #Given the memberships, compute new centroids centroids = computeCentroids(X, idx, K) return centroids,idx

1条回答

网友

1楼 · 发布于 2024-09-26 23:16:19

K-means是一种非确定性算法。通常通过设置随机种子来控制。例如，SciKit Learn的实现为此提供了random_state参数：

from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)

参见https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html上的文档

相关问题更多 >

编程相关推荐

热门问题

热门文章