from sklearn.cluster import KMeans
import numpy as np
#insert your csv file as 2D numpy array without the header or row index. Your input data size should be total number of students * 8.
X = np.array(input_data)
#set n_clusters to half the number of students
kmeans = KMeans(n_clusters=num_students/2, random_state=0).fit(X)
kmeans.labels_
您可以运行“最近邻”算法。 例如: 见scikit NearestNeighbors
您将分数编码为8维空间中的点(例如50%->;0.5),并使用n=1运行算法
您可以使用kmeans集群。见:https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html
相关问题 更多 >
编程相关推荐