has_output
has_specified_output
has specified output
has_specified_output
achieves_planned_objective
dataset creating
clustered data set
k-means clustering
A k-means clustering is a data transformation which achieves a class discovery or partitioning objective, which takes as input a collection of objects (represented as points in multidimensional space) and which partitions them into a specified number k of clusters. The algorithm attempts to find the centers of natural clusters in the data. The most common form of the algorithm starts by partitioning the input points into k initial sets, either at random or using some heuristic data. It then calculates the mean point, or centroid, of each set. It constructs a new partition by associating each point with the closest centroid. Then the centroids are recalculated for the new clusters, and the algorithm repeated by alternate applications of these two steps until convergence, which is obtained when the points no longer switch clusters (or alternatively centroids are no longer changed).
Philippe Rocca-Serra
Elisabetta Manduchi
James Malone
WEB: http://en.wikipedia.org/wiki/K-means
k-means clustering
partitioning data transformation
class discovery data transformation
class discovery objective
pending final vetting