Terminology Service for NFDI4Health

k-means clustering

Go to external page http://purl.obolibrary.org/obo/OBI_0200041


A k-means clustering is a data transformation which achieves a class discovery or partitioning objective, which takes as input a collection of objects (represented as points in multidimensional space) and which partitions them into a specified number k of clusters. The algorithm attempts to find the centers of natural clusters in the data. The most common form of the algorithm starts by partitioning the input points into k initial sets, either at random or using some heuristic data. It then calculates the mean point, or centroid, of each set. It constructs a new partition by associating each point with the closest centroid. Then the centroids are recalculated for the new clusters, and the algorithm repeated by alternate applications of these two steps until convergence, which is obtained when the points no longer switch clusters (or alternatively centroids are no longer changed).

Term info

Label

k-means clustering

definition source

WEB: http://en.wikipedia.org/wiki/K-means

editor preferred term

k-means clustering

has curation status

http://purl.obolibrary.org/obo/IAO_0000125

term editor

Elisabetta Manduchi, James Malone, Philippe Rocca-Serra