cluster

This package provides functions for cluster analysis.

FunctionDescription
agnesComputes agglomerative hierarchical clustering of the data set.
bannerplotDraws a “banner,” i.e., basically a horizontal barplot visualizing the (agglomerative or divisive) hierarchical clustering or another binary dendrogram structure.
claraComputes a "clara" object, a list representing a clustering of the data into k clusters.
clusplotDraws a two-dimensional (2D) “clusplot” on the current graphics device.
coef.hclustComputes the “agglomerative coefficient,” measuring the clustering structure of the data set.
daisyComputes all the pairwise dissimilarities (distances) between observations in the data set.
dianaComputes a divisive hierarchical clustering of the data set, returning an object of class diana.
ellipsoidPointsComputes points on the ellipsoid boundary, mostly for drawing.
ellipsoidhullComputes the “ellipsoid hull” or “spanning ellipsoid,” i.e., the ellipsoid of minimal volume (“area” in 2D) such that all given points lie just inside or on the boundary of the ellipsoid.
fannyComputes a fuzzy clustering of the data into k clusters.
lower.to.upper.tri.indsComputes index vectors for extracting or reordering of lower or upper triangular matrices that are stored as contiguous vectors.
monaReturns a list representing a divisive hierarchical clustering of a data set with binary variables only.
pamPartitioning (clustering) of the data into k clusters “around medoids,” a more robust version of k-means clustering.
pltreeGeneric function drawing a clustering tree (“dendrogram”) on the current graphics device. There is a twins method; see pltree.twins for usage and examples.
predict.ellipsoidComputes points on the ellipsoid boundary, mostly for drawing.
silhouetteComputes silhouette information according to a given clustering in k clusters.
sizeDissReturns the number of observations (sample size) corresponding to a dissimilarity-like object or, equivalently, the number of rows or columns of a matrix when only the lower or upper triangular part (without diagonal) is given. It is nothing else but the inverse function of f(n) = n(n − 1)/2.
sortSilhouetteComputes silhouette information according to a given clustering in k clusters.
upper.to.lower.tri.indsComputes index vectors for extracting or reordering of lower or upper triangular matrices that are stored as contiguous vectors.
volumeComputes the volume of a planar object. This is a generic function and a method for ellipsoid objects.
Data SetClassDescription
agriculturedata.frameGross national product (GNP) per capita and percentage of the population working in agriculture for each country belonging to the European Union in 1993.
animalsdata.frameThis data set considers 6 binary attributes for 20 animals.
chorSubmatrixThis is a small rounded subset of the C-horizon data.
flowerdata.frameThis data set consists of 8 characteristics for 18 popular flowers.
plantTraitsdata.frameThis data set constitutes a description of 136 plant species according to biological attributes (morphological or reproductive).
plutondata.frameThe pluton data frame has 45 rows and 4 columns, containing percentages of isotopic composition of 45 plutonium batches.
ruspinidata.frameThe Ruspini data set, consisting of 75 points in 4 groups, is popular for illustrating clustering techniques.
votes.repubdata.frameA data frame with the percents of votes given to the Republican candidates in presidential elections from 1856 to 1976. Rows represent the 50 states, and columns the 31 elections.
xclaradata.frameAn artificial data set consisting of 3,000 points in 3 well-separated clusters of size 1,000 each.