cluster

This package provides functions for cluster analysis.

Functions

Function	Description
agnes	Computes agglomerative hierarchical clustering of the data set.
bannerplot	Draws a “banner,” i.e., basically a horizontal `barplot` visualizing the (agglomerative or divisive) hierarchical clustering or another binary dendrogram structure.
clara	Computes a `"clara"` object, a list representing a clustering of the data into `k` clusters.
clusplot	Draws a two-dimensional (2D) “clusplot” on the current graphics device.
coef.hclust	Computes the “agglomerative coefficient,” measuring the clustering structure of the data set.
daisy	Computes all the pairwise dissimilarities (distances) between observations in the data set.
diana	Computes a divisive hierarchical clustering of the data set, returning an object of class `diana`.
ellipsoidPoints	Computes points on the ellipsoid boundary, mostly for drawing.
ellipsoidhull	Computes the “ellipsoid hull” or “spanning ellipsoid,” i.e., the ellipsoid of minimal volume (“area” in 2D) such that all given points lie just inside or on the boundary of the ellipsoid.
fanny	Computes a fuzzy clustering of the data into `k` clusters.
lower.to.upper.tri.inds	Computes index vectors for extracting or reordering of lower or upper triangular matrices that are stored as contiguous vectors.
mona	Returns a list representing a divisive hierarchical clustering of a data set with binary variables only.
pam	Partitioning (clustering) of the data into `k` clusters “around medoids,” a more robust version of k-means clustering.
pltree	Generic function drawing a clustering tree (`“dendrogram”`) on the current graphics device. There is a `twins` method; see `pltree.twins` for usage and examples.
predict.ellipsoid	Computes points on the ellipsoid boundary, mostly for drawing.
silhouette	Computes silhouette information according to a given clustering in k clusters.
sizeDiss	Returns the number of observations (sample size) corresponding to a dissimilarity-like object or, equivalently, the number of rows or columns of a matrix when only the lower or upper triangular part (without diagonal) is given. It is nothing else but the inverse function of f(n) = n(n − 1)/2.
sortSilhouette	Computes silhouette information according to a given clustering in k clusters.
upper.to.lower.tri.inds	Computes index vectors for extracting or reordering of lower or upper triangular matrices that are stored as contiguous vectors.
volume	Computes the volume of a planar object. This is a generic function and a method for `ellipsoid` objects.

Data Sets

Data Set	Class	Description
agriculture	data.frame	Gross national product (GNP) per capita and percentage of the population working in agriculture for each country belonging to the European Union in 1993.
animals	data.frame	This data set considers 6 binary attributes for 20 animals.
chorSub	matrix	This is a small rounded subset of the C-horizon data.
flower	data.frame	This data set consists of 8 characteristics for 18 popular flowers.
plantTraits	data.frame	This data set constitutes a description of 136 plant species according to biological attributes (morphological or reproductive).
pluton	data.frame	The `pluton` data frame has 45 rows and 4 columns, containing percentages of isotopic composition of 45 plutonium batches.
ruspini	data.frame	The Ruspini data set, consisting of 75 points in 4 groups, is popular for illustrating clustering techniques.
votes.repub	data.frame	A data frame with the percents of votes given to the Republican candidates in presidential elections from 1856 to 1976. Rows represent the 50 states, and columns the 31 elections.
xclara	data.frame	An artificial data set consisting of 3,000 points in 3 well-separated clusters of size 1,000 each.