Cluster analysis


Hierarchical clustering
  • single linkage
  • complete linkage
  • average linkage
  • Ward's linkage (including Ward's method)
  • weighted average linkage
  • centroid linkage
  • median linkage

Nonhierarchical

  • kmeans
  • kmedians

Cluster on observations

Cluster using any proximity matrix

Dendrograms

  • full trees
  • subtrees
  • upper portion of tree
  • vertical or horizontal orientation
  • branch counts

Stopping rules

  • Calínski and Harabasz pseudo-F index
  • Duda and Hart Je(2)/Je(1) index

Support tools

  • generate summary and grouping variables
  • attach notes to analyses

Similarity/dissimilarity measures for continuous data

  • L2/Euclidean
  • L1/absolute/cityblock/manhattan
  • L(#)
  • Canberra
  • correlation
  • angular
 
Similarity/dissimilarity measures for binary data
  • matching
  • Jaccard
  • Russell
  • Hamann
  • Dice
  • antidice
  • Sneath
  • Rogers
  • Ochiai
  • Yule
  • Anderberg
  • Kulczynski
  • Gower2
  • Pearson

Result-management utilities

  • dir
  • list
  • drop
  • use
  • rename

User-extensible commands

  • ability to add new clustering methods and utilities
  • full set of tools to ease making additions

© Copyright 2005 Stata Corporation.