www.tlab.it

Cluster Analysis


Cluster analysis is a set of statistical techniques the aim of which is to detect groups of objects with two complementary features:

A - High internal (within cluster) homogeneity;

B - High external (between cluster) heterogeneity.

In statistical language, the characteristics "A" and "B" respectively correspond to the within and between cluster variance.

In general, there are two kinds of Cluster Analysis techniques:

T-LAB uses both types of algorithms.

In particular:

· the Co-Word Analysis option uses a hierarchical method;
· the Cluster Analysis option allows the use of three different methods: one hierarchical and two partitional (K-means and Kohonen Maps);
· the Thematic Analysis of Elementary Contexts and Thematic Document Classification options use a bisecting K-means algorithm .

Some of the publications quoted in the Bibliography provide further information on the general aspects of the various methods (Bolasco S., 1999; Lebart L., A. Morineau, M. Piron, 1995), the specific aspects relating to the Kohonen Maps (Kohonen T., 1989) and the bisecting K-means method (Steinbach, M., G. Karypis, V. Kumar, 2000; Savaresi S.M., D.L. Boley, 2001).