Correspondence Analysis

The main idea...

Correspondence analysis (CA) may be used to calculate and visualise the degree of 'correspondence' between the rows and columns of a table of frequency data, such as count, presence-absence, or abundance data. Roughly, 'correspondence' is a measure of how strongly the frequencies across rows and columns of the data table deviate from a null model of "no association" (See Legendre & Legendre, 1998 for more detail). CA is best applied to data sets where variables (columns) show unimodal distributions across objects (rows). In a CA ordination (Figure 1), each row and column will be represented by a point. The positions of these points indicate how strongly they 'correspond' to one another.

In contrast to other indirect gradient analysis methods such as principal components analysis (PCA), CA does not attempt to maximise the amount of variance explained by its reduced-space ordination. Instead, CA tries to maximise the representation of 'correspondence' between the rows and columns of the table. If you had a "sites x species" table, CA would attempt to find a reduced-space answer to questions such as "Which sites do my species prefer?" or "Which sites to my species correspond to?" This is accomplished by performing eigenanalysis on a transformation of the original data table such that each value represents a contribution to the Pearson's χ2 (chi-squared) statistic computed for the data. When calculating the χ2 coefficient, "double zeros" are excluded. See the (dis)similarity page of this guide for more information on the χ2 coefficient and the "double zero" problem. CA can ordinate either the rows or the columns with accuracy (see Reading a CA plot, below).

 

Figure 1: A representation of a CA "joint plot" where both objects (circles, representing rows) and variables (triangles, representing columns) are ordinated. Variable symbols are placed in the 'centre of gravity' or 'centre of inertia' of the objects they best describe. The distances between object and variable symbols indicate the probability of observing a given object when the influence of a given variable is present.

Results and interpretation 

Perhaps the most interesting result of CA is a "joint plot" (Figure 1) which visualises the correspondence between objects and variables. Other essential output includes:

Total inertia

Eigenvalues

Object and variable scores

The total inertia of a CA solution conveys the degree to which the values of rows and columns correspond to each other. More specifically, it reflects the degree to which rows and columns deviate from the null hypothesis of "no association", according to the logic of the Pearson's χ2 statistic.

Each CA axes has an associated eigenvalue which will represent a fraction of the total inertia. Axes with larger eigenvalues will generally be more informative than those with smaller eigenvalues. A CA joint plot typically visualises the first two or three axes with the largest eigenvalues.

Objects and variables will have scores along each CA axis calculated. These scores are used as the new set of coordinates in the CA ordination.

Reading a CA plot

Scaling

The type of scaling used for CA will determine whether object-to-object or variable-to-variable distances are meaningful. In general, object-to-variable distances are not ordinated with great accuracy; however, smaller object-to-variable distances roughly indicate the increased probability of a given variable being 'present at', 'abundant at', or otherwise influential for a given object. Regardless of the scaling chosen, the distances between points in a CA plot are χ2 distances, and must be interpreted as such. 

 Type I 

 Type II 

Type I scaling ordinates objects (rows) with accuracy. The closer object points are to one another, the more similar their distribution of values across variables.

Type II scaling ordinates variables (columns) with accuracy. The closer variable points are to one another, the more similar their distribution of values across objects.

There are alternative scalings (e.g. "Type III" or "symmetric" scaling) that attempt to reach a compromise between Type I and Type II scaling. Interpretations of these ordinations should be performed with caution, bearing in mind that both object-to-object and object-to-variable distances are likely to be distorted.

Axes

Usually, the two axes with the most inertia and thus the highest eigenvalues (see above) are shown; however, CA axes other than first two or three may still be informative for some objects and variables and should be inspected. As with PCA, the "broken stick model" and the Kaiser-Guttman criterion may be used to suggest the optimal number of axes to plot. CA axes are orthogonal to one another, meaning that they are uncorrelated.

Post-analysis

The scores of objects and variables along one or more CA axes may be used in subsequent analysis. For example, one may correlate object scores along the first axis of a CA solution using Type I scaling with an environmental variable (as an explanatory variable). Such indirect methods are informative, but more recent direct gradient analysis methods, such as canonical correspondence analysis (CCA), are perhaps better suited to questions of this kind.

Different ordinations CA ordinations may be compared using Procrustes analysis.

Key assumptions

Warnings

Walkthroughs featuring correspondence analysis

Implementations

References