Skip to main content

PERMANOVA+ for PRIMER: Guide to Software and Statistical Methods

M J Anderson, R N Gorley & K R Clarke (2008)

Introduction and overview

Chapter 1: Permutational ANOVA and MANOVA (PERMANOVA)

Key references: Method: Anderson (2001a), McArdle & Anderson (2001) Permutation techniques:...

1.1 General description

Key references Method: , Permutation techniques: ,   PERMANOVA is a routine for testing the...

1.2 Partitioning

We shall begin by considering the balanced one-way (single factor) ANOVA experimental design. A f...

1.3 Huygens’ theorem

This partitioning is fine and perfectly valid for Euclidean distances. But what happens if we wis...

1.4 Sums of squares from a distance matrix

We can now consider the structure of a distance/dissimilarity matrix and how sums of squares for ...

1.5 The pseudo-F statistic

Once the partitioning has been done we are ready to calculate a test statistic associated with th...

1.6 Test by permutation

An appropriate distribution for the pseudo-F statistic under a true null hypothesis is obtained b...

1.7 Assumptions

Recall that for traditional one-way ANOVA, the assumptions are that the errors are independent, t...

1.8 One-way example (Ekofisk oil-field macrofauna)

Our first real example comes from a study by , who studied changes in community structure of soft...

1.9 Creating a design file

We shall formally test the hypothesis of no differences in community structure among the four gro...

1.10 Running PERMANOVA

To run PERMANOVA on the Ekofisk data, click on the resemblance matrix and select PERMANOVA+ > PER...

1.11 Pair-wise comparisons

Pair-wise comparisons among all pairs of levels of a given factor of interest are obtained by doi...

1.12 Monte Carlo P-values (Victorian avifauna)

In some situations, there are not enough possible permutations to get a reasonable test. Consider...

1.13 PERMANOVA versus ANOSIM

The analysis of similarities (ANOSIM), described by is also available within PRIMER and can be u...

1.14 Two-way crossed design (Subtidal epibiota)

The primary advantage of PERMANOVA is its ability to analyse complex experimental designs. The pa...

1.15 Interpreting interactions

What do we mean by an “interaction” between two factors in multivariate space? Recall that for a ...

1.16 Additivity

Central to an understanding of what an interaction means for linear models25 is the idea of addit...

1.17 Methods of permutations

As for the one-way case, the distribution of each of the pseudo-F ratios in a multi-way design is...

1.18 Additional assumptions

Recall that we assume for the analysis of a one-way design by PERMANOVA that the multivariate obs...

1.19 Contrasts

In some cases, what is of interest in a particular experimental design is not necessarily the com...

1.20 Fixed vs random factors (Tasmanian meiofauna)

All of the factors considered so far have been fixed, but factors can be either fixed or random. ...

1.21 Components of variation

For any given ANOVA design, PERMANOVA identifies a component of variation for each term in the mo...

1.22 Expected mean squares (EMS)

An important consequence of the choice made for each factor as to whether it be fixed or random i...

1.23 Constructing $F$ from EMS

The determination of the EMS’s gives a direct indication of how the pseudo-F ratio should be cons...

1.24 Exchangeable units

The denominator mean square of the pseudo-F ratio for any particular term in the analysis is impo...

1.25 Inference space and power

It is worthwhile pausing to consider how the above tests correspond to meaningful hypotheses for ...

1.26 Testing the design

Given the fact that so many important aspects of the results (pseudo-F ratios, P-values, power, t...

1.27 Nested design (Holdfast invertebrates)

We have seen how a crossed design is identifiable by virtue of every level of one factor being pr...

1.28 Estimating components of variation

The EMS’s also yield another important insight: they provide a direct method to get unbiased esti...

1.29 Pooling or excluding terms

For a given design file, PERMANOVA, by default, will do a partitioning according to all terms tha...

1.30 Designs that lack replication (Plankton net study)

A topic related to the issue of pooling is the issue of designs that lack replication. Familiar e...

1.31 Split-plot designs (Woodstock plants)

Another special case of a design lacking appropriate replication is known as a split-plot design....

1.32 Repeated measures (Victorian avifauna, revisited)

While randomised blocks, latin squares and split-plot designs lack spatial replication, a special...

1.33 Unbalanced designs

Virtually all of the examples thus far have involved the analysis of what are known as balanced e...

1.34 Types of sums of squares (Birds from Borneo)

When the design is unbalanced, there will be a number of different ways to do the partitioning, w...

1.35 Designs with covariates (Holdfast invertebrates, revisited)

A topic that is related (perhaps surprisingly) to the topic of unbalanced designs is the analysis...

1.36 Linear combinations of mean squares (NZ fish assemblages)

Several aspects of the above analysis demonstrate its affinity with unbalanced designs. Note that...

1.37 Asymmetrical designs (Mediterranean molluscs)

Although a previous section has been devoted to the analysis of unbalanced designs, there are som...

1.38 Environmental impacts

Some further comments are appropriate here regarding experimental designs to detect environmental...

Chapter 2: Tests of homogeneity of dispersions (PERMDISP)

Key reference Method: Anderson (2006)

2.1 General description

Key reference Method:   PERMDISP is a routine for testing the homogeneity of multivariate dis...

2.2 Rationale

There are various reasons why one might wish to perform an explicit test of the null hypothesis o...

2.3 Multivariate Levene’s test (Bumpus’ sparrows)

proposed doing an analysis of variance (ANOVA) on the absolute values of deviations of observati...

2.4 Generalisation to dissimilarities

Of course, in many applications that we will encounter (especially in the case of community data)...

2.5 $P$-values by permutation

The other hurdle that must be cleared is to recognise that, in line with the philosophy of all of...

2.6 Test based on medians

Levene’s test (for univariate data) can be made more robust (i.e. less affected by outliers) by u...

2.7 Ecological example (Tikus Island corals)

An ecological example of the test for homogeneity is provided by considering a study by on coral...

2.8 Choice of measure

An extremely important point is that the test of dispersion is going to be critically affected by...

2.9 Dispersion as beta diversity (Norwegian macrofauna)

When used on species composition (presence/absence) data in conjunction with certain resemblance ...

2.10 Small sample sizes

There is one necessary restriction on the use of PERMDISP, which is that the number of replicate ...

2.11 Dispersion in nested designs (Okura macrofauna)

In many situations, the experimental design is not as simple as a one-way analysis among groups. ...

2.12 Dispersion in crossed designs (Cryptic fish)

When two factors are crossed with one another, there may be several possible hypotheses concernin...

2.13 Concluding remarks

PERMDISP is designed to test the null hypothesis of no differences in dispersions among a priori ...

Chapter 3: Principal coordinates analysis (PCO)

Key references Method: Torgerson (1958), Gower (1966)

Chapter 4: Distance-based linear models (DISTLM) and distance-based redundancy analysis (dbRDA)

Key references Method: Legendre & Anderson (1999), McArdle & Anderson (2001) Permutatio...

4.1 General description

Key references Method:, Permutation methods: , , ,   DISTLM is a routine for analysing and ...

4.2 Rationale

Just as PERMANOVA does a partitioning of variation in a data cloud described by a resemblance mat...

4.3 Partitioning

Consider an (N × p) matrix of response variables Y, where N = the number of samples and p = the n...

4.4 Simple linear regression (Clyde macrofauna)

In our first example of DISTLM, we will examine the relationship between the Shannon diversity (H...

4.5 Conditional tests

More generally, when X contains more than one variable, we may also be interested in conditional ...

4.6 (Holdfast invertebrates)

To demonstrate conditional tests in DISTLM, we will consider the number of species inhabiting hol...

4.7 Assumptions & diagnostics

Thus far, we have only done examples for a univariate response variable in Euclidean space, using...

4.8 Building models

In many situations, a scientist may have measured a large number of predictor variables that coul...

4.9 Cautionary notes

Before proceeding, a few cautionary notes are appropriate with respect to building models. First,...

4.10 (Ekofisk macrofauna)

We shall now use the DISTLM tool to identify potential parsimonious models for benthic macrofauna...

4.11 Visualising models: dbRDA

We may wish to visualise a given model in the multivariate space of our chosen resemblance matrix...

4.12 Vector overlays in dbRDA

Something which certainly should come as no surprise is to see the X variables playing an importa...

4.13 dbRDA plot for Ekofisk

Let us examine the constrained dbRDA ordination for the parsimonious model obtained earlier using...

4.14 Analysing variables in sets (Thau lagoon bacteria)

In some situations, it is useful to be able to partition variability in the data cloud according ...

4.15 Categorical predictor variables (Oribatid mites)

Sometimes the predictor variables of interest are not quantitative, continuous variables, but rat...

4.16 DISTLM versus BEST/ BIOENV

On the face of it, the DISTLM routine might be thought of as playing a similar role to PRIMER’s B...

Chapter 5: Canonical analysis of principal coordinates (CAP)

Key references Method: Anderson & Robinson (2003), Anderson & Willis (2003)

5.1 General description

Key references Method: ,   CAP is a routine for performing canonical analysis of principal co...

5.2 Rationale (Flea-beetles)

In some cases, we may know that there are differences among some pre-defined groups (for example,...

5.3 Mechanics of CAP

Details of CAP and how it is related to other methods are provided by and . In brief, a classica...

5.4 Discriminant analysis (Poor Knights Islands fish)

We will begin with an example provided by Trevor Willis and Chris Denny (; ), examining temperate...

5.5 Diagnostics

How did the CAP routine choose an appropriate number of PCO axes to use for the above discriminan...

5.6 Cross-validation

The procedure of pulling out one sample at a time and checking the ability of the model to correc...

5.7 Test by permutation (Anderson’s irises)

CAP can be used to test for significant differences among the groups in multivariate space. The t...

5.8 CAP versus PERMANOVA

It might seem confusing that both CAP and PERMANOVA can be used to test for differences among gro...

5.9 Caveats on using CAP (Tikus Island corals)

When using the CAP routine, it should come as no surprise that the hypothesis (usually) is eviden...

5.10 Adding new samples

A new utility of the windows-based version of the CAP routine in PERMANOVA+ is the ability to pla...

5.11 Canonical correlation: single gradient (Fal estuary biota)

So far, the focus has been on hypotheses concerning groups and the use of CAP for discriminant an...

5.12 Canonical correlation: multiple X’s

In some cases, interest lies in finding axes through the cloud of points so as to maximise correl...

5.13 Sphericising variables

It was previously stated that CAP effectively “sphericises” the data clouds as part of the proces...

5.14 CAP versus dbRDA

So, how does CAP differ from dbRDA for relating two sets of variables? First, dbRDA is directiona...

5.15 Comparison of methods using SVD

The relationship between dbRDA and CAP can also be seen if we consider their formulation using si...

5.16 (Hunting spiders)

A study by explored the relationships between two sets of variables: the abundances of hunting s...

Appendices