1.16 Additivity

Central to an understanding of what an interaction means for linear models²⁵ is the idea of additivity. Consider the example of a two-way crossed design for a univariate response variable, where the cell means and marginal means are as shown in Fig. 1.19a. Note that the marginal means are the means of the levels of each factor ignoring the other factor. In an additive model, the difference between two levels of a factor (say between B1 and B2) between individual cells (i.e., within each level of A, that is to say, within each column) are equal to the differences in the marginal means (i.e., the difference between the mean of B1 and B2 if factor A were to be ignored). This can be contrasted with the situation where the differences in cell means are quite different from the differences in marginal means (e.g., Fig. 1.19b), in which case, there is an interaction between the factors. So, this is another way to articulate what is meant by a significant interaction: effects of factors within levels of other factors are non-additive and thus do not match the corresponding shifts in marginal means. The interaction term, in fact, measures the deviation of the cell means we actually got from what we would expect them to be if they were to follow the marginal means, as would be the case if the effects of the two factors were purely additive.

Fig. 1.19. Marginal and cell means for a univariate crossed design showing examples of (a) additive effects, (b) multiplicative effects and (c) additivity after log₁₀-transformation of (b).

Clearly, the additivity (or not) of the effects of factors is also going to depend on whether or not the data have been transformed (or standardised or ranked) prior to analysis. This is as true for multivariate data as it is for univariate data. For example, if a log (base 10) transformation is applied to the means shown in Fig. 1.19b, then we would have an additive model with no significant interaction (Fig. 1.19c). Such a situation typifies phenomena where the true effects are multiplicative, rather than being additive. In univariate analysis, transformations can often be used to remove significant interaction terms, yielding additivity ( Tukey (1949) , Box & Cox (1964) , Kruskal (1965) , Winsberg & Ramsay (1980) ).

For multivariate analysis of ecological data, however, transformations are usually applied neither to fulfill assumptions, nor in order to remove significant interactions, but rather as a method of changing the relative emphasis of the analysis on rare versus more abundant species (e.g., Clarke & Green (1988) , Clarke & Warwick (2001) ). In PRIMER, a blanket transformation can be applied to all variables by choosing Analyse > Pre-treatment > Transform (overall) and then choosing from a range, in increasing severity, from no transformation, square root, fourth root or log(x+1) down to a reduction of the values to binary presence (1) or absence (0). An approach using an intermediate-level transformation (square root or fourth root) has been recommended as a way to reduce the contribution of highly abundant species in relation to less abundant ones in the calculation of the Bray-Curtis measure; rare species will contribute more, the more severe the transformation ( Clarke & Green (1988) , Clarke & Warwick (2001) ).

In addition to the transformation, additivity of effects in multivariate analysis is also going to depend on whether or not the dissimilarities are ranked before analysis (yet another reason why patterns in a non-metric MDS, which preserves ranks only, may not necessarily clearly reflect what is given in the PERMANOVA output). The choice of dissimilarity measure itself is also very important here. By performing the partitioning, PERMANOVA is effectively applying a linear model to a multivariate data cloud, as defined by these choices. So the presence of a significant interaction (or not) by PERMANOVA will naturally depend on them. Nevertheless, the choice of an appropriate dissimilarity measure (and also the choice of transformation, if any) should genuinely be driven by the biology and ecology (or other nature) of the system being studied and what is appropriate regarding your hypotheses, and not by reference to these statistical issues (unlike typical traditional univariate ANOVA).

²⁵ The ANOVA models analysed by PERMANOVA are linear only in the space of the multivariate cloud defined by the dissimilarity measure of choice; they are not linear in the space of the original variables (unless the resemblance measure chosen was Euclidean distance).

0.1 Title page

0.2 Contact details and installation of the PERMANOVA+ software

0.3 Introduction to the methods of PERMANOVA+

0.4 Changes from DOS to PERMANOVA+ for PRIMER

0.5 Using this manual

1.1 General description

1.2 Partitioning

1.3 Huygens’ theorem

1.4 Sums of squares from a distance matrix

1.5 The pseudo-F statistic

1.6 Test by permutation

1.7 Assumptions

1.8 One-way example (Ekofisk oil-field macrofauna)

1.9 Creating a design file

1.10 Running PERMANOVA

1.11 Pair-wise comparisons

1.12 Monte Carlo P-values (Victorian avifauna)

1.13 PERMANOVA versus ANOSIM

1.14 Two-way crossed design (Subtidal epibiota)

1.15 Interpreting interactions

1.16 Additivity

1.17 Methods of permutations

1.18 Additional assumptions

1.19 Contrasts

1.20 Fixed vs random factors (Tasmanian meiofauna)

1.21 Components of variation

1.22 Expected mean squares (EMS)

1.23 Constructing $F$ from EMS

1.24 Exchangeable units

1.25 Inference space and power

1.26 Testing the design

1.27 Nested design (Holdfast invertebrates)

1.28 Estimating components of variation

1.29 Pooling or excluding terms

1.30 Designs that lack replication (Plankton net study)

1.31 Split-plot designs (Woodstock plants)

1.32 Repeated measures (Victorian avifauna, revisited)

1.33 Unbalanced designs

1.34 Types of sums of squares (Birds from Borneo)

1.35 Designs with covariates (Holdfast invertebrates, revisited)

1.36 Linear combinations of mean squares (NZ fish assemblages)

1.37 Asymmetrical designs (Mediterranean molluscs)

1.38 Environmental impacts

2.1 General description

2.2 Rationale

2.3 Multivariate Levene’s test (Bumpus’ sparrows)

2.4 Generalisation to dissimilarities

2.5 $P$-values by permutation

2.6 Test based on medians

2.7 Ecological example (Tikus Island corals)

2.8 Choice of measure

2.9 Dispersion as beta diversity (Norwegian macrofauna)

2.10 Small sample sizes

2.11 Dispersion in nested designs (Okura macrofauna)

2.12 Dispersion in crossed designs (Cryptic fish)

2.13 Concluding remarks

3.1 General description

3.2 Rationale

3.3 Mechanics of PCO

3.4 Example: Victorian avifauna

3.5 Negative eigenvalues

3.6 Vector overlays

3.7 PCO versus PCA (Clyde environmental data)

3.8 Distances among centroids (Okura macrofauna)

3.9 PCO versus MDS

4.1 General description

4.2 Rationale

4.3 Partitioning

4.4 Simple linear regression (Clyde macrofauna)

4.5 Conditional tests

4.6 (Holdfast invertebrates)

4.7 Assumptions & diagnostics

4.8 Building models

4.9 Cautionary notes

4.10 (Ekofisk macrofauna)

4.11 Visualising models: dbRDA

4.12 Vector overlays in dbRDA

4.13 dbRDA plot for Ekofisk

4.14 Analysing variables in sets (Thau lagoon bacteria)

4.15 Categorical predictor variables (Oribatid mites)