van der Aart, P.J. and Smeek-Enserink, N. (1975) ‘Correlations between distributions of hunting spiders (Lycosidae, Ctenidae) and environmental characteristics in a dune area’, Netherlands Journal of Zoology, 25, pp. 1–45.
van Valen, L. (1978) ‘The statistics of variation’, Evolutionary Theory, 4, pp. 33–43, 202.
Vellend, M. (2001) ‘Do commonly used indices of $\beta$-diversity measure species turnover?’, Journal of Vegetation Science, 12, pp. 545–552.
Verdonschot, P.F.M. and ter Braak, C.J.F. (1994) ‘An experimental manipulation of oligochaete communities in mesocosms treated with chlorpyrifos or nutrient additions: multivariate analyses with Monte Carlo permutation tests’, Hydrobiologia, 278, pp. 251–266.
Warton, D.I. and Hudson, H.M. (2004) ‘A MANOVA statistic is just as powerful as distance-based statistics, for multivariate abundances’, Ecology, 85, pp. 858–874.
Warton, D.I. and Weber, N.C. (2002) ‘Common slope tests for bivariate errors-in-variables models’, Biometrical Journal, 44, pp. 161–174.
Warton, D.I., Wright, I.J., Falster, D.S. and Westoby, M. et al. (2006) ‘Bivariate line-fitting methods for allometry’, Biological Reviews, 81, pp. 259–291.
---
Warwick, R.M. and Clarke, K.R. (1993) ‘Increased variability as a symptom of stress in marine communities’, Journal of Experimental Marine Biology and Ecology, 172, pp. 215–226.
---
Warwick, R.M. and Clarke, K.R. (1995) ‘New `biodiversity’ measures reveal a decrease in taxonomic distinctness with increasing stress’, Marine Ecology Progress Series, 129, pp. 301–305.
---
Warwick, R.M., Clarke, K.R. and Gee, J.M. (1990) ‘The effect of disturbance by soldier crab Mictyris platycheles H. Milne Edwards on meiobenthic community structure’, Journal of Experimental Marine Biology and Ecology, 135, pp. 19–33.
---
Warwick, R.M., Clarke, K.R., and Suharsono (1990) ‘A statistical analysis of coral community responses to the 1982-83 El Niño in the Thousand Islands, Indonesia’, Coral reefs (Online), 8, pp. 171–179.
---
Welsh, A.H., Cunningham, R.B., Donnelly, C.F. and Lindenmayer D.B. (1996) ‘Modelling the abundance of rare species: statistical models for counts with extra zeros’, Ecological Modeling, 88, pp. 297–308.
---
Wessel, J. and Schork, N.J. (2006) ‘Generalized genomic distance-based regression methodology for multilocus association analysis’, American Journal of Human Genetics, 79, pp. 792–806.
---
Wheldon, M.C., Anderson, M.J. and Johnson, B.W. (2007) ‘Identifying treatment effects in multi-channel measurements in electroencephalographic studies: multivariate permutation tests and multiple comparisons’, Australian & New Zealand Journal of Statistics, 49, pp. 397–413.
---
Whittaker, R.H. (1960) ‘Vegetation of the siskiyou mountains, oregon and california’, Ecological Monographs, 22, pp. 1–44.
---
Whittaker, R.H. (1972) ‘Evolution and measurement of species diversity’, Taxon, 21, pp. 213–251.
---
Wickens, T.D. (1995) The geometry of multivariate statistics. Hillsdale, New Jersey: Lawrence Erlbaum Associates.
---
Willis, T.J. and Anderson, M.J. (2003) ‘Structure of cryptic reef fish assemblages: relationships with habitat characteristics and predator density’, Marine Ecology Progress Series, 257, pp. 209–221.
---
Willis, T.J. and Denny, C.M. (2000) ‘Effects of poor knights islands marine reserve on demersal fish populations’, Report to the Department of Conservation, Research Grant No, 2519.
---
Willis, T.J., Millar, R.B. and Babcock, R.C. (2003) ‘Protection of exploited fish in temperate regions: high density and biomass of snapper Pagrus auratus (Sparidae) in northern New Zealand marine reserves’, Journal of Applied Ecology, 40, pp. 214–227.
---
Winer, B.J., Brown, D.R. and Michels, K.M. (1991) Statistical principles in experimental design, 3rd edition. New York: McGraw-Hill.
---
Winsberg, S. and Ramsay, J.O. (1980) ‘Monotonic transformations to additivity using splines’, Biometrika, 67, pp. 669–674.
---
Winsor, C.P. and Clarke, G.L. (1940) ‘A statistical study of variation in the catch of plankton nets’, Journal of Marine Research, 3, pp. 1–34.
---
Yee, T.W. (2006) ‘Constrained additive ordination’, Ecology, 87, pp. 203–213.
---
Zhang, P. (1992) ‘On the distributional properties of model selection criteria’, Journal of the American Statistical Association, 87, pp. 732–737.
---
Zhu, M., Hastie, T.J. and Walter, G. (2005) ‘Constrained ordination analysis with flexible response functions’, Ecological Modelling, 187, pp. 524–536.
# A3 Index to mathematical notation and symbols
#### Matrices and vectors
**A** = matrix containing elements $a _ {ij} = - \frac{1}{2} d _ {ij} ^ 2 $
**B** = matrix of variables (*N* × *s*) that are linear combinations of normalised **X** variables having maximum correlation with CAP axes
**C** = matrix of CAP axes (*N* × *s*), standardised by the square root of their respective eigenvalues
**D** = matrix containing elements $d _ {ij}$ corresponding to distances or dissimilarities
**G** = Gower’s centred matrix, consisting of elements $g _ {ij} = a_ {ij} - \bar{a} _ {i \bullet} - \bar{a} _ {\bullet j} + \bar{a} _ {\bullet \bullet}$
**H** = ‘hat’ matrix = **X[X′X]$^ {-1}$X′**, used as a projection matrix for regression models
**I** = identity matrix, with 1’s along the diagonal and 0’s elsewhere
**Q** = matrix of PCO axes, standardised by the square root of their respective eigenvalues
**Q**$^0$ = matrix of PCO axes, orthonormalised to SSCP = **I** (‘sphericised’)
**U** = matrix whose columns contain the left singular vectors from a singular value decomposition (SVD) of a matrix (e.g., **X** = **UWV′**); if **X** is (*N* × *q*) and *q* < *N*, then **U** is (*N* × *q*)
**V** = matrix whose columns contain the right singular vectors from a singular value decomposition (SVD) of a matrix (e.g., **X** = **UWV′**); if **X** is (*N* × *q*) and *q* < *N*, then **V** is (*q* × *N*)
**W** = diagonal matrix of eigenvalues from a singular value decomposition (SVD) of a matrix (e.g., **X** = **UWV′**); if **X** is (*N* × *q*) and *q* < *N*, then **W** is (*q* × *q*)
**X** = matrix of predictor variables (*N* × *q*) (often a set of environmental variables)
**X**$^0$ = matrix of **X** variables, orthonormalised to SSCP = **I** (‘sphericised’)
**Y** = matrix of response variables (*N* × *p*) (often a set of species variables)
**Y**$^0$ = matrix of **Y** variables, orthonormalised to SSCP = **I** (‘sphericised’)
$\hat{ {\bf Y}}$ = **HY** = matrix of fitted values (*N* × *p*)
**y**$_ {ij} $ = vector of *p* response variables for the *j*th observation in the *i*th group
$\bar{ {\bf y}}$ = the centroid vector of *p* response variables for group *i*
**Z** = matrix of dbRDA canonical axes (*N* × *s*)
#### Letters
*a*, *b*, *c*, etc… = number of levels of factor A, B, C, etc… in an ANOVA experimental design
*AIC* = multivariate analogue to Akaike’s information criterion
*AIC*$_c$ = multivariate analogue to the small-sample-size corrected version of *AIC*
*B*$_l$ = the *l*th variable in the space of normalised **X** variables that has maximum correlation with the *l*th coordinate axis (*C*$_l$) from a CAP analysis
*BIC* = multivariate analogue to Schwarz’s ‘Bayesian information criterion’
*C*$ _l $ = the *l*th coordinate axis scores from a CAP analysis
*d*$ _ {ij} $ = distance or dissimilarity between sample *i* and sample *j*
*df* = degrees of freedom
*F* = pseudo-*F* statistic for testing hypotheses in PERMANOVA or DISTLM
*i* = index used for samples (i.e., *i* = 1, …, *N*) or index used for groups (*i* = 1, …, *a*)
*j* = second index used for samples (i.e., *j* = 1, …, *N*) or index used for replicates within a group (*j* = 1,…, *n*)
*k* = index used for variables (i.e., *k* = 1, …, *p* or else *k* = 1, …, *q*)
*l* = index used for canonical axes or eigenvalues for either dbRDA or CAP (i.e., *l* = 1, …, *s*) or either the abbreviation for ‘log-likelihood’ or the ‘length’ of a vector (depending on context).
*m* = number of PCO axes chosen as a subset for analysis by CAP
*MC* = Monte Carlo
*MS* = mean square
*N* = total number of samples
*n* = number of samples (replicates) within a group or cell in an experimental design
*P* = *P*-value associated with the test of a null hypothesis
*p* = number of multivariate response variables in matrix **Y**
*q* = total number of predictor variables in matrix **X**
*r* = Pearson correlation coefficient
*R* = the ANOSIM *R* statistic (see Clarke 1993)
*R*$^2$ = proportion of explained variation from a model
*s* = number of canonical eigenvalues and associated canonical axes obtained from either a dbRDA *or* a CAP analysis
*SS* = sum of squares
*SSCP* = sum of squares and cross products
*SVD* = singular value decomposition
*t* = pseudo-*t* statistic = $\sqrt{}$pseudo- *F*
*tr* = ‘trace’ of a matrix = the sum of the diagonal elements
*X*$ _ k $ = the *k*th predictor variable
*Y*$ _ k $ = the *k*th response variable
*z*$ _ {ij} $ = distance to group centroid for the *j*th replicate within the *i*th group.
#### Greek symbols and matrices
$ \alpha$ = significance level chosen for a test (usually $\alpha$ = 0.05).
$ \delta _ l ^ 2$ = the *l*th eigenvalue from a CAP analysis, a squared canonical correlation
$ \Delta$ = diagonal matrix containing the square roots of the eigenvalues from a CAP analysis (a capital delta)
$ \gamma _ l ^ 2$ = the *l*th eigenvalue from a dbRDA analysis, a portion of the explained (regression) sum of squares from a dbRDA model.
$ \Gamma$ = diagonal matrix containing the square roots of the eigenvalues from a dbRDA analysis (a capital gamma)
$ \lambda _ i $ = the *i*th eigenvalue from a PCO analysis
$ \Lambda $ = diagonal matrix of eigenvalues from a PCO analysis (a capital lambda)
$ \nu$ = number of parameters in a particular model during model selection
$ \rho $= Spearman rank correlation (rho)
$ \sum $ = sum over the relevant index
# A4 Index to data sets used in examples
Below is an index to the data sets used in examples, listed in order of appearance in the text. With each dataset are given the name and location of the data file, the original reference, a description of its use as an example in the manual and the page number where this can be found (italicised and in parentheses).
1. Ekofisk oil-field macrofauna (
ekma.pri in Examples v6\Ekofisk), {{@324#bkmrk-gray1990a}} - demonstrate one-way PERMANOVA (*1.8*), model selection procedures, diagnostics and building models in DISTLM (*4.10*) and visualising models using dbRDA (*4.11*).
2. Victorian avifauna (
vic.pri in Examples add-on\VictAvi), {{@324#bkmrk-macnally2005a}} – demonstrate Monte Carlo *P* values (*1.12*). Also used at the level of individual surveys (
vicsurv.pri) to demonstrate a repeated measures design (*1.32*) and also PCO (*3.4*), negative eigenvalues (*3.5*), scree plots (*3.5*) and vector overlays (*3.6*).
3. Subtidal epibiota (
sub.pri in Examples add-on\SubEpi), {{@324#bkmrk-glasby1999a}} – demonstrate a two-way crossed design (*1.14*) and contrasts (*1.19*) in PERMANOVA.
4. Tasmanian meiofauna (
tas.pri in Examples add-on\TasMei), {{@324#bkmrk-warwick1990a}} – demonstrate fixed versus random factors (*1.20*), components of variation (*1.21*), expected mean squares (*1.22*), constructing F from EMS (*1.23*), exchangeable units (*1.24*), inference space and power (*1.25*), and testing the design (*1.26*).
5. Holdfast invertebrates (
hold.pri,
holdenv.pri and
Mollusca.agg in Examples add-on\HoldNZ), {{@324#bkmrk-anderson2005a}} – demonstrate a nested design (*1.27*), estimating components of variation (*1.28*), and pooling or excluding terms (*1.29*). Also used later to demonstrate analyses with covariates in PERMANOVA (*1.35*) and marginal and conditional tests with DISTLM (*4.6*).
6. Plankton net study (
plank.pri in Examples add-on\Plankton), {{@324#bkmrk-winsor1940a}} – demonstrate designs that lack replication (*1.30*) and increased power as a result of blocking (*1.30*).
7. Woodstock plants (
wsk.pri in Examples add-on\Woodstock), {{@324#bkmrk-prober2007a}} – demonstrate a split-plot design (*1.31*).
8. Birds from Borneo (
born.pri in Examples add-on\BorneoBirds), {{@324#bkmrk-cleary2005a}} – demonstrate an unbalanced design (*1.34*).
9. New Zealand fish (
fishNZ.pri in Examples add-on\FishNZ), {{@324#bkmrk-anderson2004b}} – demonstrate analyses involving linear combinations of mean squares (*1.36*).
10. Mediterranean molluscs (
medmoll.pri in Examples add-on\MedMoll), {{@324#bkmrk-terlizzi2005a}} – demonstrate an asymmetrical design (*1.37*).
11. Bumpus’ sparrows (
spar.pri in Examples add-on\BumpSpar), {{@324#bkmrk-bumpus1898a}} – demonstrate test of dispersion in Euclidean space (*2.3*).
12. Tikus Island corals (
tick.pri in Examples v6\Corals), {{@324#bkmrk-warwick1990b}} – demonstrate test of dispersion for ecological data (*2.7*) and how choice of dissimilarity measure matters (*2.8*). Also used later to demonstrate how CAP tells you nothing about relative within-group dispersions (*5.9*).
13. Norwegian macrofauna (
norbio.pri and
norenv.pri in Examples add-on\NorMac), {{@324#bkmrk-ellingsen2002a}} – demonstrate use of the test of dispersion to investigate beta diversity (*2.9*).
14. Okura macrofauna (
okura.pri, in Examples add-on\Okura), {{@324#bkmrk-anderson2004a}} – demonstrate tests of dispersion in nested designs (*2.11*). Also used to demonstrate PCO of distances among centroids (*3.8*) and PCO versus MDS when samples are split into groups (*3.9*).
15. Cryptic fish assemblages (
cryptic.pri in Examples add-on\Cryptic), {{@324#bkmrk-willis2003a}} – demonstrate PERMDISP for a two-factor crossed design, in conjunction with PERMANOVA (*2.12*).
16. Clyde macrofauna and environmental data (
clma.pri and
clev.pri, in Examples v6\Clydemac), {{@324#bkmrk-pearson1984a}} – demonstrate PCO versus PCA for environmental data (*3.7*) and simple linear regression using DISTLM (*4.4*).
17. Thau lagoon bacteria (
thbac.pri and
thevsp.pri in Examples add-on\Thau), {{@324#bkmrk-amanieu1989a}} – demonstrate analysing variables in sets using DISTLM (*4.14*).
18. Oribatid mites (
ormites.pri and
orenvgeo.pri in Examples add-on\OrbMit), {{@324#bkmrk-borcard1992a}} – demonstrate analysing categorical predictor variables using DISTLM (*4.15*).
19. Flea-beetles (
flea.pri in Examples add-on\FleaBeet), {{@324#bkmrk-lubischew1962a}} – demonstrate the rationale for CAP by comparing unconstrained vs constrained ordination (*5.2*).
20. Poor Knights Islands fish (
pkfish.pri in Examples add-on\PKFish), {{@324#bkmrk-willis2000a}} – demonstrate discriminant analysis based on Bray-Curtis using CAP (*5.4*).
21. Iris data (
iris.pri in Examples add-on\Irises), {{@324#bkmrk-anderson1935a}} – demonstrate classical discriminant analysis and MANOVA test statistics using CAP (*5.6*). Also used later to show how the positions of new samples are added into a discriminant-type analysis, with prediction of group membership (*5.10*).
22. Fal estuary biota (
Fa.xls in Examples v6\Fal;
falbio.pri and
falenv.pri in Examples add-on\FalEst), {{@324#bkmrk-somerfield1994a}} – demonstrate canonical correlation analysis with CAP based on the Bray-Curtis measure relating biota to a single environmental gradient (*5.11*).
23. Hunting spiders (
hspi.pri and
hspienv.pri in Examples add-on\Spiders), {{@324#bkmrk-vanderaart1975a}} – demonstrate a canonical correlation-type analysis using CAP on the basis of chi-squared distances (*5.16*).