Detrended correspondence analysis

Detrended correspondence analysis (DCA) is a multivariate statistical technique widely used by ecologists to find the main factors or gradients in the large, species-rich but usually sparse data matrices that typify ecological community data. DCA is frequently used to suppress artifacts inherent in most other multivariate analyses when applied to gradient data.

History

DCA, a correspondence analysis method, was created in 1979 by Mark Hill of the United Kingdom's Institute for Terrestrial Ecology and implemented in a FORTRAN code package called DECORANA. DCA is sometimes erroneously referred to as DECORANA; however, DCA is the underlying algorithm, while DECORANA is a tool implementing it.

Issues addressed

According to Hill and Gauch, DCA suppresses two artifacts inherent in most other multivariate analyses when applied to gradient data. An example is a time-series of plant species colonising a new habitat; early successional species are replaced by mid-successional species, then by late successional ones. When such data are analysed by a standard ordination such as a correspondence analysis:

the ordination scores of the samples will exhibit the 'edge effect', i.e. the variance of the scores at the beginning and the end of a regular succession of species will be considerably smaller than that in the middle,
when presented as a graph, the points will be seen to follow a horseshoe-shaped curve rather than a straight line (the 'arch effect'), even though the process under analysis is a steady and continuous change that human intuition would prefer to see as a linear trend.

Outside ecology, the same artifacts occur when gradient data are analysed, because the curved projection is an accurate representation of the shape of the data in multivariate space.
Ter Braak and Prentice cite a simulation study of two-dimensional species packing models in which DCA performed better than correspondence analysis (CA).

Method

DCA is an iterative algorithm that has proved to be a reliable and useful tool for data exploration and summary in community ecology. It starts by running a standard ordination on the data, producing the initial horseshoe curve in which the first ordination axis distorts into the second axis. It then divides the first axis into segments and shifts each segment to have a mean value of zero on the second axis, which effectively squashes the curve flat. It also rescales the axis so that the ends are no longer compressed relative to the middle, so that one DCA unit corresponds to approximately the same rate of turnover all the way through the data; as a rule of thumb, 4 DCA units mean that there has been a total turnover in the community.
Ter Braak and Prentice warn against the non-linear rescaling of the axes because of robustness issues and recommend using detrending by polynomials only.
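The segment-detrending step described above can be sketched in a few lines of Python. This is only a simplified illustration, assuming NumPy and equal-width, non-overlapping segments applied once to a finished pair of axes; the function name `detrend_by_segments` and the parameter `n_segments` are hypothetical, and the sketch does not reproduce DECORANA, which is understood to apply detrending within its iterations. The non-linear rescaling of the first axis is not shown.

```python
import numpy as np

def detrend_by_segments(axis1, axis2, n_segments=5):
    """Subtract the within-segment mean of axis2 along axis1.

    Hypothetical helper for illustration only: segments are
    equal-width and non-overlapping, which is a simplification.
    """
    axis1 = np.asarray(axis1, dtype=float)
    axis2 = np.asarray(axis2, dtype=float)
    # Segment boundaries spanning the range of the first axis.
    edges = np.linspace(axis1.min(), axis1.max(), n_segments + 1)
    # Interior edges only, so labels run from 0 to n_segments - 1.
    seg = np.digitize(axis1, edges[1:-1])
    out = axis2.copy()
    for s in range(n_segments):
        mask = seg == s
        if mask.any():
            # Centre this segment on zero along the second axis.
            out[mask] -= axis2[mask].mean()
    return out, seg

# A synthetic arch: the second axis is a quadratic function of the first.
axis1 = np.linspace(-1.0, 1.0, 21)
axis2 = axis1 ** 2
flat, seg = detrend_by_segments(axis1, axis2, n_segments=5)
```

After the call, every segment of `flat` has a mean of zero on the second axis, so the arch is flattened, although some curvature remains inside each segment. Using more segments reduces that residual at the cost of fewer points per segment.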

Drawbacks

No significance tests are available with DCA, although there is a constrained version called DCCA in which the axes are forced by multiple linear regression to correlate optimally with a linear combination of other variables; this allows a null model to be tested by Monte Carlo permutation analysis.

Example

The example shows an ideal data set: species are in rows and samples in columns. With each step along the gradient, a new species is introduced and another species is no longer present. The result is a sparse matrix in which ones indicate the presence of a species in a sample. Except at the edges, each sample contains five species.

	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17	18	19	20
SP1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP2	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP3	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP4	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP5	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0	0
SP6	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0	0
SP7	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0	0
SP8	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0	0
SP9	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0	0
SP10	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0	0
SP11	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0	0
SP12	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0	0
SP13	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0	0
SP14	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0	0
SP15	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0	0
SP16	0	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0	0
SP17	0	0	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1	0
SP18	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1	1
SP19	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1	1
SP20	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	1	1	1
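For readers who want to reproduce the table, the matrix can be generated rather than typed in. The sketch below assumes NumPy and relies on the observation that, in the table, species i is present in sample j exactly when the two indices differ by at most 2; the variable names are illustrative.

```python
import numpy as np

n = 20
idx = np.arange(n)
# Species i is present in sample j when |i - j| <= 2 (0-based indices).
X = (np.abs(idx[:, None] - idx[None, :]) <= 2).astype(int)

# Number of species present in each sample (column sums).
species_per_sample = X.sum(axis=0)
```

The column sums confirm the description: the two outermost samples at each end hold three and four species, and every other sample holds five.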

The plot of the first two axes of the correspondence analysis result clearly shows the disadvantages of this procedure: the edge effect, i.e. the points are clustered at the edges of the first axis, and the arch effect.
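The two effects can be checked numerically on the example matrix. The sketch below is a minimal correspondence analysis via a singular value decomposition of the standardised residuals, assuming NumPy; the function `correspondence_analysis` is written for this illustration and is not taken from DECORANA or any other package. Axis signs are arbitrary in such a decomposition.

```python
import numpy as np

def correspondence_analysis(X):
    """Plain CA via SVD; returns row scores, column scores, eigenvalues."""
    X = np.asarray(X, dtype=float)
    P = X / X.sum()                 # correspondence matrix
    r = P.sum(axis=1)               # row (species) masses
    c = P.sum(axis=0)               # column (sample) masses
    expected = np.outer(r, c)
    # Standardised residuals; centring removes the trivial axis.
    S = (P - expected) / np.sqrt(expected)
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    # Principal coordinates for rows and columns.
    row_scores = (U / np.sqrt(r)[:, None]) * sv
    col_scores = (Vt.T / np.sqrt(c)[:, None]) * sv
    return row_scores, col_scores, sv ** 2

# Rebuild the example matrix: species i occurs in sample j when |i - j| <= 2.
n = 20
idx = np.arange(n)
X = (np.abs(idx[:, None] - idx[None, :]) <= 2).astype(float)

species_scores, sample_scores, eigvals = correspondence_analysis(X)
axis1, axis2 = sample_scores[:, 0], sample_scores[:, 1]

# Spacing between neighbouring samples along the first axis.
gaps = np.abs(np.diff(axis1))
```

On this matrix the first axis should order the samples along the gradient, with `gaps` smaller near the ends than in the middle (the edge effect), while the second axis should take one sign at both ends and the opposite sign in the middle (the arch effect). Plotting `axis1` against `axis2` with any plotting library shows the curve.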
