Perils of Parsimony: Properties of Reduced-Rank Estimates of Genetic Covariance Matrices

Eigenvalues and eigenvectors of covariance matrices are important statistics for multivariate problems in many applications, including quantitative genetics. Estimates of these quantities are subject to different types of bias. This article reviews and extends the existing theory on these biases, co...

Full description

Saved in:
Bibliographic Details
Published inGenetics (Austin) Vol. 180; no. 2; pp. 1153 - 1166
Main Authors Meyer, Karin, Kirkpatrick, Mark
Format Journal Article
LanguageEnglish
Published United States Genetics Soc America 01.10.2008
Genetics Society of America
Subjects
Online AccessGet full text
ISSN0016-6731
1943-2631
1943-2631
DOI10.1534/genetics.108.090159

Cover

Loading…
More Information
Summary:Eigenvalues and eigenvectors of covariance matrices are important statistics for multivariate problems in many applications, including quantitative genetics. Estimates of these quantities are subject to different types of bias. This article reviews and extends the existing theory on these biases, considering a balanced one-way classification and restricted maximum-likelihood estimation. Biases are due to the spread of sample roots and arise from ignoring selected principal components when imposing constraints on the parameter space, to ensure positive semidefinite estimates or to estimate covariance matrices of chosen, reduced rank. In addition, it is shown that reduced-rank estimators that consider only the leading eigenvalues and -vectors of the “between-group” covariance matrix may be biased due to selecting the wrong subset of principal components. In a genetic context, with groups representing families, this bias is inverse proportional to the degree of genetic relationship among family members, but is independent of sample size. Theoretical results are supplemented by a simulation study, demonstrating close agreement between predicted and observed bias for large samples. It is emphasized that the rank of the genetic covariance matrix should be chosen sufficiently large to accommodate all important genetic principal components, even though, paradoxically, this may require including a number of components with negligible eigenvalues. A strategy for rank selection in practical analyses is outlined.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Communicating editor: J. B. Walsh
Corresponding author: Animal Genetics and Breeding Unit, University of New England, Armidale NSW 2351, Australia. E-mail: kmeyer@didgeridoo.une.edu.au
This work is a joint venture with NSW Agriculture.
ISSN:0016-6731
1943-2631
1943-2631
DOI:10.1534/genetics.108.090159