Spectrum estimation: A unified framework for covariance matrix estimation and PCA in large dimensions

Covariance matrix estimation and principal component analysis (PCA) are two cornerstones of multivariate analysis. Classic textbook solutions perform poorly when the dimension of the data is of a magnitude similar to the sample size, or even larger. In such settings, there is a common remedy for bot...

Full description

Saved in:
Bibliographic Details
Published inJournal of multivariate analysis Vol. 139; pp. 360 - 384
Main Authors Ledoit, Olivier, Wolf, Michael
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.07.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Covariance matrix estimation and principal component analysis (PCA) are two cornerstones of multivariate analysis. Classic textbook solutions perform poorly when the dimension of the data is of a magnitude similar to the sample size, or even larger. In such settings, there is a common remedy for both statistical problems: nonlinear shrinkage of the eigenvalues of the sample covariance matrix. The optimal nonlinear shrinkage formula depends on unknown population quantities and is thus not available. It is, however, possible to consistently estimate an oracle nonlinear shrinkage, which is motivated on asymptotic grounds. A key tool to this end is consistent estimation of the set of eigenvalues of the population covariance matrix (also known as the spectrum), an interesting and challenging problem in its own right. Extensive Monte Carlo simulations demonstrate that our methods have desirable finite-sample properties and outperform previous proposals.
ISSN:0047-259X
1095-7243
DOI:10.1016/j.jmva.2015.04.006