Annales de l'Institut Henri Poincaré, Probabilités et Statistiques

Central limit theorems for eigenvalues in a spiked population model

Zhidong Bai and Jian-feng Yao

Source: Ann. Inst. H. Poincaré Probab. Statist. Volume 44, Number 3 (2008), 447-474.

Abstract

In a spiked population model, the population covariance matrix has all its eigenvalues equal to units except for a few fixed eigenvalues (spikes). This model is proposed by Johnstone to cope with empirical findings on various data sets. The question is to quantify the effect of the perturbation caused by the spike eigenvalues. A recent work by Baik and Silverstein establishes the almost sure limits of the extreme sample eigenvalues associated to the spike eigenvalues when the population and the sample sizes become large. This paper establishes the limiting distributions of these extreme sample eigenvalues. As another important result of the paper, we provide a central limit theorem on random sesquilinear forms.

Résumé

Dans un modèle de variances hétérogènes, les valeurs propres de la matrice de covariance des variables sont toutes égales à l’unité sauf un faible nombre d’entre elles. Ce modèle a été introduit par Johnstone comme une explication possible de la structure des valeurs propres de la matrice de covariance empirique constatée sur plusieurs ensembles de données réelles. Une question importante est de quantifier la perturbation causée par ces valeurs propres différentes de l’unité. Un travail récent de Baik et Silverstein établit la limite presque sûre des valeurs propres empiriques extrêmes lorsque le nombre de variables tend vers l’infini proportionnellement à la taille de l’échantillon. Ce travail établit un théorème limite central pour ces valeurs propres empiriques extrêmes. Il est basé sur un nouveau théorème limite central pour les formes sesquilinéaires aléatoires.

Primary Subjects: 62H25, 62E20
Secondary Subjects: 60F05, 15A52
Keywords: Sample covariance matrices; Spiked population model; Central limit theorems; Largest eigenvalue; Extreme eigenvalues; Random sesquilinear forms; Random quadratic forms

Full-text: Access denied (no subscription detected)

We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aihp/1211819420
Digital Object Identifier: doi:10.1214/07-AIHP118
Zentralblatt MATH identifier: 05611448

References

[1] Z. D. Bai, B. Q. Miao and C. R. Rao. Estimation of direction of arrival of signals: Asymptotic results. Advances in Spectrum Analysis and Array Processing, S. Haykins (Ed.), vol. II, pp. 327–347. Prentice Hall’s West Nyack, New York, 1991.
[2] Z. D. Bai. A note on limiting distribution of the eigenvalues of a class of random matrice. J. Math. Res. Exposition 5 (1985) 113–118.
Mathematical Reviews (MathSciNet): MR842111
[3] Z. D. Bai. Methodologies in spectral analysis of large dimensional random matrices, a review. Statist. Sinica 9 (1999) 611–677.
Mathematical Reviews (MathSciNet): MR1711663
[4] Z. D. Bai and J. W. Silverstein. CLT for linear spectral statistics of large-dimensional sample covariance matrices. Ann. Probab. 32 (2004) 553–605.
Mathematical Reviews (MathSciNet): MR2040792
Digital Object Identifier: doi:10.1214/aop/1078415845
Project Euclid: euclid.aop/1078415845
[5] Z. D. Bai and J. W. Silverstein. No eigenvalues outside the support of the limiting spectral distribution of large dimensional sample covariance matrices. Ann. Probab. 26 (1998) 316–345.
Mathematical Reviews (MathSciNet): MR1617051
Digital Object Identifier: doi:10.1214/aop/1022855421
Project Euclid: euclid.aop/1022855421
[6] J. Baik and J. W. Silverstein. Eigenvalues of large sample covariance matrices of spiked population models. J. Multivariate Anal. 97 (2006) 1382–1408.
Mathematical Reviews (MathSciNet): MR2279680
Digital Object Identifier: doi:10.1016/j.jmva.2005.08.003
[7] J. Baik, G. Ben Arous and S. Péché. Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices. Ann. Probab. 33 (2005) 1643–1697.
Mathematical Reviews (MathSciNet): MR2165575
Digital Object Identifier: doi:10.1214/009117905000000233
Project Euclid: euclid.aop/1127395869
[8] R. A. Horn and C. R. Johnson. Matrix Analysis. Cambridge University Press, 1985.
Mathematical Reviews (MathSciNet): MR832183
[9] I. Johnstone. On the distribution of the largest eigenvalue in principal components analysis. Ann. Statist. 29 (2001) 295–327.
Mathematical Reviews (MathSciNet): MR1863961
Digital Object Identifier: doi:10.1214/aos/1009210544
Project Euclid: euclid.aos/1009210544
[10] V. A. Marčenko and L. A. Pastur. Distribution of eigenvalues for some sets of random matrices. Math. USSR-Sb 1 (1967) 457–483.
[11] M. L. Mehta. Random Matrices. Academic Press, New York, 1991.
Mathematical Reviews (MathSciNet): MR1083764
[12] D. Paul. Asymptotics of the leading sample eigenvalues for a spiked covariance model. Statistica Sinica 17 (2007) 1617–1642.
Mathematical Reviews (MathSciNet): MR2399865
[13] S. J. Sheather and M. C. Jones. A reliable data-based bandwidth selection method for kernel density estimation. J. Roy. Stat. Soc. Ser. B 53 (1991) 683–690.
Mathematical Reviews (MathSciNet): MR1125725

2009 © Institut Henri Poincaré