The Annals of Probability

Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices

Jinho Baik, Gérard Ben Arous, and Sandrine Péché
Source: Ann. Probab. Volume 33, Number 5 (2005), 1643-1697.

Abstract

We compute the limiting distributions of the largest eigenvalue of a complex Gaussian sample covariance matrix when both the number of samples and the number of variables in each sample become large. When all but finitely many, say r, eigenvalues of the covariance matrix are the same, the dependence of the limiting distribution of the largest eigenvalue of the sample covariance matrix on those distinguished r eigenvalues of the covariance matrix is completely characterized in terms of an infinite sequence of new distribution functions that generalize the Tracy–Widom distributions of the random matrix theory. Especially a phase transition phenomenon is observed. Our results also apply to a last passage percolation model and a queueing model.

First Page: Show Hide
Primary Subjects: 15A52, 41A60, 60F99, 62E20, 62H20
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aop/1127395869
Digital Object Identifier: doi:10.1214/009117905000000233
Zentralblatt MATH identifier: 1086.15022
Mathematical Reviews number (MathSciNet): MR2165575

References

Andréief, C. (1883). Note sur une relation les intégrales définies des produits des fonctions. Mém. de la Soc. Sci. Bordeaux 2.
Bai, Z. (1999). Methodologies in spectral analysis of large-dimensional random matrices, a review. Statist. Sinica 9 611--677.
Mathematical Reviews (MathSciNet): MR1711663
Zentralblatt MATH: 0949.60077
Bai, Z. and Silverstein, J. (1995). On the empirical distribution of eigenvalues of a class of large dimensional random matrices. J. Multivariate Anal. 54 175--192.
Mathematical Reviews (MathSciNet): MR1345534
Digital Object Identifier: doi:10.1006/jmva.1995.1051
Baik, J. (2005). Painlevé formulas of the limiting distributions for non-null complex sample covariance matrices. Preprint. arXiv:math.PR/0504606.
Mathematical Reviews (MathSciNet): MR2227847
Zentralblatt MATH: 05130631
Baik, J. and Rains, E. M. (2000). Limiting distributions for a polynuclear growth model with external sources. J. Statist. Phys. 100 523--541.
Mathematical Reviews (MathSciNet): MR1788477
Digital Object Identifier: doi:10.1023/A:1018615306992
Zentralblatt MATH: 0976.82043
Baik, J. and Rains, E. M. (2001). Algebraic aspects of increasing subsequences. Duke Math. J. 109 1--65.
Mathematical Reviews (MathSciNet): MR1844203
Digital Object Identifier: doi:10.1215/S0012-7094-01-10911-3
Project Euclid: euclid.dmj/1091737220
Zentralblatt MATH: 1007.05096
Baik, J. and Rains, E. M. (2001). The asymptotics of monotone subsequences of involutions. Duke Math. J. 109 205--281.
Mathematical Reviews (MathSciNet): MR1845180
Digital Object Identifier: doi:10.1215/S0012-7094-01-10921-6
Project Euclid: euclid.dmj/1091737272
Zentralblatt MATH: 1007.60003
Borodin, A. and Forrester, P. (2003). Increasing subsequences and the hard-to-soft edge transition in matrix ensembles. J. Phys. A. 36 2963--2981.
Mathematical Reviews (MathSciNet): MR1986402
Digital Object Identifier: doi:10.1088/0305-4470/36/12/307
Zentralblatt MATH: 1033.60006
Buja, A., Hastie, T. and Tibshirani, R. (1995). Penalized discriminant analysis. Ann. Statist. 23 73--102.
Mathematical Reviews (MathSciNet): MR1331657
Deift, P. and Zhou, X. (1995). Asymptotics for the Painlevé II equation. Comm. Math. Phys. 48 277--337.
Mathematical Reviews (MathSciNet): MR1322812
Forrester, P. (2000). Painlevé transcendent evaluation of the scaled distribution of the smallest eigenvalue in the Laguerre orthogonal and symplectic ensembles. arXive:nlin.SI/0005064.
Forrester, P. (2005). Log-Gases and Random Matrices. Available at www.ms.unimelb. edu.au/~matpjf/matpjf.html. In progress.
Forrester, P. and Rains, E. (2005). Interpretations of some parameter dependent generalizations of classical matrix ensembles. Probab. Theory Related Fields 131 1--61.
Mathematical Reviews (MathSciNet): MR2105043
Digital Object Identifier: doi:10.1007/s00440-004-0375-6
Zentralblatt MATH: 1056.05142
Forrester, P. J. (1993). The spectrum edge of random matrix ensembles. Nuclear Phys. B 402 709--728.
Mathematical Reviews (MathSciNet): MR1236195
Digital Object Identifier: doi:10.1016/0550-3213(93)90126-A
Zentralblatt MATH: 1043.82538
Geman, S. (1980). A limit theorem for the norm of random matrices. Ann. Probab. 8 252--261.
Mathematical Reviews (MathSciNet): MR566592
Gessel, I. (1990). Symmetric functions and P-recursiveness. J. Combin. Theory Ser. A 53 257--285.
Mathematical Reviews (MathSciNet): MR1041448
Digital Object Identifier: doi:10.1016/0097-3165(90)90060-A
Zentralblatt MATH: 0704.05001
Hastings, S. and McLeod, J. (1980). A boundary value problem associated with the second Painlevé transcendent and the Korteweg de Vries equation. Arch. Rational Mech. Anal. 73 31--51.
Mathematical Reviews (MathSciNet): MR555581
Digital Object Identifier: doi:10.1007/BF00283254
Hoyle, D. and Rattray, M. (2003). Limiting form of the sample covariance eigenspectrum in PCA and kernel PCA. Proceedings of Neural Information Processing Systems 2003. To appear.
James, A. (1964). Distributions of matrix variates and latent roots derived from normal samples. Ann. Math. Statist. 35 475--501.
Mathematical Reviews (MathSciNet): MR181057
Digital Object Identifier: doi:10.1214/aoms/1177703550
Johansson, K. (2003). Private communication.
Johansson, K. (2000). Shape fluctuations and random matrices. Comm. Math. Phys. 209 437--476.
Mathematical Reviews (MathSciNet): MR1737991
Digital Object Identifier: doi:10.1007/s002200050027
Zentralblatt MATH: 0969.15008
Johansson, K. (2001). Discrete orthogonal polynomial ensembles and the Plancherel measure. Ann. of Math. (2) 153 259--296.
Mathematical Reviews (MathSciNet): MR1826414
Project Euclid: euclid.annm/1026916789
Johnstone, I. M. (2001). On the distribution of the largest Principal Component. Ann. Statist. 29 295--327.
Mathematical Reviews (MathSciNet): MR1863961
Digital Object Identifier: doi:10.1214/aos/1009210544
Project Euclid: euclid.aos/1009210544
Zentralblatt MATH: 1016.62078
Koekoek, R. and Swarttouw, R. (1994). The Askey-scheme of hypergeometric orthogonal polynomials and its q-analogue. Available at aw.twi.tudelft.nl/~koekoek/askey.html.
Laloux, L., Cizeau, P., Potters, M. and Bouchaud, J. (2000). Random matrix theory and financial correlations. International Journal Theoretical Applied Finance 3 391--397.
Malevergne, Y. and Sornette, D. (2002). Collective origin of the coexistence of apparent RMT noise and factors in large sample correlation matrices. arxiv:cond-mat/0210115.
Marcenko, V. A. and Pastur, L. A. Distribution of eigenvalues for some sets of random matrices. Sb. Math. 1 457--486.
Mehta, M. (1991). Random Matrices, 2nd ed. Academic Press, San Diego.
Zentralblatt MATH: 0780.60014
Mathematical Reviews (MathSciNet): MR1083764
Muirhead, R. (1982). Aspects of Multivariate Statistical Theory. Wiley, New York.
Zentralblatt MATH: 0556.62028
Mathematical Reviews (MathSciNet): MR652932
Okounkov, A. (2001). Infinite wedge and random partitions. Selecta Math. (N.S.) 7 57--81.
Mathematical Reviews (MathSciNet): MR1856553
Zentralblatt MATH: 0986.05102
Digital Object Identifier: doi:10.1007/PL00001398
Péché, S. (2003). Universality of local eigenvalue statistics for random sample covariance matrices. Ph.D. thesis, Ecole Polytechnique Fédérale de Lausanne.
Plerous, V., Gopikrishnan, P., Rosenow, B., Amaral, L., Guhr, T. and Stanley, H. (2002). Random matrix approach to cross correlations in financial data. Phys. Rev. E 65 066126.
Prähofer, M. and Spohn, H. (2000). Current fluctuations for the totally asymmetric simple exclusion process. In In and Out of Equilibrium (V. Sidoravicius, ed.) 185--204. Birkhäuser, Boston.
Mathematical Reviews (MathSciNet): MR1901953
Sear, R. and Cuesta, J. (2003). Instabilities in complex mixtures with a large number of components. Phys. Rev. Lett. 91 245701.
Soshnikov, A. (2001). A note on universality of the distribution of the largest eigenvalues in certain sample covariance matrices. J. Statist. Phys. 108 1033--1056.
Mathematical Reviews (MathSciNet): MR1933444
Zentralblatt MATH: 1018.62042
Digital Object Identifier: doi:10.1023/A:1019739414239
Stanley, R. P. (1999). Enumerative Combinatorics. Cambridge Univ. Press.
Mathematical Reviews (MathSciNet): MR1676282
Telatar, E. (1999). Capacity of multi-antenna Gaussian channels. European Transactions on Telecommunications 10 585--595.
Tracy, C. and Widom, H. (1994). Fredholm determinants, differential equations and matrix models. Comm. Math. Phys. 163 33--72.
Mathematical Reviews (MathSciNet): MR1277933
Zentralblatt MATH: 0813.35110
Digital Object Identifier: doi:10.1007/BF02101734
Project Euclid: euclid.cmp/1104270379
Tracy, C. and Widom, H. (1994). Level spacing distributions and the Airy kernel. Comm. Math. Phys. 159 33--72.
Mathematical Reviews (MathSciNet): MR1257246
Zentralblatt MATH: 0789.35152
Digital Object Identifier: doi:10.1007/BF02100489
Project Euclid: euclid.cmp/1104254495
Tracy, C. and Widom, H. (1996). On orthogonal and symplectic matrix ensembles. Comm. Math. Phys. 177 727--754.
Mathematical Reviews (MathSciNet): MR1385083
Zentralblatt MATH: 0851.60101
Digital Object Identifier: doi:10.1007/BF02099545
Project Euclid: euclid.cmp/1104286442
Tracy, C. and Widom, H. (1998). Correlation functions, cluster functions and spacing distributions for random matrices. J. Statist. Phys. 92 809--835.
Mathematical Reviews (MathSciNet): MR1657844
Zentralblatt MATH: 0942.60099
Digital Object Identifier: doi:10.1023/A:1023084324803

2013 © Institute of Mathematical Statistics

The Annals of Probability

The Annals of Probability

Turn MathJax Off
What is MathJax?