The Annals of Statistics
previous :: next

Nonlinear principal components and long-run implications of multivariate diffusions

Xiaohong Chen, Lars Peter Hansen, and José Scheinkman

Source: Ann. Statist. Volume 37, Number 6B (2009), 4279-4312.

Abstract

We investigate a method for extracting nonlinear principal components (NPCs). These NPCs maximize variation subject to smoothness and orthogonality constraints; but we allow for a general class of constraints and multivariate probability densities, including densities without compact support and even densities with algebraic tails. We provide primitive sufficient conditions for the existence of these NPCs. By exploiting the theory of continuous-time, reversible Markov diffusion processes, we give a different interpretation of these NPCs and the smoothness constraints. When the diffusion matrix is used to enforce smoothness, the NPCs maximize long-run variation relative to the overall variation subject to orthogonality constraints. Moreover, the NPCs behave as scalar autoregressions with heteroskedastic innovations; this supports semiparametric identification and estimation of a multivariate reversible diffusion process and tests of the overidentifying restrictions implied by such a process from low-frequency data. We also explore implications for stationary, possibly nonreversible diffusion processes. Finally, we suggest a sieve method to estimate the NPCs from discretely-sampled data.

Primary Subjects: 62H25, 47D07
Secondary Subjects: 35P05
Keywords: Nonlinear principal components; multivariate diffusion; quadratic form; conditional expectations operator; low-frequency data

Full-text: Access denied (no subscription detected)

We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1256303544
Digital Object Identifier: doi:10.1214/09-AOS706

References

[1] Agmon, S. (1965). Lectures on Elliptic Boundary Problems. Van Nostrand, Princeton, NJ.
Mathematical Reviews (MathSciNet): MR178246
Zentralblatt MATH: 0142.37401
[2] Azencott, R. (1974). Behavior of diffusion semi-groups at infinity. Bull. Soc. Math. France 102 193–240.
Mathematical Reviews (MathSciNet): MR356254
Zentralblatt MATH: 0293.60071
[3] Banon, G. (1978). Nonparametric identification of diffusions. SIAM J. Control Optim. 16 380–395.
Mathematical Reviews (MathSciNet): MR492159
[4] Benko, M., Hardle, W. and Kneip, A. (2009). Common functional principal components. Ann. Statist. 37 1–34.
Mathematical Reviews (MathSciNet): MR2488343
Zentralblatt MATH: 1169.62057
Digital Object Identifier: doi:10.1214/07-AOS516
Project Euclid: euclid.aos/1232115926
[5] Beurling, A. and Deny, J. (1958). Espaces de Dirichlet i, le cas elementaire. Acta Math. 99 203–224.
Mathematical Reviews (MathSciNet): MR98924
Digital Object Identifier: doi:10.1007/BF02392426
[6] Bhattacharya, R. N. (1982). On the functional central limit theorem and the law of the iterated logarithm for Markov processes. Z. Wahrsch. Verw. Gebiete 60 185–201.
Mathematical Reviews (MathSciNet): MR663900
[7] Box, G. E. P. and Tiao, G. C. (1977). Canonical analysis of multiple time series. Biometrika 64 355–365.
Mathematical Reviews (MathSciNet): MR519089
Zentralblatt MATH: 0362.62091
Digital Object Identifier: doi:10.1093/biomet/64.2.355
[8] Brezis, H. (1983). Analyse Fonctionelle. Masson, Paris.
Mathematical Reviews (MathSciNet): MR697382
[9] Chen, X., Hansen, L. P. and Scheinkman, J. (1998). Shape-preserving estimation of diffusions. Working paper, Univ. Chicago.
[10] Cobb, L. P., Kopstein, P. and Chen, N. (1983). Estimation and moment recursion relations for multimodal distributions in the exponential family. J. Amer. Statist. Assoc. 78 124–130.
[11] Darolles, S., Florens, J. P. and Gourieroux, C. (2004). Kernel based nonlinear canonical analysis and time reversibility. J. Econometrics 119 323–353.
Mathematical Reviews (MathSciNet): MR2057103
Digital Object Identifier: doi:10.1016/S0304-4076(03)00199-4
[12] Dauxois, J. and Nkiet, G. M. (1998). Nonlinear canonical analysis and independence tests. Ann. Statist. 26 1254–1278.
Mathematical Reviews (MathSciNet): MR1647653
Zentralblatt MATH: 0934.62061
Digital Object Identifier: doi:10.1214/aos/1024691242
Project Euclid: euclid.aos/1024691242
[13] Davies, E. B. (1985). l1 properties of second order operators. Bull. London Math. Soc. 17 417–436.
Mathematical Reviews (MathSciNet): MR806008
Zentralblatt MATH: 0583.35032
Digital Object Identifier: doi:10.1112/blms/17.5.417
[14] Davies, E. B. (1989). Heat Kernels and Spectral Theory. Cambridge Univ. Press, Cambridge.
Mathematical Reviews (MathSciNet): MR990239
Zentralblatt MATH: 0699.35006
[15] Demoura, S. (1998). The nonparametric estimation of the expected value operator. Workshop Presentation, Univ. Chicago.
[16] Fan, J. (2005). A selective overview of nonparametric methods in financial econometrics (with discussion). Statist. Science 20 317–357.
Mathematical Reviews (MathSciNet): MR2210224
Digital Object Identifier: doi:10.1214/088342305000000412
Project Euclid: euclid.ss/1137076647
[17] Florens, J. P., Renault, E. and Touzi, N. (1998). Testing for embeddability by stationary reversible continuous-time Markov processes. Econometric Theory 69 744–69.
Mathematical Reviews (MathSciNet): MR1666704
Digital Object Identifier: doi:10.1017/S0266466698146029
[18] Fukushima, M., Oshima, Y. and Takeda, M. (1994). Dirichlet Forms and Symmetric Markov Processes. Walter de Gruyter, Berlin.
Mathematical Reviews (MathSciNet): MR1303354
Zentralblatt MATH: 0838.31001
[19] Gobet, E., Hoffmann, M. and Reib, M. (2004). Nonparametric estimation of scalar diffusions based on low frequency data. Ann. Statist. 26 2223–2253.
Mathematical Reviews (MathSciNet): MR2102509
Zentralblatt MATH: 1056.62091
Digital Object Identifier: doi:10.1214/009053604000000797
Project Euclid: euclid.aos/1098883788
[20] Hall, P., Muller, H. G. and Wang, J. L. (2006). Properties of principal components methods for functional and longitudinal data analysis. Ann. Statist. 34 1493–1517.
Mathematical Reviews (MathSciNet): MR2278365
Zentralblatt MATH: 1113.62073
Digital Object Identifier: doi:10.1214/009053606000000272
Project Euclid: euclid.aos/1152540756
[21] Hansen, L. P. and Scheinkman, J. (1995). Back to the future. Econometrica 63 767–804.
Mathematical Reviews (MathSciNet): MR1343081
Digital Object Identifier: doi:10.2307/2171800
[22] Hansen, L. P., Scheinkman, J. and Touzi, N. (1998). Spectral methods for identifying scalar diffusions. J. Econometrics 86 1–32.
Mathematical Reviews (MathSciNet): MR1650012
Zentralblatt MATH: 0962.62094
Digital Object Identifier: doi:10.1016/S0304-4076(97)00107-3
[23] Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology 17 417–441.
[24] Kessler, M. and Sorensen, M. (1999). Estimating equations based on eigenfunctions for a discretely observed diffusion process. Bernoulli 5 299–314.
Mathematical Reviews (MathSciNet): MR1681700
Digital Object Identifier: doi:10.2307/3318437
Project Euclid: euclid.bj/1173147908
[25] Nelson, E. (1958). The adjoint Markov process. Duke Math. J. 25 671–690.
Mathematical Reviews (MathSciNet): MR101555
Zentralblatt MATH: 0084.13402
Digital Object Identifier: doi:10.1215/S0012-7094-58-02561-4
Project Euclid: euclid.dmj/1077468200
[26] Pan, J. and Yao, Q. (2008). Modelling multiple time series via common factors. Biometrika 95 365–379. Available at http://biomet.oxfordjournals.org/cgi/reprint/95/2/365.pdf, http://biomet.oxfordjournals.org/cgi/content/abstract/95/2/365.
[27] Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. Philosophical Magazine 2 559–572.
[28] Ramsay, J. and Silverman, B. W. (2005). Functional Data Analysis. Springer, New York.
Mathematical Reviews (MathSciNet): MR2168993
[29] Reed, M. and Simon, B. (1978). Methods of Modern Mathematical Physics IV: Analysis of Operators. Academic Press, San Diego.
Mathematical Reviews (MathSciNet): MR493421
Zentralblatt MATH: 0401.47001
[30] Rudin, W. (1973). Functional Analysis. McGraw-Hill, New York.
Mathematical Reviews (MathSciNet): MR365062
[31] Salinelli, E. (1998). Nonlinear principal components i: Absolutely continuous variables. Ann. Statist. 26 596–616.
Mathematical Reviews (MathSciNet): MR1626079
Zentralblatt MATH: 0929.62067
Digital Object Identifier: doi:10.1214/aos/1028144850
Project Euclid: euclid.aos/1028144850
[32] Silverman, B. W. (1996). Smoothed functional principal components analysis by choice of norm. Ann. Statist. 24 1–24.
Mathematical Reviews (MathSciNet): MR1389877
Zentralblatt MATH: 0853.62044
Digital Object Identifier: doi:10.1214/aos/1033066196
Project Euclid: euclid.aos/1033066196
[33] Wong, E. (1964). The construction of a class of stationary Markov processes. In Stochastic Processes in Mathematical Physics and Engineering (R. E. Bellman, ed.). Proceedings of Symposia in Applied Mathematics 17 264–276. Amer. Math. Soc., Providence, RI.
Mathematical Reviews (MathSciNet): MR161375
Zentralblatt MATH: 0139.34406
[34] Zhou, J. and He, X. (2008). Dimension reduction based on constrained cononical correlation and variable filtering. Ann. Statist. 36 1649–1668.
Mathematical Reviews (MathSciNet): MR2435451
Zentralblatt MATH: 1142.62045
Digital Object Identifier: doi:10.1214/07-AOS529
Project Euclid: euclid.aos/1216237295
[35] Zhou, L., Huang, J. Z. and Carroll, R. J. (2008). Joint modelling of paried sparse functional data using principal components. Biometrika 95 601–619.
previous :: next

2009 © Institute of Mathematical Statistics