The Annals of Statistics

Determining the dimension of iterative Hessian transformation

R. Dennis Cook and Bing Li
Source: Ann. Statist. Volume 32, Number 6 (2004), 2501-2531.

Abstract

The central mean subspace (CMS) and iterative Hessian transformation (IHT) have been introduced recently for dimension reduction when the conditional mean is of interest. Suppose that X is a vector-valued predictor and Y is a scalar response. The basic problem is to find a lower-dimensional predictor ηTX such that E(Y|X)=E(YTX). The CMS defines the inferential object for this problem and IHT provides an estimating procedure. Compared with other methods, IHT requires fewer assumptions and has been shown to perform well when the additional assumptions required by those methods fail. In this paper we give an asymptotic analysis of IHT and provide stepwise asymptotic hypothesis tests to determine the dimension of the CMS, as estimated by IHT. Here, the original IHT method has been modified to be invariant under location and scale transformations. To provide empirical support for our asymptotic results, we will present a series of simulation studies. These agree well with the theory. The method is applied to analyze an ozone data set.

First Page: Show Hide
Primary Subjects: 62G08
Secondary Subjects: 62G09, 62H05
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1107794877
Digital Object Identifier: doi:10.1214/009053604000000661
Mathematical Reviews number (MathSciNet): MR2153993
Zentralblatt MATH identifier: 1069.62033

References

Breiman, L. and Friedman, J. (1985). Estimating optimal transformations for multiple regression and correlation (with discussion). J. Amer. Statist. Assoc. 80 580--619.
Mathematical Reviews (MathSciNet): MR803258
Bura, E. and Cook, R. D. (2001). Estimating the structural dimension of regressions via parametric inverse regression. J. R. Stat. Soc. Ser. B Stat. Methodol. 63 393--410.
Mathematical Reviews (MathSciNet): MR1841422
Digital Object Identifier: doi:10.1111/1467-9868.00292
Clark, R. G., Henderson, H. V., Hoggard, G. K., Ellison, R. S. and Young, B. J. (1987). The ability of biochemical and haematological tests to predict recovery in periparturient recumbent cows. New Zealand Veterinary J. 35 126--133.
Cook, R. D. (1998a). Regression Graphics. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1645673
Zentralblatt MATH: 0903.62001
Cook, R. D. (1998b). Principal Hessian directions revisited (with discussion). J. Amer. Statist. Assoc. 93 84--100.
Mathematical Reviews (MathSciNet): MR1614584
Cook, R. D. and Critchley, F. (2000). Identifying regression outliers and mixtures graphically. J. Amer. Statist. Assoc. 95 781--794.
Mathematical Reviews (MathSciNet): MR1803878
Cook, R. D. and Li, B. (2002). Dimension reduction for the conditional mean in regression. Ann. Statist. 30 455--474.
Mathematical Reviews (MathSciNet): MR1902895
Digital Object Identifier: doi:10.1214/aos/1021379861
Project Euclid: euclid.aos/1021379861
Cook, R. D. and Nachtsheim, C. J. (1994). Reweighting to achieve elliptically contoured covariates in regression. J. Amer. Statist. Assoc. 89 592--599.
Cook, R. D. and Weisberg, S. (1991). Discussion of ``Sliced inverse regression for dimension reduction,'' by K. C. Li. J. Amer. Statist. Assoc. 86 328--332.
Mathematical Reviews (MathSciNet): MR1137117
Cook, R. D. and Weisberg, S. (1999). Applied Regression Including Computing and Graphics. Wiley, New York.
Zentralblatt MATH: 0928.62045
Diaconis, P. and Freedman, D. (1984). Asymptotics of graphical projection pursuit. Ann. Statist. 12 793--815.
Mathematical Reviews (MathSciNet): MR751274
Eaton, M. L. (1986). A characterization of spherical distributions. J. Multivariate Anal. 20 272--276.
Mathematical Reviews (MathSciNet): MR866075
Digital Object Identifier: doi:10.1016/0047-259X(86)90083-7
Eaton, M. L. and Tyler, D. E. (1994). The asymptotic distribution of singular values with applications to canonical correlations and correspondence analysis. J. Multivariate Anal. 50 238--264.
Mathematical Reviews (MathSciNet): MR1293045
Digital Object Identifier: doi:10.1006/jmva.1994.1041
Field, C. (1993). Tail areas of linear combinations of chi-squares and non-central chi-squares. J. Statist. Comput. Simulation 45 243--248.
Gather, U., Hilker, T. and Becker, C. (2001). A robustified version of sliced inverse regression. In Statistics in Genetics and in the Environmental Sciences (L. T. Fernholz, S. Morgenthaler and W. Stahel, eds.) 147--157. Birkhäuser, Basel.
Mathematical Reviews (MathSciNet): MR1843175
Gather, U., Hilker, T. and Becker, C. (2002). A note on outlier sensitivity of sliced inverse regression. Statistics 36 271--281.
Mathematical Reviews (MathSciNet): MR1923466
Digital Object Identifier: doi:10.1080/02331880213194
Hall, P. and Li, K. C. (1993). On almost linearity of low-dimensional projections from high-dimensional data. Ann. Statist. 21 867--889.
Mathematical Reviews (MathSciNet): MR1232523
Li, B., Cook, R. D. and Chiaromonte, F. (2003). Dimension reduction for the conditional mean in regressions with categorical predictors. Ann. Statist. 31 1636--1668.
Mathematical Reviews (MathSciNet): MR2012828
Digital Object Identifier: doi:10.1214/aos/1065705121
Project Euclid: euclid.aos/1065705121
Li, K. C. (1991). Sliced inverse regression for dimension reduction (with discussion). J. Amer. Statist. Assoc. 86 316--342.
Mathematical Reviews (MathSciNet): MR1137117
Li, K. C. (1992). On principal Hessian directions for data visualization and dimension reduction: Another application of Stein's lemma. J. Amer. Statist. Assoc. 87 1025--1039.
Mathematical Reviews (MathSciNet): MR1209564
Li, K. C. and Duan, N. (1989). Regression analysis under link violation. Ann. Statist. 17 1009--1052.
Mathematical Reviews (MathSciNet): MR1015136
Rao, C. R. (1965). Linear Statistical Inference and Its Applications. Wiley, New York.
Mathematical Reviews (MathSciNet): MR221616
Zentralblatt MATH: 0137.36203
Schott, J. (1994). Determining the dimensionality in sliced inverse regression. J. Amer. Statist. Assoc. 89 141--148.
Mathematical Reviews (MathSciNet): MR1266291
Yin, X. and Cook, R. D. (2002). Dimension reduction for the conditional $k$th moment in regression. J. R. Stat. Soc. Ser. B Stat. Methodol. 64 159--175.
Mathematical Reviews (MathSciNet): MR1904698
Digital Object Identifier: doi:10.1111/1467-9868.00330

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics