Institute of Mathematical Statistics Collections
previous :: next

Higher order influence functions and minimax estimation of nonlinear functionals

James Robins, Lingling Li, Eric Tchetgen, Aad van der Vaart

Abstract

We present a theory of point and interval estimation for nonlinear functionals in parametric, semi-, and non-parametric models based on higher order influence functions (Robins (2004), Section 9; Li et al. (2004), Tchetgen et al. (2006), Robins et al. (2007)). Higher order influence functions are higher order U-statistics. Our theory extends the first order semiparametric theory of Bickel et al. (1993) and van der Vaart (1991) by incorporating the theory of higher order scores considered by Pfanzagl (1990), Small and McLeish (1994) and Lindsay and Waterman (1996). The theory reproduces many previous results, produces new non-$\sqrt{n}$ results, and opens up the ability to perform optimal non-$\sqrt{n}$ inference in complex high dimensional models. We present novel rate-optimal point and interval estimators for various functionals of central importance to biostatistics in settings in which estimation at the expected $\sqrt{n}$ rate is not possible, owing to the curse of dimensionality. We also show that our higher order influence functions have a multi-robustness property that extends the double robustness property of first order influence functions described by Robins and Rotnitzky (2001) and van der Laan and Robins (2003).

First Page: Show Hide
Primary Subjects: 62G05, 62C20
Keywords: influence functions; nonparametric; minimax; robust inference; U-statistics
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1207580092
Digital Object Identifier: doi:10.1214/193940307000000527

References

[1] Arellano, M. (2003). Panel Data Econometrics. Oxford Univ. Press.
Mathematical Reviews (MathSciNet): MR2060514
Zentralblatt MATH: 1057.62112
[2] Bhattacharyya, A. (1947). On some analogues of the amount of information and their use in statistical estimation. II–III. Sankhyā 8 201–218.
Mathematical Reviews (MathSciNet): MR20242
[3] Bickel, P., Klassen, C., Ritov, Y. and Wellner, J. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Springer, New York.
[4] Bickel, P. and Ritov, Y. (2003). Nonparametric estimators which can be “plugged-in”. Ann. Statist. 31 1033–53.
Mathematical Reviews (MathSciNet): MR2001641
Zentralblatt MATH: 1058.62031
Digital Object Identifier: doi:10.1214/aos/1059655904
Project Euclid: euclid.aos/1059655904
[5] Birge, L. and Massart, P. (1995). Estimation of integral functionals of a density. Ann. Statist. 23 11–29.
Mathematical Reviews (MathSciNet): MR1331653
Digital Object Identifier: doi:10.1214/aos/1176324452
Project Euclid: euclid.aos/1176324452
[6] Cai, T., Levine, M. and Wang, L. (2006). Variance function estimation in multivariate nonparametric regression. Technical report.
[7] He, X. and Shao, Q. M. (2000). On parameters of increasing dimension. J. Multivariate Anal. 73 120–135.
Mathematical Reviews (MathSciNet): MR1766124
Zentralblatt MATH: 0948.62013
Digital Object Identifier: doi:10.1006/jmva.1999.1873
[8] Lindsay, R. and Waterman, B. (1996). Projected score methods for approximating conditional scores. Biometrika 83 1–13.
Mathematical Reviews (MathSciNet): MR1399151
Zentralblatt MATH: 0866.62011
Digital Object Identifier: doi:10.1093/biomet/83.1.1
[9] Li, L., Tchetgen, E., van der Vaart, A. W. and Robins, J. M. (2006). Robust inference with higher order inference functions: Part II. In 2005 JSM Proceedings 2558–2565. American Statistical Association, Alexandria.
[10] Mallat, S. G. (1998). A Wavelet Tour of Signal Processing. Academic Press, San Diego.
Mathematical Reviews (MathSciNet): MR1614527
[11] Pfanzagl, J. (1990). Estimation in Semiparametric Models: Some Recent Developments. Springer, New York.
Mathematical Reviews (MathSciNet): MR1048589
Zentralblatt MATH: 0704.62034
[12] Portnoy, S. (1988). Asymptotic behavior of likelihood methods for exponential families when the number of parameters tends to infinity. Ann. Statist. 16 356–366.
Mathematical Reviews (MathSciNet): MR924876
Zentralblatt MATH: 0637.62026
Digital Object Identifier: doi:10.1214/aos/1176350710
Project Euclid: euclid.aos/1176350710
[13] Pyke, R. (1965). Spacings (with discussion). J. Roy. Statist. Soc. Ser. B 27 395–449.
Mathematical Reviews (MathSciNet): MR216622
[14] Ritov, Y. and Bickel, P. (1990). Achieving information bounds in non- and semi-parametric models. Ann. Statist. 18 925–938.
Mathematical Reviews (MathSciNet): MR1056344
Zentralblatt MATH: 0722.62025
Digital Object Identifier: doi:10.1214/aos/1176347633
Project Euclid: euclid.aos/1176347633
[15] Robins, J. and Ritov, Y. (1997). Toward a curse-of-dimensionality appropriate (CODA) asymptotic theory for semiparametric models. Statistics in Medicine 16 285–319.
[16] Robins, J. M. (2004). Optimal structural nested models for optimal sequential decisions. In Proceedings of the Second Seattle Symposium in Biostatistics (D. Y. Lin and P. Heagerty, eds.). Springer, New York.
Mathematical Reviews (MathSciNet): MR2129402
Zentralblatt MATH: 05196589
[17] Robins, J. M. and Rotnitzky, A. (2001). Comment on “Inference for semiparametric models: Some questions and an answer” by Bickel and Kwon. Statist. Sinica 11 920–936. [“On Double Robustness.”]
Mathematical Reviews (MathSciNet): MR1866079
[18] Robins, J. M., Li, L., Tchetgen, E. and van der Vaart, A. W. (2007). Asymptotic normality of degenerate U-statistics. Working paper.
[19] Robins, J. M. and van der Vaart, A. W. (2006). Adaptive nonparametric confidence sets. Ann. Statist. 34 229–253.
Mathematical Reviews (MathSciNet): MR2275241
Zentralblatt MATH: 1091.62039
Digital Object Identifier: doi:10.1214/009053605000000877
Project Euclid: euclid.aos/1146576262
[20] Small, D. and McLeish, C. (1994). Hilbert Space Methods in Probability and Statistical Inference. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1269321
Zentralblatt MATH: 0838.62002
[21] Tchetgen, E., Li, L., van der Vaart, A. W. and Robins, J. M. (2006). Robust inference with higher order inference functions: Part I. In 2005 JSM Proceedings 2644–2651. American Statistical Association, Alexandria.
[22] Tchetgen, E., Li, L., van der Vaart, A. W. and Robins, J. M. (2007). Higher Order U-statistics estimators for longitudinal missing data and causal inference models. Working paper.
[23] van der Laan, M. and Dudoit, S. (2005). Asymptotics of cross-validated risk estimation in estimator selection and performance assessment. Stat. Methodol. 2 131–154.
Mathematical Reviews (MathSciNet): MR2161394
Digital Object Identifier: doi:10.1016/j.stamet.2005.02.003
[24] van der Laan, M. and Robins, J. M. (2003). Unified Methods for Censored Longitudinal Data and Causality. Springer, New York.
Mathematical Reviews (MathSciNet): MR1958123
Zentralblatt MATH: 1013.62034
[25] van der Vaart, A. W. (1991). On differentiable functionals. Ann. Statist. 19 178–204.
Mathematical Reviews (MathSciNet): MR1091845
Zentralblatt MATH: 0732.62035
Digital Object Identifier: doi:10.1214/aos/1176347976
Project Euclid: euclid.aos/1176347976
[26] van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge.
[27] Wang, L., Brown, L. D., Cai, T. and Levine, M. (2006). Effect of mean on variance function estimation in nonparametric regression. Technical report.
previous :: next

2012 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections