We present a theory of point and interval estimation for nonlinear functionals in parametric, semiparametric, and nonparametric models based on higher order influence functions [Robins (2004), Section 9; Li et al. (2004); Tchetgen et al. (2006); Robins et al. (2007)]. Higher order influence functions are higher order U-statistics. Our theory extends the first order semiparametric theory of Bickel et al. (1993) and van der Vaart (1991) by incorporating the theory of higher order scores considered by Pfanzagl (1990), Small and McLeish (1994) and Lindsay and Waterman (1996). The theory reproduces many previous results, produces new non-$\sqrt{n}$ results, and makes it possible to perform optimal non-$\sqrt{n}$ inference in complex high-dimensional models. We present novel rate-optimal point and interval estimators for various functionals of central importance to biostatistics in settings in which estimation at the expected $\sqrt{n}$ rate is not possible, owing to the curse of dimensionality. We also show that our higher order influence functions have a multi-robustness property that extends the double robustness property of first order influence functions described by Robins and Rotnitzky (2001) and van der Laan and Robins (2003).
References
[1] Arellano, M. (2003). Panel Data Econometrics. Oxford Univ. Press.
[2] Bhattacharyya, A. (1947). On some analogues of the amount of information and their use in statistical estimation. II–III. Sankhyā 8 201–218. MR20242
[3] Bickel, P., Klaassen, C., Ritov, Y. and Wellner, J. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Springer, New York.
[4] Bickel, P. and Ritov, Y. (2003). Nonparametric estimators which can be “plugged-in”. Ann. Statist. 31 1033–1053.
[5] Birgé, L. and Massart, P. (1995). Estimation of integral functionals of a density. Ann. Statist. 23 11–29.
[6] Cai, T., Levine, M. and Wang, L. (2006). Variance function estimation in multivariate nonparametric regression. Technical report.
[7] He, X. and Shao, Q. M. (2000). On parameters of increasing dimension. J. Multivariate Anal. 73 120–135.
[8] Lindsay, B. and Waterman, R. (1996). Projected score methods for approximating conditional scores. Biometrika 83 1–13.
[9] Li, L., Tchetgen, E., van der Vaart, A. W. and Robins, J. M. (2006). Robust inference with higher order influence functions: Part II. In 2005 JSM Proceedings 2558–2565. American Statistical Association, Alexandria.
[10] Mallat, S. G. (1998). A Wavelet Tour of Signal Processing. Academic Press, San Diego.
[11] Pfanzagl, J. (1990). Estimation in Semiparametric Models: Some Recent Developments. Springer, New York.
[12] Portnoy, S. (1988). Asymptotic behavior of likelihood methods for exponential families when the number of parameters tends to infinity. Ann. Statist. 16 356–366. MR924876
[13] Pyke, R. (1965). Spacings (with discussion). J. Roy. Statist. Soc. Ser. B 27 395–449. MR216622
[14] Ritov, Y. and Bickel, P. (1990). Achieving information bounds in non- and semi-parametric models. Ann. Statist. 18 925–938.
[15] Robins, J. and Ritov, Y. (1997). Toward a curse-of-dimensionality appropriate (CODA) asymptotic theory for semiparametric models. Statistics in Medicine 16 285–319.
[16] Robins, J. M. (2004). Optimal structural nested models for optimal sequential decisions. In Proceedings of the Second Seattle Symposium in Biostatistics (D. Y. Lin and P. Heagerty, eds.). Springer, New York.
[17] Robins, J. M. and Rotnitzky, A. (2001). Comment on “Inference for semiparametric models: Some questions and an answer” by Bickel and Kwon. Statist. Sinica 11 920–936. [“On Double Robustness.”]
[18] Robins, J. M., Li, L., Tchetgen, E. and van der Vaart, A. W. (2007). Asymptotic normality of degenerate U-statistics. Working paper.
[19] Robins, J. M. and van der Vaart, A. W. (2006). Adaptive nonparametric confidence sets. Ann. Statist. 34 229–253.
[20] Small, C. and McLeish, D. (1994). Hilbert Space Methods in Probability and Statistical Inference. Wiley, New York.
[21] Tchetgen, E., Li, L., van der Vaart, A. W. and Robins, J. M. (2006). Robust inference with higher order influence functions: Part I. In 2005 JSM Proceedings 2644–2651. American Statistical Association, Alexandria.
[22] Tchetgen, E., Li, L., van der Vaart, A. W. and Robins, J. M. (2007). Higher Order U-statistics estimators for longitudinal missing data and causal inference models. Working paper.
[23] van der Laan, M. and Dudoit, S. (2005). Asymptotics of cross-validated risk estimation in estimator selection and performance assessment. Stat. Methodol. 2 131–154.
[24] van der Laan, M. and Robins, J. M. (2003). Unified Methods for Censored Longitudinal Data and Causality. Springer, New York.
[25] van der Vaart, A. W. (1991). On differentiable functionals. Ann. Statist. 19 178–204.
[26] van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge.
[27] Wang, L., Brown, L. D., Cai, T. and Levine, M. (2006). Effect of mean on variance function estimation in nonparametric regression. Technical report.