Electronic Journal of Statistics

Variance function additive partial linear models

Abstract

To model heteroscedasticity in a broad class of additive partial linear models, we allow the variance function to be an additive partial linear model as well and the parameters in the variance function to be different from those in the mean function. We develop a two-step estimation procedure, where in the first step initial estimates of the parameters in both the mean and variance functions are obtained and then in the second step the estimates are updated using the weights based on the initial estimates. We use polynomial splines to approximate the additive nonparametric components in both the mean and variation functions and derive their convergence rates. The resulting weighted estimators of the linear coefficients in both the mean and variance functions are shown to be asymptotically normal and more efficient than the initial un-weighted estimators. Simulation experiments are conducted to examine the numerical performance of the proposed procedure, which is also applied to analyze the dataset from a nutritional epidemiology study.

Article information

Source
Electron. J. Statist., Volume 9, Number 2 (2015), 2793-2827.

Dates
First available in Project Euclid: 31 December 2015

https://projecteuclid.org/euclid.ejs/1451577217

Digital Object Identifier
doi:10.1214/15-EJS1080

Mathematical Reviews number (MathSciNet)
MR3439185

Zentralblatt MATH identifier
1329.62199

Citation

Fang, Yixin; Lian, Heng; Liang, Hua; Ruppert, David. Variance function additive partial linear models. Electron. J. Statist. 9 (2015), no. 2, 2793--2827. doi:10.1214/15-EJS1080. https://projecteuclid.org/euclid.ejs/1451577217

References

• Bickel, P. (1978). Using residuals robustly i: Tests for heteroscedasticity, nonlinearity, The Annals of Statistics 6: 266–291.
• Bickel, P. J., Klaasen, C. A. J., Ritov, Y. and Wellner, J. A. (1993)., Efficient and Adaptive Estimation for Semiparametric Models, The Johns Hopkins University Press.
• Box, G. and Hill, W. (1974). Correcting inhomogeneity of variance with power transformation weighting, Technometrics 16: 385–389.
• Box, G. and Meyer, D. (1986). An analysis for unreplicated fractional factorials, Technometrics 28: 11–18.
• Cai, T. T. and Wang, L. (2008). Adaptive variance function estimation in heteroscedastic nonparametric regression, The Annals of Statistics 36: 2025–2054.
• Carroll, R. J. (1982). Adapting for heteroscedasticity in linear models, The Annals of Statistics 10: 1224–1233.
• Carroll, R. J. (2003). Variances are not always nuisance parameters, Biometrics 59(2): 211–220.
• Carroll, R. J. and Härdle, W. (1989). Second order effects in semiparametric weighted least squares regression, Statistics 2: 179–186.
• Carroll, R. J. and Ruppert, D. (1982). Robust estimation in heteroscedasticity linear models, The Annals of Statistics 10: 429–441.
• Carroll, R. and Ruppert, D. (1988)., Transformation and Weighting in Regression, Chapman& Hall, New York.
• Davidian, M. and Carroll, R. J. (1987). Variance function estimation, Journal of the American Statistical Association 82: 1079–1091.
• de Boor, C. (2001)., A Practical Guide to Splines, Vol. 27 of Applied Mathematical Sciences, revised edn, Springer-Verlag, New York.
• Fan, J., Feng, Y. and Song, R. (2011). Nonparametric independence screening in sparse ultra-high dimensional additive models, Journal of the American Statistical Association 106: 544–557.
• Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties, Journal of the American Statistical Association 96: 1348–1360.
• Hall, P. and Carroll, R. J. (1989). Variance function estimation in regression – the effect of estimating the mean, Journal of the Royal Statistical Society Series B-Methodological 51(1): 3–14.
• Hastie, T. J. and Tibshirani, R. J. (1990)., Generalized Additive Models, Vol. 43 of Monographs on Statistics and Applied Probability, Chapman and Hall, London.
• Huang, J. (1999). Efficient estimation of the partially linear additive Cox model, The Annals of Statistics 27: 1536–1563.
• Huang, J. (2003). Local asymptotics for polynomial spline regression, The Annals of Statistics 31: 1600–1625.
• Knight, K. (1998). Limiting distributions for $L_1$ regression estimators under general conditions, The Annals of Statistics 26(2): 755–770.
• Li, Q. (2000). Efficient estimation of additive partially linear models, International Economic Review 41: 1073–1092.
• Lian, H., Lai, P. and Liang, H. (2013). Partially linear structure selection in cox models with varying coefficients, Biometrics 69: 348–357.
• Lian, H., Liang, H. and Carroll, R. J. (2015). Variance function partially linear single-index models, Journal of the Royal Statistical Society, Series B 77: 171–194.
• Liang, H., Thurston, S. W., Ruppert, D., Apanasovich, T. and Hauser, R. (2008). Additive partial linear models with measurement errors, Biometrika 95(3): 667–678.
• Liu, X., Wang, L. and Liang, H. (2011). Estimation and variable selection for semiparametric additive partial linear models, Statistica Sinica 21: 1225–1248.
• Newey, W. K. (1994). The asymptotic variance of semiparametric estimators, Econometrica 62: 1349–1382.
• Nierenberg, D., Stukel, T., Baron, J., Dain, B. and Greenberg, E. (1989). Determinants of plasma-leverls of beta-carotene and retinol, American Journal of Epidemiology 130: 511–521.
• Opsomer, J. and Ruppert, D. (1997). Fitting a bivariate additive model by local polynomial regression, The Annals of Statistics 25: 186–211.
• Opsomer, J. and Ruppert, D. (1999). A root-$n$ consistent backfitting estimator for semiparametric additive modeling, Journal of Computational and Graphical Statistics 8: 715–732.
• Ruppert, D., Wand, M. P. and Carroll, R. J. (2003)., Semiparametric Regression, Springer.
• Stone, C. J. (1985). Additive regression and other nonparametric models, The Annals of Statistics 13: 689–705.
• Stone, C. J. (1986). The dimensionality reduction principle for generalized additive models, The Annals of Statistics 14: 590–606.
• Teschendorff, A. E. and Widschwendter, M. (2012). Differential variability improves the identification of cancer risk markers in dna methylation studies profiling precursor cancer lesions, Bioinformatics 28: 1487–1494.
• Thomas, L., Stefanski, L. A. and Davidian, M. (2012). Measurement error model methods for bias reduction and variance estimation in logistic regression with estimated variance predictors, Technical report, North Carolina State University.
• Tibshirani, R. (1996). Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B 58: 267–288.
• van der Vaart, A. W. and Wellner, J. A. (1996)., Weak convergence and empirical processes, Springer Series in Statistics, Springer-Verlag, New York. With applications to statistics.
• Wang, L., Liu, X., Liang, H. and Carroll, R. (2011). Estimation and variable selection for generalized additive partial linear models, The Annals of Statistics 39: 1827–1851.
• Wang, L., Xue, L., Qu, A. and Liang, H. (2014). Estimation and model selection in generalized additive partial linear models for correlated data with diverging number of covariates, The Annals of Statistics 42: 592–624.
• Wang, L. and Yang, L. (2007). Spline-backfitted kernel smoothing of nonlinear additive autoregression model, The Annals of Statistics 35: 2474–2503.
• Western, B. and Bloome, D. (2009). Variance function regressions for studying inequality, Sociological Methodology 39: 293–326.
• Xue, L. (2009). Consistent variable selection in additive models, Statistica Sinica 19: 1281–1296.
• Xue, L. and Yang, L. (2006a). Additive coefficient modeling via polynomial spline, Statistica Sinica 16: 1423–1446.
• Xue, L. and Yang, L. (2006b). Estimation of semi-parametric additive coefficient model, Journal of Statistical Planning and Inference 136(8): 2506–2534.