The Annals of Statistics

Asymptotics for in-sample density forecasting

Young K. Lee, Enno Mammen, Jens P. Nielsen, and Byeong U. Park

Full-text: Open access


This paper generalizes recent proposals of density forecasting models and develops theory for this class of models. In density forecasting, the density of observations is estimated in regions where the density is not observed. Identification of the density in such regions is guaranteed by structural assumptions on the density that allow exact extrapolation. In this paper, the structural assumption is that the density is a product of one-dimensional functions. The theory is quite general with respect to the shape of the region where the density is observed. Such models arise naturally when the time point of an observation can be written as the sum of two terms (e.g., onset and incubation period of a disease). The developed theory also allows for a multiplicative factor of seasonal effects. Seasonal effects are present in many actuarial, biostatistical, econometric and statistical studies. Smoothing estimators based on backfitting are proposed, and full asymptotic theory is derived for them. A practical example from the insurance business is given, producing a within-year budget of reported insurance claims. A small sample study supports the theoretical results.
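
The product structure and the extrapolation step described in the abstract can be illustrated with a rough sketch. This is not the kernel smoothing estimator proposed in the paper. It is a discretised (histogram) analogue in the spirit of the chain ladder, under the following assumptions: counts are binned on an m-by-m grid, only cells with i + j < m are observed (a triangular region), and the expected count in cell (i, j) is a product a[i] * b[j]. The function names `fit_multiplicative` and `forecast_unobserved` are hypothetical and chosen for this illustration only.

```python
def fit_multiplicative(counts, m, n_iter=1000):
    """Alternating (backfitting-style) fit of counts[i][j] ~ a[i] * b[j].

    Only the observed cells i + j < m are used. Assumes every row and
    every column has a positive observed total.
    """
    a = [1.0] * m
    b = [1.0] * m
    for _ in range(n_iter):
        # Update the row factors while holding the column factors fixed.
        for i in range(m):
            cols = range(m - i)  # observed columns in row i
            a[i] = sum(counts[i][j] for j in cols) / sum(b[j] for j in cols)
        # Update the column factors while holding the row factors fixed.
        for j in range(m):
            rows = range(m - j)  # observed rows in column j
            b[j] = sum(counts[i][j] for i in rows) / sum(a[i] for i in rows)
    return a, b


def forecast_unobserved(a, b, m):
    """Extrapolate the fitted product into the unobserved cells i + j >= m."""
    return {
        (i, j): a[i] * b[j]
        for i in range(m)
        for j in range(m)
        if i + j >= m
    }


# Illustrative data: exact products on the observed triangle, zeros elsewhere.
m = 4
true_a = [4.0, 2.0, 1.0, 3.0]
true_b = [5.0, 3.0, 2.0, 1.0]
counts = [
    [true_a[i] * true_b[j] if i + j < m else 0.0 for j in range(m)]
    for i in range(m)
]
a, b = fit_multiplicative(counts, m)
forecast = forecast_unobserved(a, b, m)
```

The individual factors are identified only up to a common scale (multiplying every a[i] by c and dividing every b[j] by c gives the same fit), but the products a[i] * b[j], and hence the forecasts, do not depend on that scale. This is the sense in which the structural assumption permits extrapolation beyond the observed region.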

Article information

Ann. Statist., Volume 43, Number 2 (2015), 620-651.

First available in Project Euclid: 3 March 2015

Subjects

Primary: 62G07: Density estimation; 62G20: Asymptotic properties

Keywords

Density estimation; kernel smoothing; backfitting; chain ladder


Lee, Young K.; Mammen, Enno; Nielsen, Jens P.; Park, Byeong U. Asymptotics for in-sample density forecasting. Ann. Statist. 43 (2015), no. 2, 620–651. doi:10.1214/14-AOS1288.
