The Annals of Statistics
- Ann. Statist.
- Volume 33, Number 2 (2005), 774-805.
Generalized functional linear models
We propose a generalized functional linear regression model for a regression situation where the response variable is a scalar and the predictor is a random function. A linear predictor is obtained by forming the scalar product of the predictor function with a smooth parameter function, and the expected value of the response is related to this linear predictor via a link function. If, in addition, a variance function is specified, this leads to a functional estimating equation which corresponds to maximizing a functional quasi-likelihood. This general approach includes the special cases of the functional linear model, as well as functional Poisson regression and functional binomial regression. The latter leads to procedures for classification and discrimination of stochastic processes and functional data. We also consider the situation where the link and variance functions are unknown and are estimated nonparametrically from the data, using a semiparametric quasi-likelihood procedure.
An essential step in our proposal is dimension reduction by approximating the predictor processes with a truncated Karhunen–Loève expansion. We develop asymptotic inference for the proposed class of generalized regression models. In the proposed asymptotic approach, the truncation parameter increases with sample size, and a martingale central limit theorem is applied to establish the resulting increasing dimension asymptotics. We establish asymptotic normality for a properly scaled distance between estimated and true functions that corresponds to a suitable L2 metric and is defined through a generalized covariance operator. As a consequence, we obtain asymptotic tests and simultaneous confidence bands for the parameter function that determines the model.
The proposed estimation, inference and classification procedures and variants with unknown link and variance functions are investigated in a simulation study. We find that the practical selection of the number of components works well with the AIC criterion, and this finding is supported by theoretical considerations. We include an application to the classification of medflies regarding their remaining longevity status, based on the observed initial egg-laying curve for each of 534 female medflies.
Ann. Statist. Volume 33, Number 2 (2005), 774-805.
First available in Project Euclid: 26 May 2005
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Primary: 62G05: Estimation 62G20: Asymptotic properties
Secondary: 62M09: Non-Markovian processes: estimation 62H30: Classification and discrimination; cluster analysis [See also 68T10, 91C20]
Classification of stochastic processes covariance operator eigenfunctions functional regression generalized linear model increasing dimension asymptotics Karhunen–Loève expansion martingale central limit theorem order selection parameter function quasi-likelihood simultaneous confidence bands
Müller, Hans-Georg; Stadtmüller, Ulrich. Generalized functional linear models. Ann. Statist. 33 (2005), no. 2, 774--805. doi:10.1214/009053604000001156. https://projecteuclid.org/euclid.aos/1117114336.