Electronic Journal of Statistics

Longitudinal functional principal component analysis

Sonja Greven, Ciprian Crainiceanu, Brian Caffo, and Daniel Reich

Full-text: Open access


We introduce models for the analysis of functional data observed at multiple time points. The dynamic behavior of functional data is decomposed into a time-dependent population average, baseline (or static) subject-specific variability, longitudinal (or dynamic) subject-specific variability, subject-visit-specific variability and measurement error. The model can be viewed as the functional analog of the classical longitudinal mixed effects model where random effects are replaced by random processes. Methods have wide applicability and are computationally feasible for moderate and large data sets. Computational feasibility is assured by using principal component bases for the functional processes. The methodology is motivated by and applied to a diffusion tensor imaging (DTI) study designed to analyze differences and changes in brain connectivity in healthy volunteers and multiple sclerosis (MS) patients. An R implementation is provided.

Article information

Electron. J. Statist. Volume 4 (2010), 1022-1054.

First available in Project Euclid: 12 October 2010

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

Diffusion tensor imaging functional data analysis Karhunen-Loève expansion longitudinal data analysis mixed effects model


Greven, Sonja; Crainiceanu, Ciprian; Caffo, Brian; Reich, Daniel. Longitudinal functional principal component analysis. Electron. J. Statist. 4 (2010), 1022--1054. doi:10.1214/10-EJS575. http://projecteuclid.org/euclid.ejs/1286889183.

Export citation


  • [1] Basser, P., Mattiello, J. and LeBihan, D. (1994). MR diffusion tensor spectroscopy and imaging., Biophysical Journal 66 259–267.
  • [2] Basser, P. J. and Pierpaoli, C. (1996). Microstructural and physiological features of tissues elucidated by quantitative-diffusion-tensor MRI., Journal of Magnetic Resonance, Series B 111 209–219.
  • [3] Brumback, B. A. and Rice, J. A. (1998). Smoothing spline models for the analysis of nested and crossed samples of curves., Journal of the American Statistical Association 961–976.
  • [4] Calabresi, P. A. (2008). Multiple sclerosis and demyelinating conditions of the central nervous system. In, Cecil Medicine 23rd ed. ( L. Goldman and D. A. Ausiello, eds.) Saunders Elsevier.
  • [5] Crainiceanu, C. and Ruppert, D. (2004). Likelihood ratio tests in linear mixed models with one variance component., Journal of the Royal Statistical Society, Series B 66 165-185.
  • [6] Crainiceanu, C. M., Staicu, A. M. and Di, C. Z. (2009). Generalized Multilevel Functional Regression., Journal of the American Statistical Association 104 1550–1561.
  • [7] Di, C. Z., Crainiceanu, C. M., Caffo, B. S. and Punjabi, N. M. (2008). Multilevel functional principal component analysis., Annals of Applied Statistics 3 458-488.
  • [8] Diggle, P., Heagerty, P., Liang, K. Y. and Zeger, S. (2002)., Analysis of longitudinal data. Oxford University Press, USA.
  • [9] Fan, J. and Gijbels, I. (1996)., Local polynomial modelling and its applications. CRC Press.
  • [10] Ferraty, F. and Vieu, P. (2006)., Nonparametric functional data analysis: theory and practice. Springer Verlag.
  • [11] Green, P. J. and Silverman, B. W. (1994)., Nonparametric Regression and Generalized Linear Models: a Roughness Penalty Approach. Chapman & Hall Ltd.
  • [12] Greven, S. and Kneib, T. (2010). On the Behaviour of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models., Biometrika to appear.
  • [13] Greven, S., Crainiceanu, C., Caffo, B. and Reich D. (2010). Supplement to “Longitudinal functional principal component analysis.” DOI:, 10.1214/10-EJS575SUPP
  • [14] Greven, S., Crainiceanu, C. M., Küchenhoff, H. and Peters, A. (2008). Restricted Likelihood Ratio Testing for Zero Variance Components in Linear Mixed Models., Journal of Computational and Graphical Statistics 17 870–891.
  • [15] Guo, W. (2002). Functional mixed effects models., Biometrics 58 121–128.
  • [16] Guo, W. (2004). Functional data analysis in longitudinal settings using smoothing splines., Statistical methods in medical research 13 49.
  • [17] Hall, P., Müller, H. G. and Yao, F. (2008). Modelling sparse generalized longitudinal observations with latent Gaussian processes., Journal of the Royal Statistical Society: Series B 70 703–723.
  • [18] Heim, S., Fahrmeir, L., Eilers, P. and Marx, B. (2007). 3D space-varying coefficient models with application to diffusion tensor imaging., Computational Statistics & Data Analysis 51 6212–6228.
  • [19] Herrick, R. C. and Morris, J. S. (2006). Wavelet-Based Functional Mixed Model Analysis: Computation Considerations. In, Proceedings, Joint Statistical Meetings, ASA Section on Statistical Computing.
  • [20] Karhunen, K. (1947). Über Lineare Methoden in der Wahrscheinlichkeitsrechnung., Annales Academiae Scientiarum Fennicae 37 1-79.
  • [21] Krivobokova, T. and Kauermann, G. (2007). A note on penalized spline smoothing with correlated errors., Journal of the American Statistical Association 102 1328–1337.
  • [22] Laird, N. and Ware, J. H. (1982). Random-effects models for longitudinal data., Biometrics 38 963-974.
  • [23] Liang, H., Wu, H. and Zou, G. (2008). A note on conditional AIC for linear mixed-effects models., Biometrika 95 773–778.
  • [24] Lin, X. and Carroll, R. J. (2000). Nonparametric Function Estimation for Clustered Data When the Predictor Is Measured Without/With Error., Journal of the American Statistical Association 95 520-534.
  • [25] Lin, F., Yu, C., Jiang, T., Li, K., Li, X., Qin, W., Sun, H. and Chan, P. (2006). Quantitative analysis along the pyramidal tract by length-normalized parameterization based on diffusion tensor tractography: application to patients with relapsing neuromyelitis optica., NeuroImage 33 154–160.
  • [26] Loeve, M. (1945). Fonctions aléatoires du second ordre., Comptes Rendus Académie des Sciences 220 380.
  • [27] Mercer, J. (1909). Functions of positive and negative type, and their connection with the theory of integral equations., Philosophical Transactions of the Royal Society of London. Series A. 415–446.
  • [28] Mori, S., Crain, B. J., Chacko, V. and Van Zijl, P. C. M. (1999). Three-dimensional tracking of axonal projections in the brain by magnetic resonance imaging., Annals of Neurology 45 265–269.
  • [29] Morris, J. S. and Carroll, R. J. (2006). Wavelet-based functional mixed models., Journal of the Royal Statistical Society, Series B 68 179-199.
  • [30] Müller, H. G. (2005). Functional modelling and classification of longitudinal data., Scandinavian Journal of Statistics 32 223–240.
  • [31] Müller, H. G. and Zhang, Y. (2005). Time-varying functional regression for predicting remaining lifetime distributions from longitudinal trajectories., Biometrics 61 1064-1075.
  • [32] Oh, J. S., Song, I. C., Lee, J. S., Kang, H., Park, K. S., Kang, E. and Lee, D. S. (2007). Tractography-guided statistics (TGIS) in diffusion tensor imaging for the detection of gender difference of fiber integrity in the midsagittal and parasagittal corpora callosa., Neuroimage 36 606–616.
  • [33] Ozturk, A., Smith, S., Gordon-Lipkin, E., Harrison, D., Shiee, N., Pham, D., Caffo, B., Calabresi, P. and Reich, D. (2009). MRI of the corpus callosum in multiple sclerosis: association with disability., Multiple Sclerosis to appear.
  • [34] Ramsay, J. O. and Silverman, B. (2005)., Functional data analysis, 2nd ed. Springer.
  • [35] Rao, C. R. (1965). The theory of least squares when the parameters are stochastic and its application to the analysis of growth curves., Biometrika 52 447–458.
  • [36] Reich, D. S., Smith, S. A., Zackowski, K. M., Gordon-Lipkin, E. M., Jones, C. K., Farrell, J. A. D., Mori, S., van Zijl, P. C. M. and Calabresi, P. A. (2007). Multiparametric magnetic resonance imaging analysis of the corticospinal tract in multiple sclerosis., Neuroimage 38 271–279.
  • [37] Rice, J. A. (2004). Functional and longitudinal data analysis: Perspectives on smoothing., Statistica Sinica 14 631-647.
  • [38] Rice, J. A. and Silverman, B. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves., Journal of the Royal Statistical Society. Series B 53 233–243.
  • [39] Ruppert, D., Wand, M. P. and Carroll, R. J. (2003)., Semiparametric Regression. Cambridge University Press.
  • [40] Staicu, A. M., Crainiceanu, C. M. and Carroll, R. J. (2010). Fast Methods for Spatially Correlated Multilevel Functional Data., Biostatistics to appear.
  • [41] Staniswalis, J. G. and Lee, J. J. (1998). Nonparametric Regression Analysis of Longitudinal Data., Journal of the American Statistical Association 93 1403–1404.
  • [42] Tievsky, A. L., Ptak, T. and Farkas, J. (1999). Investigation of apparent diffusion coefficient and diffusion tensor anisotropy in acute and chronic multiple sclerosis lesions., American Journal of Neuroradiology 20 1491-1499.
  • [43] Vaida, F. and Blanchard, S. (2005). Conditional Akaike information for mixed-effects models., Biometrika 92 351-370.
  • [44] Verbeke, G. and Molenberghs, G. (2009)., Linear mixed models for longitudinal data. Springer.
  • [45] Werring, D., Clark, C., Barker, G., Thompson, A. and Miller, D. (1999). Diffusion tensor imaging of lesions and normal-appearing white matter in multiple sclerosis., Neurology 52 1626-1632.
  • [46] Witelson, S. F. (1989). Hand and sex differences in the isthmus and genu of the human corpus callosum: a postmortem morphological study., Brain 112 799-835.
  • [47] Wood, S. N. (2006)., Generalized Additive Models: An Introduction with R. Chapman and Hall/CRC.
  • [48] Wu, H. and Zhang, J. T. (2006)., Nonparametric regression methods for longitudinal data analysis: mixed-effects modeling approaches. Wiley-Blackwell.
  • [49] Yao, F. and Lee, T. C. M. (2006). Penalized spline models for functional principal component analysis., Journal of the Royal Statistical Society, Series B 68 3-25.
  • [50] Yao, F., Müller, H. G. and Wang, J. L. (2005). Functional data analysis for sparse longitudinal data., Journal of the American Statistical Association 100 577–590.
  • [51] Yao, F., Clifford, A. J., Dueker, S. R., Follett, J., Lin, Y., Buchholz, B. A. and Vogel, J. S. (2003). Shrinkage estimation for functional principal component scores with application to the population kinetics of plasma folate., Biometrics 59 676–685.
  • [52] Zhu, H., Styner, M., Tang, N., Liu, Z., Lin, W. and Gilmore, J. (2010). FRATS: Functional Regression Analysis of DTI Tract Statistics., IEEE Transactions on Medical Imaging 29 1039–1049.

Supplemental materials