Open Access
2016 Nonparametric clustering of functional data using pseudo-densities
Mattia Ciollaro, Christopher R. Genovese, Daren Wang
Electron. J. Statist. 10(2): 2922-2972 (2016). DOI: 10.1214/16-EJS1198


We study nonparametric clustering of smooth random curves on the basis of the $L^{2}$ gradient flow associated to a pseudo-density functional and we discuss the conditions under which the clustering is well-defined both at the population and at the sample level. We provide an algorithm to idenify significant local modes of the estimated pseudo-density, which are associated to informative sample clusters, and we prove its consistency and other statistical properties. Our theory is developed under weak assumptions, which essentially reduce to the integrability of the random curves. If the underlying probability distribution is supported on a finite-dimensional subspace, we show that the proposed pseudo-density functional and the expectation of a kernel density estimator induce the same gradient flow, hence the same population clustering. Although our theory is developed for smooth curves that belong to a potentially infinite-dimensional functional space, we provide consistent procedures that can be used with real functional data (discretized and noisy curves). We illustrate these procedures by means of applications both on simulated and real datasets.


Download Citation

Mattia Ciollaro. Christopher R. Genovese. Daren Wang. "Nonparametric clustering of functional data using pseudo-densities." Electron. J. Statist. 10 (2) 2922 - 2972, 2016.


Received: 1 January 2016; Published: 2016
First available in Project Euclid: 31 October 2016

zbMATH: 1357.62162
MathSciNet: MR3567238
Digital Object Identifier: 10.1214/16-EJS1198

Primary: 62G07 , 62G86
Secondary: 62G99

Keywords: Functional data analysis , Gradient flow , Modal clustering , pseudo-density

Rights: Copyright © 2016 The Institute of Mathematical Statistics and the Bernoulli Society

Vol.10 • No. 2 • 2016
Back to Top