A method for extracting multiscale geometric features from a data cloud is proposed and analyzed. Based on geometric considerations, we map each pair of data points into a real-valued feature function defined on the unit interval. Further statistical analysis is then based on the collection of feature functions. The potential of the method is illustrated by different applications, including classification and anomaly detection. Connections to other concepts, such as random set theory, localized depth measures and nonlinear dimension reduction, are also explored.
"Multiscale geometric feature extraction for high-dimensional and non-Euclidean data with applications." Ann. Statist. 49 (2) 988 - 1010, April 2021. https://doi.org/10.1214/20-AOS1988