Abstract
Bump-hunting or mode identification is a fundamental problem that arises in almost every scientific field of data-driven discovery. Surprisingly, very few data modeling tools are available for automatic (not requiring manual case-by-case investigation), objective (not subjective), and nonparametric (not based on restrictive parametric model assumptions) mode discovery, which can scale to large data sets. This article introduces LPMode–an algorithm based on a new theory for detecting multimodality of a probability density. We apply LPMode to answer important research questions arising in various fields from environmental science, ecology, econometrics, analytical chemistry to astronomy and cancer genomics.
Citation
Subhadeep Mukhopadhyay. "Large-scale mode identification and data-driven sciences." Electron. J. Statist. 11 (1) 215 - 240, 2017. https://doi.org/10.1214/17-EJS1229
Information