## Electronic Journal of Statistics

### Estimating the reach of a manifold

#### Abstract

Various problems in manifold estimation make use of a quantity called the reach, denoted by $\tau_{M}$, which is a measure of the regularity of the manifold. This paper is the first investigation into the problem of how to estimate the reach. First, we study the geometry of the reach through an approximation perspective. We derive new geometric results on the reach for submanifolds without boundary. An estimator $\hat{\tau }$ of $\tau_{M}$ is proposed in an oracle framework where tangent spaces are known, and bounds assessing its efficiency are derived. In the case of i.i.d. random point cloud $\mathbb{X}_{n}$, $\hat{\tau }(\mathbb{X}_{n})$ is showed to achieve uniform expected loss bounds over a $\mathcal{C}^{3}$-like model. Finally, we obtain upper and lower bounds on the minimax rate for estimating the reach.

#### Article information

Source
Electron. J. Statist., Volume 13, Number 1 (2019), 1359-1399.

Dates
First available in Project Euclid: 12 April 2019

https://projecteuclid.org/euclid.ejs/1555056153

Digital Object Identifier
doi:10.1214/19-EJS1551

#### Citation

Aamari, Eddie; Kim, Jisu; Chazal, Frédéric; Michel, Bertrand; Rinaldo, Alessandro; Wasserman, Larry. Estimating the reach of a manifold. Electron. J. Statist. 13 (2019), no. 1, 1359--1399. doi:10.1214/19-EJS1551. https://projecteuclid.org/euclid.ejs/1555056153

#### References

• [1] Aamari, E. and Levrard, C. (2018). Stability and minimax optimality of tangential Delaunay complexes for manifold reconstruction., Discrete Comput. Geom. 59 923–971.
• [2] Aamari, E. and Levrard, C. (2019). Nonasymptotic rates for manifold, tangent space and curvature estimation., Ann. Statist. 47 177–204.
• [3] Alexander, S. B. and Bishop, R. L. (2006). Gauss equation and injectivity radii for subspaces in spaces of curvature bounded above., Geom. Dedicata 117 65–84.
• [4] Arias-Castro, E., Lerman, G. and Zhang, T. (2017). Spectral clustering based on local PCA., J. Mach. Learn. Res. 18 Paper No. 9, 57.
• [5] Arias-Castro, E., Pateiro-López, B. and Rodríguez-Casal, A. (2018). Minimax Estimation of the Volume of a Set Under the Rolling Ball Condition., Journal of the American Statistical Association 0 1-12.
• [6] Attali, D., Boissonnat, J.-D. and Edelsbrunner, H. (2009). Stability and computation of medial axes: a state-of-the-art report. In, Mathematical foundations of scientific visualization, computer graphics, and massive data exploration. Math. Vis. 109–125. Springer, Berlin.
• [7] Balakrishnan, S., Rinaldo, A., Sheehy, D., Singh, A. and Wasserman, L. A. (2012). Minimax rates for homology inference. In, International Conference on Artificial Intelligence and Statistics 64–72.
• [8] Belkin, M., Niyogi, P. and Sindhwani, V. (2006). Manifold regularization: a geometric framework for learning from labeled and unlabeled examples., J. Mach. Learn. Res. 7 2399–2434.
• [9] Berger, M. (1987)., Geometry. II. Universitext. Springer-Verlag, Berlin Translated from the French by M. Cole and S. Levy.
• [10] Boissonnat, J.-D. and Ghosh, A. (2014). Manifold reconstruction using tangential Delaunay complexes., Discrete Comput. Geom. 51 221–267.
• [11] Boissonnat, J.-D., Lieutier, A. and Wintraecken, M. (2018). The reach, metric distortion, geodesic convexity and the variation of tangent spaces. In, 34th International Symposium on Computational Geometry. LIPIcs. Leibniz Int. Proc. Inform. 99 Art. No. 10, 14. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern.
• [12] Burago, D., Burago, Y. and Ivanov, S. (2001)., A course in metric geometry. Graduate Studies in Mathematics 33. American Mathematical Society, Providence, RI.
• [13] Chazal, F. and Lieutier, A. (2005). The $\lambda$-medial axis., J. Graphical Models 67 304–331.
• [14] Cheng, S.-W. and Chiu, M.-K. (2016). Tangent estimation from point samples., Discrete Comput. Geom. 56 505–557.
• [15] Cuevas, A., Fraiman, R. and Pateiro-López, B. (2012). On statistical properties of sets fulfilling rolling-type conditions., Adv. in Appl. Probab. 44 311–329.
• [16] Cuevas, A., Fraiman, R. and Rodríguez-Casal, A. (2007). A nonparametric approach to the estimation of lengths and surface areas., Ann. Statist. 35 1031–1051.
• [17] Cuevas, A., Llop, P. and Pateiro-López, B. (2014). On the estimation of the medial axis and inner parallel body., J. Multivariate Anal. 129 171–185.
• [18] De Marco, G., Gorni, G. and Zampieri, G. (1994). Global inversion of functions: an introduction., NoDEA Nonlinear Differential Equations Appl. 1 229–248.
• [19] Dey, T. K. and Sun, J. (2006). Normal and feature approximations from noisy point clouds. In, FSTTCS 2006: Foundations of software technology and theoretical computer science. Lecture Notes in Comput. Sci. 4337 21–32. Springer, Berlin.
• [20] do Carmo, M. P. (1992)., Riemannian geometry. Mathematics: Theory & Applications. Birkhäuser Boston, Inc., Boston, MA Translated from the second Portuguese edition by Francis Flaherty.
• [21] Dyer, R., Vegter, G. and Wintraecken, M. (2015). Riemannian simplices and triangulations., Geometriae Dedicata 179 91–138.
• [22] Federer, H. (1959). Curvature measures., Trans. Amer. Math. Soc. 93 418–491.
• [23] Federer, H. (1969)., Geometric measure theory. Die Grundlehren der mathematischen Wissenschaften, Band 153. Springer-Verlag New York Inc., New York.
• [24] Fefferman, C., Mitter, S. and Narayanan, H. (2016). Testing the manifold hypothesis., J. Amer. Math. Soc. 29 983–1049.
• [25] Genovese, C. R., Perone-Pacifico, M., Verdinelli, I. and Wasserman, L. (2012). Minimax manifold estimation., J. Mach. Learn. Res. 13 1263–1291.
• [26] Giné, E. and Koltchinskii, V. (2006). Empirical graph Laplacian approximation of Laplace-Beltrami operators: large sample results. In, High dimensional probability. IMS Lecture Notes Monogr. Ser. 51 238–259. Inst. Math. Statist., Beachwood, OH.
• [27] Hatcher, A. (2002)., Algebraic topology. Cambridge University Press, Cambridge.
• [28] Hug, D., Kiderlen, M. and Svane, A. M. (2017). Voronoi-based estimation of Minkowski tensors from finite point samples., Discrete Comput. Geom. 57 545–570.
• [29] Kanagawa, S., Mochizuki, Y. and Tanaka, H. (1992). Limit theorems for the minimum interpoint distance between any pair of i.i.d. random points in $\mathbfR^d$., Ann. Inst. Statist. Math. 44 121–131.
• [30] Karcher, H. (1989). Riemannian comparison constructions. In, Global differential geometry. MAA Stud. Math. 27 170–222. Math. Assoc. America, Washington, DC.
• [31] Kim, A. K. H. and Zhou, H. H. (2015). Tight minimax rates for manifold estimation under Hausdorff loss., Electron. J. Stat. 9 1562–1582.
• [32] Klette, R. and Rosenfeld, A. (2004)., Digital geometry. Morgan Kaufmann Publishers, San Francisco, CA; Elsevier Science B.V., Amsterdam Geometric methods for digital picture analysis.
• [33] Niyogi, P., Smale, S. and Weinberger, S. (2008). Finding the homology of submanifolds with high confidence from random samples., Discrete Comput. Geom. 39 419–441.
• [34] Rataj, J. and Zajíček, L. (2017). On the structure of sets with positive reach., Math. Nachr. 290 1806–1829.
• [35] Rodríguez-Casal, A. and Saavedra-Nieves, P. (2016). A fully data-driven method for estimating the shape of a point cloud., ESAIM Probab. Stat. 20 332–348.
• [36] Singer, A. and Wu, H. T. (2012). Vector diffusion maps and the connection Laplacian., Comm. Pure Appl. Math. 65 1067–1144.
• [37] Thäle, C. (2008). 50 years sets with positive reach—a survey., Surv. Math. Appl. 3 123–165.
• [38] Yu, B. (1997). Assouad, Fano, and Le Cam. In, Festschrift for Lucien Le Cam 423–435. Springer, New York.