Bernoulli

Some intriguing properties of Tukey’s half-space depth

Subhajit Dutta, Anil K. Ghosh, and Probal Chaudhuri
Source: Bernoulli Volume 17, Number 4 (2011), 1420-1434.

Abstract

For multivariate data, Tukey’s half-space depth is one of the most popular depth functions available in the literature. It is conceptually simple and satisfies several desirable properties of depth functions. The Tukey median, the multivariate median associated with the half-space depth, is also a well-known measure of center for multivariate data with several interesting properties. In this article, we derive and investigate some interesting properties of half-space depth and its associated multivariate median. These properties, some of which are counterintuitive, have important statistical consequences in multivariate analysis. We also investigate a natural extension of Tukey’s half-space depth and the related median for probability distributions on any Banach space (which may be finite- or infinite-dimensional) and prove some results that demonstrate anomalous behavior of half-space depth in infinite-dimensional spaces.

First Page: Show Hide
Full-text: Access denied (no subscription detected)
We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.bj/1320417511
Digital Object Identifier: doi:10.3150/10-BEJ322
Zentralblatt MATH identifier: 1229.62063
Mathematical Reviews number (MathSciNet): MR2854779

References

[1] Ajne, B. (1968). A simple test for uniformity of a circular distribution. Biometrika 55 343–354.
Mathematical Reviews (MathSciNet): MR235662
Zentralblatt MATH: 0157.48502
Digital Object Identifier: doi:10.1093/biomet/55.2.343
[2] Chaudhuri, P. and Sengupta, D. (1993). Sign tests in multi-dimension: Inference based on the geometry of the data cloud. J. Amer. Statist. Assoc. 88 1363–1370.
Mathematical Reviews (MathSciNet): MR1245371
Zentralblatt MATH: 0792.62047
Digital Object Identifier: doi:10.2307/2291278
[3] Chow, Y.S. and Teicher, H. (2005). Probability Theory: Independence, Interchangeability, Martingales. New York: Springer.
[4] Cuesta-Albertosa, J.A. and Nieto-Reyes, A. (2008). The Tukey and the random Tukey depths characterize discrete distributions. J. Multivariate Anal. 10 2304–2311.
Mathematical Reviews (MathSciNet): MR2463390
Zentralblatt MATH: 05374623
Digital Object Identifier: doi:10.1016/j.jmva.2008.02.017
[5] Dang, X. and Serfling, R. (2010). Nonparametric depth-based multivariate outlier identifiers, and masking robustness properties. J. Statist. Plann. Inference 140 198–213.
Mathematical Reviews (MathSciNet): MR2568133
Zentralblatt MATH: 1191.62084
Digital Object Identifier: doi:10.1016/j.jspi.2009.07.004
[6] Donoho, D. and Gasko, M. (1992). Breakdown properties of location estimates based half-space depth and projected outlyingness. Ann. Statist. 20 1803–1827.
Mathematical Reviews (MathSciNet): MR1193313
Zentralblatt MATH: 0776.62031
Digital Object Identifier: doi:10.1214/aos/1176348890
Project Euclid: euclid.aos/1176348890
[7] Ghosh, A.K. and Chaudhuri, P. (2005). On data depth and distribution free discriminant analysis using separating surfaces. Bernoulli 11 1–27.
Mathematical Reviews (MathSciNet): MR2121452
Digital Object Identifier: doi:10.3150/bj/1110228239
Project Euclid: euclid.bj/1110228239
[8] Ghosh, A.K. and Chaudhuri, P. (2005). On maximum depth classifiers. Scand. J. Statist. 32 328–350.
Mathematical Reviews (MathSciNet): MR2188677
Digital Object Identifier: doi:10.1111/j.1467-9469.2005.00423.x
[9] Hassairi, A. and Regaieg, O. (2008). On the Tukey depth of a continuous probability distribution. Statist. Probab. Lett. 78 2308–2313.
Mathematical Reviews (MathSciNet): MR2462666
[10] Koshevoy, G.A. (2002). The Tukey’s depth characterizes the atomic measure. J. Multivariate Anal. 83 360–364.
Mathematical Reviews (MathSciNet): MR1945958
Zentralblatt MATH: 1028.62040
Digital Object Identifier: doi:10.1006/jmva.2001.2052
[11] Koshevoy, G.A. (2003). Lift-zonoid and multivariate depths. In Developments in Robust Statistics (Vorau, 2001) 194–202. Heidelberg: Physica.
Mathematical Reviews (MathSciNet): MR1977477
Digital Object Identifier: doi:10.1007/978-3-642-57338-5_16
[12] Liu, R. (1990). On a notion of data depth based on random simplices. Ann. Statist. 18 405–414.
Mathematical Reviews (MathSciNet): MR1041400
Zentralblatt MATH: 0701.62063
Digital Object Identifier: doi:10.1214/aos/1176347507
Project Euclid: euclid.aos/1176347507
[13] Liu, R., Parelius, J. and Singh, K. (1999). Multivariate analysis of the data depth: Descriptive statistics and inference. Ann. Statist. 27 783–858.
Mathematical Reviews (MathSciNet): MR1724033
Zentralblatt MATH: 0984.62037
Project Euclid: euclid.aos/1018031260
[14] Lopez-Pintado, S. and Romo, J. (2006). Depth based classification for functional data. In DIMACS Ser. Math. and Theo. Comp. Sci. (R. Liu and R. Serfling, Eds.) 72 103–119. Providence, RI: Amer. Math. Soc.
Mathematical Reviews (MathSciNet): MR2343116
[15] Mizera, I. and Muller, C.H. (2004). Location-scale depth. J. Amer. Statist. Assoc. 99 949–966.
Mathematical Reviews (MathSciNet): MR2109488
Zentralblatt MATH: 1071.62032
Digital Object Identifier: doi:10.1198/016214504000001312
[16] Mosler, K. (2002). Multivariate Dispersions, Central Regions and Depth. New York: Springer.
Mathematical Reviews (MathSciNet): MR1913862
Zentralblatt MATH: 1027.62033
[17] Nolan, D. (1992). Asymptotics for multivariate trimming. Stochastic Process. Appl. 42 157–169.
Mathematical Reviews (MathSciNet): MR1172513
Zentralblatt MATH: 0763.62007
Digital Object Identifier: doi:10.1016/0304-4149(92)90032-L
[18] Serfling, R. (2006). Depth functions in nonparametric multivariate inference. In DIMACS Ser. Math. and Theo. Comp. Sci. (R. Liu and R. Serfling. Eds.) 72 1–16. Providence, RI: Amer. Math. Soc.
Mathematical Reviews (MathSciNet): MR2343109
[19] Small, C.G. (1990). A survey of multidimensional medians. Inter. Statist. Rev. 58 263–277.
[20] Tukey, J. (1975). Mathematics and the picturing of data. In Proc. 1975 Inter. Cong. Math., Vancouver 523–531. Montreal: Canad. Math. Congress.
Mathematical Reviews (MathSciNet): MR426989
Zentralblatt MATH: 0347.62002
[21] Vardi, Y. and Zhang, C.H. (2000). The multivariate L1-median and associated data depth. Proc. Natl. Acad. Sci. USA 97 1423–1426.
Mathematical Reviews (MathSciNet): MR1740461
Zentralblatt MATH: 1054.62067
Digital Object Identifier: doi:10.1073/pnas.97.4.1423
[22] Zuo, Y. and Serfling, R. (2000). General notions of statistical depth function. Ann. Statist. 28 461–482.
Mathematical Reviews (MathSciNet): MR1790005
Zentralblatt MATH: 1106.62334
Digital Object Identifier: doi:10.1214/aos/1016218226
Project Euclid: euclid.aos/1016218226
[23] Zuo, Y. (2003). Projection-based depth functions and associated medians. Ann. Statist. 31 1460–1490.
Mathematical Reviews (MathSciNet): MR2012822
Zentralblatt MATH: 1046.62056
Digital Object Identifier: doi:10.1214/aos/1065705115
Project Euclid: euclid.aos/1065705115

2013 © Bernoulli Society for Mathematical Statistics and Probability

Bernoulli

Bernoulli

Turn MathJax Off
What is MathJax?