The Annals of Statistics

Depth weighted scatter estimators

Yijun Zuo and Hengjian Cui
Source: Ann. Statist. Volume 33, Number 1 (2005), 381-413.

Abstract

General depth weighted scatter estimators are introduced and investigated. For general depth functions, we find out that these affine equivariant scatter estimators are Fisher consistent and unbiased for a wide range of multivariate distributions, and show that the sample scatter estimators are strong and $\sqrt{n}$-consistent and asymptotically normal, and the influence functions of the estimators exist and are bounded in general. We then concentrate on a specific case of the general depth weighted scatter estimators, the projection depth weighted scatter estimators, which include as a special case the well-known Stahel–Donoho scatter estimator whose limiting distribution has long been open until this paper. Large sample behavior, including consistency and asymptotic normality, and efficiency and finite sample behavior, including breakdown point and relative efficiency of the sample projection depth weighted scatter estimators, are thoroughly investigated. The influence function and the maximum bias of the projection depth weighted scatter estimators are derived and examined. Unlike typical high-breakdown competitors, the projection depth weighted scatter estimators can integrate high breakdown point and high efficiency while enjoying a bounded-influence function and a moderate maximum bias curve. Comparisons with leading estimators on asymptotic relative efficiency and gross error sensitivity reveal that the projection depth weighted scatter estimators behave very well overall and, consequently, represent very favorable choices of affine equivariant multivariate scatter estimators.

First Page: Show Hide
Primary Subjects: 62F35, 62H12
Secondary Subjects: 62E20, 62F12
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1112967710
Digital Object Identifier: doi:10.1214/009053604000000922
Zentralblatt MATH identifier: 02182567
Mathematical Reviews number (MathSciNet): MR2157807

References

Chen, Z. and Tyler, D. E. (2002). The influence function and maximum bias of Tukey's median. Ann. Statist. 30 1737--1759.
Mathematical Reviews (MathSciNet): MR1969448
Digital Object Identifier: doi:10.1214/aos/1043351255
Project Euclid: euclid.aos/1043351255
Zentralblatt MATH: 1015.62058
Cui, H. and Tian, Y. (1994). Estimation of the projection absolute median deviation and its application. J. Systems Sci. Math. Sci. 14 63--72. (In Chinese.)
Mathematical Reviews (MathSciNet): MR1331533
Davies, P. L. (1987). Asymptotic behavior of $S$-estimates of multivariate location parameters and dispersion matrices. Ann. Statist. 15 1269--1292.
Mathematical Reviews (MathSciNet): MR902258
Digital Object Identifier: doi:10.1214/aos/1176350505
Project Euclid: euclid.aos/1176350505
Zentralblatt MATH: 0645.62057
Donoho, D. L. (1982). Breakdown properties of multivariate location estimators. Ph.D. qualifying paper, Dept. Statistics, Harvard Univ.
Donoho, D. L. and Huber, P. J. (1983). The notion of breakdown point. In A Festschrift for Erich L. Lehmann (P. J. Bickel, K. A. Doksum and J. L. Hodges, Jr., eds.) 157--184. Wadsworth, Belmont, CA.
Mathematical Reviews (MathSciNet): MR689745
Zentralblatt MATH: 0523.62032
Dümbgen, L. (1992). Limit theorem for the simplicial depth. Statist. Probab. Lett. 14 119--128.
Mathematical Reviews (MathSciNet): MR1173409
Eaton, M. L. (1981). On the projections of isotropic distributions. Ann. Statist. 9 391--400.
Mathematical Reviews (MathSciNet): MR606622
Digital Object Identifier: doi:10.1214/aos/1176345404
Project Euclid: euclid.aos/1176345404
Zentralblatt MATH: 0463.62016
Gather, U. and Hilker, T. (1997). A note on Tyler's modification of the MAD for the Stahel--Donoho estimator. Ann. Statist. 25 2024--2026.
Mathematical Reviews (MathSciNet): MR1474080
Digital Object Identifier: doi:10.1214/aos/1069362384
Project Euclid: euclid.aos/1069362384
Zentralblatt MATH: 0881.62033
Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J. and Stahel, W. A. (1986). Robust Statistics: The Approach Based On Influence Functions. Wiley, New York.
Mathematical Reviews (MathSciNet): MR829458
Zentralblatt MATH: 0593.62027
He, X. and Simpson, D. G. (1993). Lower bounds for contamination bias: Globally minimax versus locally linear estimation. Ann. Statist. 21 314--337.
Mathematical Reviews (MathSciNet): MR1212179
Digital Object Identifier: doi:10.1214/aos/1176349028
Project Euclid: euclid.aos/1176349028
Zentralblatt MATH: 0770.62023
Huber, P. J. (1964). Robust estimation of a location parameter. Ann. Math. Statist. 35 73--101.
Mathematical Reviews (MathSciNet): MR161415
Digital Object Identifier: doi:10.1214/aoms/1177703732
Project Euclid: euclid.aoms/1177703732
Kent, J. T. and Tyler, D. E. (1996). Constrained $M$-estimation multivariate location and scatter. Ann. Statist. 24 1346--1370.
Mathematical Reviews (MathSciNet): MR1401854
Digital Object Identifier: doi:10.1214/aos/1032526973
Project Euclid: euclid.aos/1032526973
Zentralblatt MATH: 0862.62048
Liu, R. Y. (1990). On a notion of data depth based on random simplices. Ann. Statist. 18 405--414.
Mathematical Reviews (MathSciNet): MR1041400
Digital Object Identifier: doi:10.1214/aos/1176347507
Project Euclid: euclid.aos/1176347507
Zentralblatt MATH: 0701.62063
Liu, R. Y. (1992). Data depth and multivariate rank tests. In $L_1$-Statistical Analysis and Related Methods (Y. Dodge, ed.) 279--294. North-Holland, Amsterdam.
Mathematical Reviews (MathSciNet): MR1214839
Liu, R. Y., Parelius, J. M. and Singh, K. (1999). Multivariate analysis by data depth: Descriptive statistics, graphics and inference (with discussion). Ann. Statist. 27 783--858.
Mathematical Reviews (MathSciNet): MR1724033
Project Euclid: euclid.aos/1018031260
Lopuhaä, H. P. (1989). On the relation between $S$-estimators and $M$-estimators of multivariate location and covariance. Ann. Statist. 17 1662--1683.
Mathematical Reviews (MathSciNet): MR1026304
Digital Object Identifier: doi:10.1214/aos/1176347386
Project Euclid: euclid.aos/1176347386
Zentralblatt MATH: 0702.62031
Lopuhaä, H. P. (1999). Asymptotics of reweighted estimators of multivariate location and scatter. Ann. Statist. 27 1638--1665.
Mathematical Reviews (MathSciNet): MR1742503
Digital Object Identifier: doi:10.1214/aos/1017939145
Project Euclid: euclid.aos/1017939145
Zentralblatt MATH: 0957.62017
Maronna, R. A. (1976). Robust $M$-estimators of multivariate location and scatter. Ann. Statist. 4 51--67.
Mathematical Reviews (MathSciNet): MR388656
Digital Object Identifier: doi:10.1214/aos/1176343347
Project Euclid: euclid.aos/1176343347
Zentralblatt MATH: 0322.62054
Maronna, R. A. and Yohai, V. J. (1995). The behavior of the Stahel--Donoho robust multivariate estimator. J. Amer. Statist. Assoc. 90 330--341.
Mathematical Reviews (MathSciNet): MR1325140
Digital Object Identifier: doi:10.2307/2291158
Zentralblatt MATH: 0820.62050
Martin, R. D., Yohai, V. J. and Zamar, R. H. (1989). Min--max bias robust regression. Ann. Statist. 17 1608--1630.
Mathematical Reviews (MathSciNet): MR1026302
Digital Object Identifier: doi:10.1214/aos/1176347384
Project Euclid: euclid.aos/1176347384
Zentralblatt MATH: 0713.62068
Massé, J.-C. (2004). Asymptotics for the Tukey depth process, with an application to a multivariate trimmed mean. Bernoulli 10 397--419.
Mathematical Reviews (MathSciNet): MR2061438
Project Euclid: euclid.bj/1089206404
Digital Object Identifier: doi:10.3150/bj/1089206404
Zentralblatt MATH: 1053.62021
Muirhead, R. J. (1982). Aspects of Multivariate Statistical Thoeory. Wiley, New York.
Mathematical Reviews (MathSciNet): MR652932
Zentralblatt MATH: 0556.62028
Pollard, D. (1984). Convergence of Stochastic Processes. Springer, New York.
Mathematical Reviews (MathSciNet): MR762984
Zentralblatt MATH: 0544.60045
Romanazzi, M. (2001). Influence function of halfspace depth. J. Multivariate Anal. 77 138--161.
Mathematical Reviews (MathSciNet): MR1838717
Digital Object Identifier: doi:10.1006/jmva.2000.1929
Zentralblatt MATH: 1033.62047
Rousseeuw, P. J. (1985). Multivariate estimation with high breakdown point. In Mathematical Statistics and Applications (W. Grossmann, G. Pflug, I. Vincze and W. Wertz, eds.) 283--297. Reidel, Dordrecht.
Mathematical Reviews (MathSciNet): MR851060
Zentralblatt MATH: 0609.62054
Stahel, W. A. (1981). Breakdown of covariance estimators. Research Report 31, Fachgruppe für Statistik, ETH, Zürich.
Tukey, J. W. (1975). Mathematics and the picturing of data. In Proc. International Congress of Mathematicians Vancouver 1974 2 523--531. Canadian Math. Congress, Montreal.
Mathematical Reviews (MathSciNet): MR426989
Zentralblatt MATH: 0347.62002
Tyler, D. E. (1982). Radial estimates and the test for sphericity. Biometrika 69 429--436.
Mathematical Reviews (MathSciNet): MR671982
Zentralblatt MATH: 0501.62041
Digital Object Identifier: doi:10.1093/biomet/69.2.429
Tyler, D. E. (1983). Robustness and efficiency properties of scatter matrices. Biometrika 70 411--420.
Mathematical Reviews (MathSciNet): MR712028
Zentralblatt MATH: 0536.62042
Digital Object Identifier: doi:10.1093/biomet/70.2.411
Tyler, D. E. (1994). Finite sample breakdown points of projection based multivariate location and scatter statistics. Ann. Statist. 22 1024--1044.
Mathematical Reviews (MathSciNet): MR1292555
Digital Object Identifier: doi:10.1214/aos/1176325510
Project Euclid: euclid.aos/1176325510
Zentralblatt MATH: 0815.62015
van der Vaart, A. W. and Wellner, J. A. (1996). Weak Convergence and Empirical Processes. With Applications to Statistics. Springer, New York.
Mathematical Reviews (MathSciNet): MR1385671
Zentralblatt MATH: 0862.60002
Zuo, Y. (2003). Projection-based depth functions and associated medians. Ann. Statist. 31 1460--1490.
Mathematical Reviews (MathSciNet): MR2012822
Digital Object Identifier: doi:10.1214/aos/1065705115
Project Euclid: euclid.aos/1065705115
Zentralblatt MATH: 1046.62056
Zuo, Y. (2004). Robustness of weighted $L_p$-depth and $L_p$-median. Allg. Stat. Arch. 88 215--234.
Mathematical Reviews (MathSciNet): MR2074731
Digital Object Identifier: doi:10.1007/s101820400169
Zentralblatt MATH: 05244356
Zuo, Y., Cui, H. and He, X. (2004). On the Stahel--Donoho estimator and depth-weighted means of multivariate data. Ann. Statist. 32 167--188.
Mathematical Reviews (MathSciNet): MR2051003
Digital Object Identifier: doi:10.1214/aos/1079120132
Project Euclid: euclid.aos/1079120132
Zentralblatt MATH: 1105.62349
Zuo, Y., Cui, H. and Young, D. (2004). Influence function and maximum bias of projection depth based estimators. Ann. Statist. 32 189--218.
Mathematical Reviews (MathSciNet): MR2051004
Digital Object Identifier: doi:10.1214/aos/1079120133
Project Euclid: euclid.aos/1079120133
Zentralblatt MATH: 1105.62350
Zuo, Y. and Serfling, R. (2000a). General notions of statistical depth function. Ann. Statist. 28 461--482.
Mathematical Reviews (MathSciNet): MR1790005
Digital Object Identifier: doi:10.1214/aos/1016218226
Project Euclid: euclid.aos/1016218226
Zentralblatt MATH: 1106.62334
Zuo, Y. and Serfling, R. (2000b). Structural properties and convergence results for contours of sample statistical depth functions. Ann. Statist. 28 483--499.
Mathematical Reviews (MathSciNet): MR1790006
Digital Object Identifier: doi:10.1214/aos/1016218227
Project Euclid: euclid.aos/1016218227
Zentralblatt MATH: 1105.62343

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics