The Annals of Statistics

Efficient estimation in the bivariate censoring model and repairing NPMLE

Mark J. van der Laan
Source: Ann. Statist. Volume 24, Number 2 (1996), 596-627.

Abstract

The NPMLE in the bivariate censoring model is not consistent for continuous data. The problem is caused by the singly censored observations. In this paper we prove that if we observe the censoring times or if the censoring times are discrete, then a NPMLE based on a slightly reduced data set, in particular, we interval censor the singly censored observations, is asymptotically efficient for this reduced data and moreover if we let the width of the interval converge to zero slowly enough, then the NPMLE is also asymptotically efficient for the original data. We are able to determine a lower bound for the rate at which the bandwidth should converge to zero. Simulation results show that the estimator for small bandwidths has a very goodperformance. The efficiency proof uses a general identity which holds for NPMLE of a linear parameter in convex models. If we neither observe the censoring times nor the censoring times are discrete, then we conjecture that our estimator based on simulated censoring times is also asymptotically efficient.

First Page: Show Hide
Primary Subjects: 62G07
Secondary Subjects: 62F12
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1032894454
Mathematical Reviews number (MathSciNet): MR1394977
Digital Object Identifier: doi:10.1214/aos/1032894454
Zentralblatt MATH identifier: 0859.62033

References

BAKKER, D. M. 1990. Two nonparametric estimators of the survival function of bivariate right censored observations. Report BS-R9035, Centrum Wisk. Inform., Amsterdam. Z.
BICKEL, P. J. and FREEDMAN, D. A. 1981. Some asy mptotic theory for the bootstrap. Ann. Statist. 9 1196 1217. Z.
Mathematical Reviews (MathSciNet): MR630103
Zentralblatt MATH: 0449.62034
Digital Object Identifier: doi:10.1214/aos/1176345637
Project Euclid: euclid.aos/1176345637
BICKEL, P. J., KLAASSEN, A. J., RITOV, Y. and WELLNER, J. A. 1993. Efficient and Adaptive Estimation for Semi-Parametric Models. Johns Hopkins Univ. Press. Z.
Mathematical Reviews (MathSciNet): MR94m:62007
BURKE, M. D. 1988. Estimation of a bivariate survival function under random censorship. Biometrika 75 379 382. Z.
Mathematical Reviews (MathSciNet): MR946057
Digital Object Identifier: doi:10.1093/biomet/75.2.379
DABROWSKA, D. M. 1988. Kaplan Meier estimate on the plane. Ann. Statist. 16 1475 1489. Z.
Mathematical Reviews (MathSciNet): MR964934
Zentralblatt MATH: 0653.62071
Digital Object Identifier: doi:10.1214/aos/1176351049
Project Euclid: euclid.aos/1176351049
DABROWSKA, D. M. 1989. Kaplan Meier estimate on the plane: weak convergence, LIL, and the bootstrap. J. Multivariate Anal. 29 308 325. Z.
Mathematical Reviews (MathSciNet): MR90j:62023
Zentralblatt MATH: 0667.62025
Digital Object Identifier: doi:10.1016/0047-259X(89)90030-4
DEMPSTER, A. P., LAIRD, N. M. and RUBIN, D. B. 1977. Maximum likelihood from incomplete data via the EM-algorithm. J. Roy. Statist. Soc. Ser. B 39 1 38. Z.
Mathematical Reviews (MathSciNet): MR58:18858
EFRON, B. 1967. The two sample problem with censored data. Proc. Fifth Berkeley Sy mp. Math. Statist. Probab. 831 853. Univ. California Press, Berkeley. Z.
EINMAHL, J. H. H. 1987. Multivariate Empirical Processes. CWI Tract 32. Centrum Wisk. Inform., Amsterdam. Z.
Mathematical Reviews (MathSciNet): MR88g:60057
GILL, R. D. 1989. Nonand semi-parametric maximum likelihood estimators and the von Mises Z. method Part 1. Scand. J. Statist. 16 97 128. Z.
Mathematical Reviews (MathSciNet): MR91d:62042
GILL, R. D. 1992. Multivariate survival analysis. Theory Probab. Appl. 37 18 31 and 284 301. Z. English translation. Z.
Zentralblatt MATH: 0780.62086
GILL, R. D. 1994. Lectures on survival analysis. Ecole d'Ete de Probabilites de Saint Flour ´ ´ XXII. Lecture Notes in Math. 1581 115 241. Springer, Berlin. Z.
GILL, R. D., VAN DER LAAN, M. J. and WELLNER, J. A. 1993. Inefficient estimators of the bivariate survival function for three models. Ann. Inst. H. Poincare Probab. Statist.. ´ 31 547 597. Z.
HEITJAN, D. F. and RUBIN, D. B. 1991. Ignorability and coarse data. Ann. Statist. 19 2244 2253. Z. HOFFMANN-JøRGENSEN, J. 1984. Stochastic processes on Polish spaces. Unpublished manuscript. Z.
NEUHAUS, G. 1971. On weak convergence of stochastic processes with multidimensional time parameter. Ann. Math. Statist. 42 1285 1295. Z.
Mathematical Reviews (MathSciNet): MR293706
Zentralblatt MATH: 0222.60013
Digital Object Identifier: doi:10.1214/aoms/1177693241
Project Euclid: euclid.aoms/1177693241
PARTHASARATHY, K. R. 1967. Probability Measures on Metric Spaces. Academic Press, New York. Z.
Mathematical Reviews (MathSciNet): MR37:2271
POLLARD, D. 1990. Empirical Processes: Theory and Applications. IMS, Hay ward, CA. Z.
Mathematical Reviews (MathSciNet): MR93e:60046
Zentralblatt MATH: 0741.60001
PRENTICE, R. L. and CAI, J. 1992a. Covariance and survivor function estimation using censored multivariate failure time data. Biometrika 79 495 512. Z.
Mathematical Reviews (MathSciNet): MR1187604
Digital Object Identifier: doi:10.1093/biomet/79.3.495
PRENTICE, R. L. and CAI, J. 1992b. Marginal and conditional models for the analysis of Z multivariate failure time data. In Survival Analy sis State of the Art Klein, J. P. and. Goel, P. K., eds.. Kluwer, Dordrecht. Z.
PRUITT, R. C. 1991a. On negative mass assigned by the bivariate Kaplan Meier estimator. Ann. Statist. 19 443 453. Z.
Mathematical Reviews (MathSciNet): MR1091861
Zentralblatt MATH: 0738.62049
Digital Object Identifier: doi:10.1214/aos/1176347992
Project Euclid: euclid.aos/1176347992
PRUITT, R. C. 1991b. Strong consistency of self-consistent estimators: general theory and an application to bivariate survival analysis. Technical Report 543, Univ. Minnesota. Z.
PRUITT, R. C. 1993. Small sample comparisons of six bivariate survival curve estimators. J. Statist. Comput. Simulation. 45 147 167. Z.
TSAI, W-Y., LEURGANS, S. and CROWLEY, J. 1986. Nonparametric estimation of a bivariate survival function in the presence of censoring. Ann. Statist. 14 1351 1365. Z.
Mathematical Reviews (MathSciNet): MR868304
Zentralblatt MATH: 0625.62027
Digital Object Identifier: doi:10.1214/aos/1176350162
Project Euclid: euclid.aos/1176350162
TURNBULL, B. W. 1976. The empirical distribution with arbitrarily grouped censored and truncated data. J. Roy. Statist. Soc. Ser. B 38 290 295. Z.
Mathematical Reviews (MathSciNet): MR58:31567
VAN DER LAAN, M. J. 1990. Dabrowska's multivariate product limit estimator and the deltamethod. Master's dissertation, Dept. Mathematics, Univ. Utrecht, The Netherlands. Z.
VAN DER LAAN, M. J. 1993. General identity for linear parameters in convex models with Z. application to efficiency of the NP MLE. Preprint 765, Dept. Mathematics, Univ. Utrecht, The Netherlands.
VAN DER LAAN, M. J. 1994. Modified EM-estimator of the bivariate survival function. 3 213 243. Math. Methods Statist.. Z.
Mathematical Reviews (MathSciNet): MR96a:62043
Zentralblatt MATH: 0824.62037
VAN DER LAAN, M. J. 1995. Efficiency of the NPMLE in a general class of missing data models. Unpublished manuscript. Z.
VAN DER LAAN, M. J. 1996. Efficient and inefficient estimation in semiparametric models. Technical Report, CWI, Amsterdam. Z.
VAN DER VAART, A. W. 1988. Statistical estimation in large parameter spaces. CWI Tract 44. Centrum Wisk. Inform. Amsterdam. Z.
Mathematical Reviews (MathSciNet): MR89e:62049
Zentralblatt MATH: 0629.62035
VAN DER VAART, A. W. AND WELLNER, J. A. 1995. Weak Convergence and Empirical Processes.
IMS, Hay ward, CA.
UNIVERSITY OF CALIFORNIA, BERKELEY
BERKELEY, CALIFORNIA 94720

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics