Open Access
2020 Kernel machines with missing responses
Tiantian Liu, Yair Goldberg
Electron. J. Statist. 14(2): 3766-3820 (2020). DOI: 10.1214/20-EJS1752

Abstract

Missing responses is a common type of data where the interested outcomes are not always observed. In this paper, we develop two new kernel machines to handle such a case, which can be used for both regression and classification. The first proposed kernel machine uses $\text{only}$ the complete cases where both response and covariates are observed. It is, however, subject to some assumption limitations. Our second proposed doubly-robust kernel machine overcomes such limitations regardless of the misspecification of either the missing mechanism or the conditional distribution of the response. Theoretical properties, including the oracle inequalities for the excess risk, universal consistency, and learning rates are established. We demonstrate the superiority of the proposed methods to some existing methods by simulation and illustrate their application to a real data set concerning a survey about homeless people.

Citation

Download Citation

Tiantian Liu. Yair Goldberg. "Kernel machines with missing responses." Electron. J. Statist. 14 (2) 3766 - 3820, 2020. https://doi.org/10.1214/20-EJS1752

Information

Received: 1 October 2019; Published: 2020
First available in Project Euclid: 17 October 2020

zbMATH: 07270277
MathSciNet: MR4164463
Digital Object Identifier: 10.1214/20-EJS1752

Subjects:
Primary: 60K35

Keywords: consistency , doubly-robust estimator , inverse probability weighted estimator , Kernel machines , learning rate , missing responses , Oracle inequality

Vol.14 • No. 2 • 2020
Back to Top