April 2023 Optimally tackling covariate shift in RKHS-based nonparametric regression
Cong Ma, Reese Pathak, Martin J. Wainwright
Author Affiliations +
Ann. Statist. 51(2): 738-761 (April 2023). DOI: 10.1214/23-AOS2268

Abstract

We study the covariate shift problem in the context of nonparametric regression over a reproducing kernel Hilbert space (RKHS). We focus on two natural families of covariate shift problems defined using the likelihood ratios between the source and target distributions. When the likelihood ratios are uniformly bounded, we prove that the kernel ridge regression (KRR) estimator with a carefully chosen regularization parameter is minimax rate-optimal (up to a log factor) for a large family of RKHSs with regular kernel eigenvalues. Interestingly, KRR does not require full knowledge of the likelihood ratio apart from an upper bound on it. In striking contrast to the standard statistical setting without covariate shift, we also demonstrate that a naïve estimator, which minimizes the empirical risk over the function class, is strictly suboptimal under covariate shift as compared to KRR. We then address the larger class of covariate shift problems where likelihood ratio is possibly unbounded yet has a finite second moment. Here, we propose a reweighted KRR estimator that weights samples based on a careful truncation of the likelihood ratios. Again, we are able to show that this estimator is minimax optimal, up to logarithmic factors.

Funding Statement

This work was partially supported by NSF-DMS Grant 2015454, NSF-IIS Grant 1909365, as well as Office of Naval Research Grant DOD-ONR-N00014-18-1-2640 to MJW. RP was partially supported by a Berkeley Fellowship and an ARCS Fellowship.

Acknowledgments

The authors would like to thank the anonymous referees, an Asso- ciate Editor and the Editor for their constructive comments that improved the quality of this paper. The authors would also like to thank Samory Kpotufe for helpful discussions on transfer learning. MJW is currently on leave from UC Berkeley EECS and Statistics.

Citation

Download Citation

Cong Ma. Reese Pathak. Martin J. Wainwright. "Optimally tackling covariate shift in RKHS-based nonparametric regression." Ann. Statist. 51 (2) 738 - 761, April 2023. https://doi.org/10.1214/23-AOS2268

Information

Received: 1 May 2022; Revised: 1 October 2022; Published: April 2023
First available in Project Euclid: 13 June 2023

zbMATH: 07714179
MathSciNet: MR4601000
Digital Object Identifier: 10.1214/23-AOS2268

Subjects:
Primary: 62C20
Secondary: 62G08

Keywords: covariate shift , kernel ridge regression , Nonparametric regression , reproducing kernel Hilbert spaces , transfer learning

Rights: Copyright © 2023 Institute of Mathematical Statistics

JOURNAL ARTICLE
24 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.51 • No. 2 • April 2023
Back to Top