December 2024 Deep neural networks for nonparametric interaction models with diverging dimension
Sohom Bhattacharya, Jianqing Fan, Debarghya Mukherjee
Ann. Statist. 52(6): 2738-2766 (December 2024). DOI: 10.1214/24-AOS2442

Abstract

Deep neural networks have achieved tremendous success due to their representation power and adaptation to low-dimensional structures. Their potential for estimating structured regression functions has recently been established in the literature. However, most of these studies require the input dimension to be fixed; consequently, they ignore the effect of dimension on the rate of convergence, which hampers their application to modern big data with high dimensionality. In this paper, we bridge this gap by analyzing a k-way nonparametric interaction model in both the growing-dimension scenario (d grows with n but at a slower rate) and the high-dimensional scenario (d ≳ n). In the latter case, sparsity assumptions and associated regularization are required to obtain optimal convergence rates. A new challenge in the diverging-dimension setting arises in calculating the mean-square error: the covariance terms among estimated additive components are an order of magnitude larger than the variance terms and can deteriorate statistical properties without proper care. We introduce a critical debiasing technique to address this problem. We show that, under certain standard assumptions, debiased deep neural networks achieve a minimax optimal rate in terms of both n and d. Our proof techniques rely crucially on this novel debiasing technique, which makes the covariances of additive components negligible in the mean-square error calculation. In addition, we establish matching lower bounds.

Funding Statement

This paper is supported by ONR N00014-22-1-2340 and the NSF grants DMS-2052926, DMS-2053832, DMS-2210833.

Citation

Sohom Bhattacharya, Jianqing Fan, Debarghya Mukherjee. "Deep neural networks for nonparametric interaction models with diverging dimension." Ann. Statist. 52(6): 2738-2766, December 2024. https://doi.org/10.1214/24-AOS2442

Information

Received: 1 February 2023; Revised: 1 April 2024; Published: December 2024
First available in Project Euclid: 18 December 2024

MathSciNet: MR4842825
Digital Object Identifier: 10.1214/24-AOS2442

Subjects:
Primary: 62G08, 62M45

Keywords: Deep neural networks, High-dimensional statistics, Minimax rate, nonparametric interaction model, sparse nonparametric components

Rights: Copyright © 2024 Institute of Mathematical Statistics
