Adaptive Bayesian density regression for high-dimensional data

Weining Shen; Subhashis Ghosal

doi:10.3150/14-BEJ663

February 2016 Adaptive Bayesian density regression for high-dimensional data

Weining Shen, Subhashis Ghosal

Bernoulli 22(1): 396-420 (February 2016). DOI: 10.3150/14-BEJ663

Abstract

Density regression provides a flexible strategy for modeling the distribution of a response variable $Y$ given predictors $\mathbf{X}=(X_{1},\ldots,X_{p})$ by letting that the conditional density of $Y$ given $\mathbf{X}$ as a completely unknown function and allowing its shape to change with the value of $\mathbf{X}$. The number of predictors $p$ may be very large, possibly much larger than the number of observations $n$, but the conditional density is assumed to depend only on a much smaller number of predictors, which are unknown. In addition to estimation, the goal is also to select the important predictors which actually affect the true conditional density. We consider a nonparametric Bayesian approach to density regression by constructing a random series prior based on tensor products of spline functions. The proposed prior also incorporates the issue of variable selection. We show that the posterior distribution of the conditional density contracts adaptively at the truth nearly at the optimal oracle rate, determined by the unknown sparsity and smoothness levels, even in the ultra high-dimensional settings where $p$ increases exponentially with $n$. The result is also extended to the anisotropic case where the degree of smoothness can vary in different directions, and both random and deterministic predictors are considered. We also propose a technique to calculate posterior moments of the conditional density function without requiring Markov chain Monte Carlo methods.

Citation

Download Citation

Weining Shen. Subhashis Ghosal. "Adaptive Bayesian density regression for high-dimensional data." Bernoulli 22 (1) 396 - 420, February 2016. https://doi.org/10.3150/14-BEJ663

Information

Received: 1 July 2013; Revised: 1 June 2014; Published: February 2016

First available in Project Euclid: 30 September 2015

zbMATH: 06543275

MathSciNet: MR3449788

Digital Object Identifier: 10.3150/14-BEJ663

Keywords: adaptive estimation , density regression , high-dimensional models , MCMC-free computation , nonparametric Bayesian inference , Posterior contraction rate , Variable selection

Access the abstract

JOURNAL ARTICLE
25 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY