Open Access
March 2021 Model free estimation of graphical model using gene expression data
Jenny Yang, Yang Liu, Yufeng Liu, Wei Sun
Author Affiliations +
Ann. Appl. Stat. 15(1): 194-207 (March 2021). DOI: 10.1214/20-AOAS1380

Abstract

Graphical model is a powerful and popular approach to study high-dimensional omic data, such as genome-wide gene expression data. Nonlinear relations between genes are widely documented. However, partly due to sparsity of data points in high-dimensional space (i.e., curse of dimensionality) and computational challenges, most available methods construct graphical models by testing linear relations. We propose to address this challenge by a two-step approach: first, use a model-free approach to prioritize the neighborhood of each gene; then, apply a nonparametric conditional independence testing method to refine such neighborhood estimation. Our method, named as “mofreds” (MOdel FRee Estimation of DAG Skeletons), seeks to estimate the skeleton of a directed acyclic graph (DAG) by this two-step approach. We studied the theoretical properties of mofreds and evaluated its performance in extensive simulation settings. We found mofreds has substantially better performance than the state-of-the art method which is designed to detect linear relations of Gaussian graphical models. We applied mofreds to analyze gene expression data of breast cancer patients from The Cancer Genome Atlas (TCGA). We found that it discovers nonlinear relationships among gene pairs that are missed by the Gaussian graphical model methods.

Acknowledgments

This work is supported, in part, by NIH grants GM126550 and GM105785 and NSF Grant DMS-1821231.

Citation

Download Citation

Jenny Yang. Yang Liu. Yufeng Liu. Wei Sun. "Model free estimation of graphical model using gene expression data." Ann. Appl. Stat. 15 (1) 194 - 207, March 2021. https://doi.org/10.1214/20-AOAS1380

Information

Received: 1 August 2019; Revised: 1 March 2020; Published: March 2021
First available in Project Euclid: 18 March 2021

Digital Object Identifier: 10.1214/20-AOAS1380

Keywords: directed acyclic graphs , graphical models , model free

Rights: Copyright © 2021 Institute of Mathematical Statistics

Vol.15 • No. 1 • March 2021
Back to Top