Open Access
September 2017 A novel and efficient algorithm for de novo discovery of mutated driver pathways in cancer
Binghui Liu, Chong Wu, Xiaotong Shen, Wei Pan
Ann. Appl. Stat. 11(3): 1481-1512 (September 2017). DOI: 10.1214/17-AOAS1042

Abstract

Next-generation sequencing studies on cancer somatic mutations have discovered that driver mutations tend to appear in most tumor samples, but they barely overlap in any single tumor sample, presumably because a single driver mutation can perturb the whole pathway. Based on the corresponding new concepts of coverage and mutual exclusivity, new methods can be designed for de novo discovery of mutated driver pathways in cancer. Since the computational problem is a combinatorial optimization with an objective function involving a discontinuous indicator function in high dimension, many existing optimization algorithms, such as a brute force enumeration, gradient descent and Newton’s methods, are practically infeasible or directly inapplicable. We develop a new algorithm based on a novel formulation of the problem as nonconvex programming and nonconvex regularization. The method is computationally more efficient, effective and scalable than existing Monte Carlo searching and several other algorithms, which have been applied to The Cancer Genome Atlas (TCGA) project. We also extend the new method for integrative analysis of both mutation and gene expression data. We demonstrate the promising performance of the new methods with applications to three cancer datasets to discover de novo mutated driver pathways.

Citation

Download Citation

Binghui Liu. Chong Wu. Xiaotong Shen. Wei Pan. "A novel and efficient algorithm for de novo discovery of mutated driver pathways in cancer." Ann. Appl. Stat. 11 (3) 1481 - 1512, September 2017. https://doi.org/10.1214/17-AOAS1042

Information

Received: 1 September 2015; Revised: 1 March 2017; Published: September 2017
First available in Project Euclid: 5 October 2017

zbMATH: 1379.62076
MathSciNet: MR3709567
Digital Object Identifier: 10.1214/17-AOAS1042

Keywords: DNA sequencing , driver mutations , optimization , subset selection , truncated $L_{1}$ penalty

Rights: Copyright © 2017 Institute of Mathematical Statistics

Vol.11 • No. 3 • September 2017
Back to Top