Open Access
2017 Efficient block boundaries estimation in block-wise constant matrices: An application to HiC data
Vincent Brault, Julien Chiquet, Céline Lévy-Leduc
Electron. J. Statist. 11(1): 1570-1599 (2017). DOI: 10.1214/17-EJS1270

Abstract

In this paper, we propose a novel modeling and a new methodology for estimating the location of block boundaries in a random matrix consisting of a block-wise constant matrix corrupted with white noise. Our method consists in rewriting this problem as a variable selection issue. A penalized least-squares criterion with an $\ell_{1}$-type penalty is used for dealing with this problem. Firstly, some theoretical results ensuring the consistency of our block boundaries estimators are provided. Secondly, we explain how to implement our approach in a very efficient way. This implementation is available in the R package blockseg which can be found in the Comprehensive R Archive Network. Thirdly, we provide some numerical experiments to illustrate the statistical and numerical performance of our package, as well as a thorough comparison with existing methods. Fourthly, an empirical procedure is proposed for estimating the number of blocks. Finally, our approach is applied to HiC data which are used in molecular biology for better understanding the influence of the chromosomal conformation on the cells functioning.

Citation

Download Citation

Vincent Brault. Julien Chiquet. Céline Lévy-Leduc. "Efficient block boundaries estimation in block-wise constant matrices: An application to HiC data." Electron. J. Statist. 11 (1) 1570 - 1599, 2017. https://doi.org/10.1214/17-EJS1270

Information

Received: 1 September 2016; Published: 2017
First available in Project Euclid: 22 April 2017

zbMATH: 1362.62012
MathSciNet: MR3638971
Digital Object Identifier: 10.1214/17-EJS1270

Subjects:
Primary: 62-07 , 62F30 , 62P10
Secondary: 62F12 , 62J07

Keywords: Change-points , HiC experiments , high-dimensional sparse linear model

Vol.11 • No. 1 • 2017
Back to Top