Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 9, Number 4 (2015), 2052-2072.
Identifying heterogeneous transgenerational DNA methylation sites via clustering in beta regression
This paper explores transgenerational DNA methylation patterns (DNA methylation transmitted from one generation to the next) via a clustering approach. Beta regression is employed to model the transmission pattern from parents to their offspring at the population level. To fit the model, we propose an expectation-maximization algorithm for parameter estimation, together with the Bayesian information criterion (BIC) to determine the number of clusters. Applying our method to DNA methylation data comprising 4063 CpG sites from 41 mother-father-infant triads, we identified a set of CpG sites at which DNA methylation transmission is dominated by fathers, while at a large number of CpG sites DNA methylation is mainly maternally transmitted to the offspring.
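The ingredients named in the abstract (a beta regression likelihood, an EM-style cluster assignment, and BIC) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the common mean-precision parameterization of the beta distribution with a logit link, and untransformed parental methylation levels as covariates; the paper's exact specification may differ. All names (`site_loglik`, `responsibilities`, the `coef` and `phi` fields) are hypothetical, and the M-step, which would update the coefficients numerically, is omitted.

```python
import math

def logistic(x):
    """Inverse logit link, mapping a linear predictor into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def beta_logpdf(y, mu, phi):
    """Log-density of Beta(mu*phi, (1-mu)*phi) at y in (0, 1).
    Assumed mean-precision parameterization: mean mu, precision phi."""
    a = mu * phi
    b = (1.0 - mu) * phi
    return (math.lgamma(phi) - math.lgamma(a) - math.lgamma(b)
            + (a - 1.0) * math.log(y) + (b - 1.0) * math.log(1.0 - y))

def site_loglik(triads, coef, phi):
    """Log-likelihood of one CpG site under one cluster.
    triads: list of (infant, mother, father) methylation levels in (0, 1).
    coef: (intercept, maternal slope, paternal slope); hypothetical layout."""
    b0, bm, bf = coef
    total = 0.0
    for infant, mother, father in triads:
        mu = logistic(b0 + bm * mother + bf * father)
        total += beta_logpdf(infant, mu, phi)
    return total

def responsibilities(triads, clusters, weights):
    """E-step for one site: posterior probability of each cluster,
    computed with the log-sum-exp trick for numerical stability."""
    logs = [math.log(w) + site_loglik(triads, c["coef"], c["phi"])
            for c, w in zip(clusters, weights)]
    top = max(logs)
    exps = [math.exp(v - top) for v in logs]
    norm = sum(exps)
    return [e / norm for e in exps]

def bic(loglik, n_params, n_obs):
    """BIC in the convention where smaller values are preferred."""
    return -2.0 * loglik + n_params * math.log(n_obs)
```

In a full procedure one would alternate this E-step with an M-step over all sites, refit for several candidate numbers of clusters, and keep the fit with the smallest BIC. A cluster whose paternal slope dominates its maternal slope would then correspond to the paternally dominated sites described above, under the assumptions of this sketch.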
Received: October 2014
Revised: August 2015
First available in Project Euclid: 28 January 2016
Han, Shengtong; Zhang, Hongmei; Lockett, Gabrielle A.; Mukherjee, Nandini; Holloway, John W.; Karmaus, Wilfried. Identifying heterogeneous transgenerational DNA methylation sites via clustering in beta regression. Ann. Appl. Stat. 9 (2015), no. 4, 2052-2072. doi:10.1214/15-AOAS865. https://projecteuclid.org/euclid.aoas/1453994191