In biological experiments, researchers often have information in the form of a graph that supplements the observed numerical data. Incorporating the knowledge contained in these graphs into an analysis of the numerical data is an important and nontrivial task. We look at the example of metagenomic data, that is, data from a genomic survey of the abundance of different species of bacteria in a sample. Here, the graph of interest is a phylogenetic tree depicting the relationships among the bacterial species. We illustrate that analyzing the data in a nonstandard inner-product space effectively uses this additional graphical information and produces more meaningful results.
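To make the idea of a nonstandard inner-product space concrete, the following minimal sketch runs a principal component analysis in which the usual dot product is replaced by x^T Q y for a positive semi-definite matrix Q. Everything here is an illustrative assumption rather than the paper's own procedure: the function name `generalized_pca`, the toy abundance table, and the matrix `Q_tree`, which is built by hand from a hypothetical four-species tree ((A,B),(C,D)) with unit branch lengths by recording the branch length each pair of species shares from the root.

```python
import numpy as np

def generalized_pca(X, Q, n_components=2):
    """PCA of the rows of X under the inner product <x, y>_Q = x^T Q y.

    X : (n_samples, n_species) abundance table.
    Q : (n_species, n_species) symmetric positive semi-definite matrix.
    """
    X = np.asarray(X, dtype=float)
    Q = np.asarray(Q, dtype=float)
    Xc = X - X.mean(axis=0)              # center each species column

    # Symmetric square root of Q from its eigendecomposition.
    w, V = np.linalg.eigh(Q)
    w = np.clip(w, 0.0, None)            # guard against tiny negative eigenvalues
    Q_half = (V * np.sqrt(w)) @ V.T

    # Ordinary PCA on the transformed data is PCA under the Q inner product.
    Z = Xc @ Q_half
    U, s, _ = np.linalg.svd(Z, full_matrices=False)
    scores = U[:, :n_components] * s[:n_components]
    variances = s[:n_components] ** 2 / X.shape[0]
    return scores, variances

# Hypothetical tree ((A,B),(C,D)), every branch of length 1 (an assumption).
# Entry (i, j) is the branch length shared by species i and j from the root.
Q_tree = np.array([
    [2.0, 1.0, 0.0, 0.0],
    [1.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, 2.0, 1.0],
    [0.0, 0.0, 1.0, 2.0],
])

# Made-up abundance counts for 6 samples and 4 species.
rng = np.random.default_rng(0)
abundances = rng.poisson(5.0, size=(6, 4)).astype(float)

scores_tree, var_tree = generalized_pca(abundances, Q_tree)    # tree-aware
scores_std, var_std = generalized_pca(abundances, np.eye(4))   # standard PCA
```

With `Q` set to the identity the function reduces to ordinary PCA, so comparing `scores_tree` with `scores_std` shows how much the assumed tree structure changes the sample configuration. Closely related species contribute correlated coordinates under `Q_tree`, so differences spread across sister species are weighted differently from differences between distant clades.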
"Analysis of a data matrix and a graph: Metagenomic data and the phylogenetic tree." Ann. Appl. Stat. 5 (4) 2326 - 2358, December 2011. https://doi.org/10.1214/10-AOAS402