Abstract
The automated analysis of flow cytometry measurements is an active research field. We introduce a new algorithm, referred to as , using regularized optimal transport to directly estimate the different cell population proportions from a biological sample characterized with flow cytometry measurements. We rely on the regularized Wasserstein metric to compare cytometry measurements from different samples, thus accounting for possible misalignment of a given cell population across samples (due to technical variability from the technology of measurements). In this work we rely on a supervised learning technique, based on the Wasserstein metric, that is used to estimate an optimal reweighting of class proportions in a mixture model from a source distribution (with known segmentation into cell sub-populations) to fit a target distribution with unknown segmentation. Due to the high dimensionality of flow cytometry data, we use stochastic algorithms to approximate the regularized Wasserstein metric to solve the optimization problem involved in the estimation of optimal weights representing the cell population proportions in the target distribution. Several flow cytometry data sets are used to illustrate the performances of that are also compared to those of existing algorithms for automatic gating based on supervised learning.
Funding Statement
Experiments presented in this paper were carried out using the PlaFRIM experimental testbed, supported by Inria, CNRS (LABRI and IMB), Université de Bordeaux, Bordeaux INP and Conseil Régional d’Aquitaine.
Jérémie Bigot is a member of Institut Universitaire de France (IUF), and this work has been carried out with financial support from the IUF.
Acknowledgments
The authors would like to thank the Associate Editor and the three referees for their suggestions and constructive comments which helped to improve the paper substantially. The author would also like to thank Kalidou Ba for developing the CytOpT packages.
B. P. Hejblum is also affiliated with Vaccine Research Institute (VRI), 94010 Créteil, France.
Citation
Paul Freulon. Jérémie Bigot. Boris P. Hejblum. "CytOpT: Optimal transport with domain adaptation for interpreting flow cytometry data." Ann. Appl. Stat. 17 (2) 1086 - 1104, June 2023. https://doi.org/10.1214/22-AOAS1660
Information