Abstract
Effective integration of single-cell data can facilitate the discovery of cell-type specific gene expression patterns and cellular interactions, ultimately leading to a better understanding of various biological processes and diseases. However, datasets from different platforms, species, and modalities exhibit various levels of heterogeneities, posing significant challenges in data alignment using a unified approach. Here we propose DeepMap, a flexible and efficient method for single-cell data integration, by taking advantage of the deep learning framework. Our method utilizes iterative cell matching based on mutual nearest neighbors, leverages an autoencoder framework to learn harmonized representations of cells from various datasets, and incorporates a covariance penalty term into the framework for structure preservation. In addition to harmonization of data from different datasets, we specifically take account of the preservation of important biological variations within dataset, which is crucial to reliable downstream analysis. Comprehensive real data analysis demonstrates the flexibility of DeepMap for diverse datasets from different platforms, species, and modalities, and highlights its marked ability in preserving structures over existing integration methods with enhanced computational efficiency and optimized memory usage. The robust DeepMap-integrated data offers promising prospects for advancing our understanding of cell biology, hence making it a highly attractive option for integrative single-cell data analysis.
Funding Statement
This work was supported by the National Key R&D Program of China (No. 2021YFA1000100, 2021YFA1000101), the National Natural Science Foundation of China (No. 12201219, 12371289), Shanghai Sailing Program (No. 21YF1410600), Shanghai Key Program of Computational Biology (No. 23JS1400500, 23JS1400800), the Basic Research Project of Shanghai Science and Technology Commission (Grant No. 22JC1400800), and the Shanghai Pilot Program for Basic Research (Grant No. TQ20220105).
Acknowledgments
The authors thank the Editor, Associate Editor, and anonymous reviewers for their constructive feedback on earlier versions of this paper. All correspondence should be addressed to Jingsi Ming (the corresponding author) at jsming@fem.ecnu.edu.cn.
Citation
Shuntuo Xu. Zhou Yu. Jingsi Ming. "DeepMap: Deep learning-based single-cell data integration using iterative cell matching and structure preservation constraints." Ann. Appl. Stat. 18 (4) 3596 - 3613, December 2024. https://doi.org/10.1214/24-AOAS1954
Information