December 2024 DeepMap: Deep learning-based single-cell data integration using iterative cell matching and structure preservation constraints
Shuntuo Xu, Zhou Yu, Jingsi Ming
Author Affiliations +
Ann. Appl. Stat. 18(4): 3596-3613 (December 2024). DOI: 10.1214/24-AOAS1954

Abstract

Effective integration of single-cell data can facilitate the discovery of cell-type specific gene expression patterns and cellular interactions, ultimately leading to a better understanding of various biological processes and diseases. However, datasets from different platforms, species, and modalities exhibit various levels of heterogeneities, posing significant challenges in data alignment using a unified approach. Here we propose DeepMap, a flexible and efficient method for single-cell data integration, by taking advantage of the deep learning framework. Our method utilizes iterative cell matching based on mutual nearest neighbors, leverages an autoencoder framework to learn harmonized representations of cells from various datasets, and incorporates a covariance penalty term into the framework for structure preservation. In addition to harmonization of data from different datasets, we specifically take account of the preservation of important biological variations within dataset, which is crucial to reliable downstream analysis. Comprehensive real data analysis demonstrates the flexibility of DeepMap for diverse datasets from different platforms, species, and modalities, and highlights its marked ability in preserving structures over existing integration methods with enhanced computational efficiency and optimized memory usage. The robust DeepMap-integrated data offers promising prospects for advancing our understanding of cell biology, hence making it a highly attractive option for integrative single-cell data analysis.

Funding Statement

This work was supported by the National Key R&D Program of China (No. 2021YFA1000100, 2021YFA1000101), the National Natural Science Foundation of China (No. 12201219, 12371289), Shanghai Sailing Program (No. 21YF1410600), Shanghai Key Program of Computational Biology (No. 23JS1400500, 23JS1400800), the Basic Research Project of Shanghai Science and Technology Commission (Grant No. 22JC1400800), and the Shanghai Pilot Program for Basic Research (Grant No. TQ20220105).

Acknowledgments

The authors thank the Editor, Associate Editor, and anonymous reviewers for their constructive feedback on earlier versions of this paper. All correspondence should be addressed to Jingsi Ming (the corresponding author) at jsming@fem.ecnu.edu.cn.

Citation

Download Citation

Shuntuo Xu. Zhou Yu. Jingsi Ming. "DeepMap: Deep learning-based single-cell data integration using iterative cell matching and structure preservation constraints." Ann. Appl. Stat. 18 (4) 3596 - 3613, December 2024. https://doi.org/10.1214/24-AOAS1954

Information

Received: 1 February 2024; Revised: 1 July 2024; Published: December 2024
First available in Project Euclid: 31 October 2024

Digital Object Identifier: 10.1214/24-AOAS1954

Keywords: deep learning , iterative cell matching , Single-cell data integration , structure preservation constraints

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.18 • No. 4 • December 2024
Back to Top