| Journal of computational biology | |
| Supervised Adversarial Alignment of Single-Cell RNA-seq Data | |
| article | |
| Songwei Ge1  Haohan Wang2  Amir Alavi1  Eric Xing2  Ziv Bar-joseph1  | |
| [1] Computational Biology Department, Carnegie Mellon University;Language Technologies Institute, Carnegie Mellon University;Machine Learning Department, Carnegie Mellon University | |
| 关键词: batch effect removal; data integration; dimensionality reduction; domain adversarial training; single-cell RNA-seq.; | |
| DOI : 10.1089/cmb.2020.0439 | |
| 来源: Mary Ann Liebert, Inc. Publishers | |
PDF
|
|
【 摘 要 】
Dimensionality reduction is an important first step in the analysis of single-cell RNAsequencing (scRNA-seq) data. In addition to enabling the visualization of the profiled cells, such representations are used by many downstream analyses methods ranging from pseudotime reconstruction to clustering to alignment of scRNA-seq data from different experiments, platforms, and laboratories. Both supervised and unsupervised methods have been proposed to reduce the dimension of scRNA-seq. However, all methods to date are sensitive to batch effects. When batches correlate with cell types, as is often the case, their impact can lead to representations that are batch rather than cell-type specific. To overcome this, we developed a domain adversarial neural network model for learning a reduced dimension representation of scRNA-seq data. The adversarial model tries to simultaneously optimize two objectives. The first is the accuracy of cell-type assignment and the second is the inability to distinguish the batch (domain). We tested the method by using the resulting representation to align several different data sets. As we show, by overcoming batch effects our method was able to correctly separate cell types, improving on several prior methods suggested for this task. Analysis of the top features used by the network indicates that by taking the batch impact into account, the reduced representation is much better able to focus on key genes for each cell type.
【 授权许可】
Unknown
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202108110003442ZK.pdf | 702KB |
PDF