期刊论文详细信息
PeerJ
Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization
article
Xun Zhu1  Travers Ching1  Xinghua Pan3  Sherman M. Weissman3  Lana Garmire1 
[1] Epidemiology Program, University of Hawaii Cancer Center;Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa;Department of Genetics, Yale University
关键词: Single-cell;    RNA-Seq;    Heterogeneity;    Non-negative matrix factorization;    Modularity;    Clustering;    Subpopulation;    Single cell sequencing;    Single cell;    Feature gene;   
DOI  :  10.7717/peerj.2888
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of non-negative matrix factorization (NMF) method to analyze a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data. In comparison to other unsupervised clustering methods including K-means and hierarchical clustering, NMF has higher accuracy in separating similar groups in various datasets. We ranked genes by their importance scores (D-scores) in separating these groups, and discovered that NMF uniquely identifies genes expressed at intermediate levels as top-ranked genes. Finally, we show that in conjugation with the modularity detection method FEM, NMF reveals meaningful protein-protein interaction modules. In summary, we propose that NMF is a desirable method to analyze heterogeneous single-cell RNA-Seq data. The NMF based subpopulation detection package is available at: https://github.com/lanagarmire/NMFEM.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100014412ZK.pdf 1977KB PDF download
  文献评价指标  
  下载次数:1次 浏览次数:1次