期刊论文详细信息
PeerJ
GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data
article
Li Chen1  James Reeve2  Lujun Zhang3  Shengbing Huang2  Xuefeng Wang4  Jun Chen2 
[1] Department of Health Outcomes Research and Policy, Harrison School of Pharmacy, Auburn University;Bioinformatics and Computational Biology Program, University of Minnesota—Rochester;College of Environmental and Resource Sciences, Zhejiang University;Department of Biostatistics and Bioinformatics, Moffitt Cancer Center;Division of Biomedical Statistics and Informatics and Center for Individualized Medicine, Mayo Clinic
关键词: Normalization;    Metagenomics;    Microbiome;    Statistics;    Zero-inflation;    RNA-seq;   
DOI  :  10.7717/peerj.4600
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

Normalization is the first critical step in microbiome sequencing data analysis used to account for variable library sizes. Current RNA-Seq based normalization methods that have been adapted for microbiome data fail to consider the unique characteristics of microbiome data, which contain a vast number of zeros due to the physical absence or under-sampling of the microbes. Normalization methods that specifically address the zero-inflation remain largely undeveloped. Here we propose geometric mean of pairwise ratios—a simple but effective normalization method—for zero-inflated sequencing data such as microbiome data. Simulation studies and real datasets analyses demonstrate that the proposed method is more robust than competing methods, leading to more powerful detection of differentially abundant taxa and higher reproducibility of the relative abundances of taxa.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100012756ZK.pdf 2647KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:3次