期刊论文

【摘要】

BackgroundMulti-sample comparison is commonly used in cancer genomics studies. By using next-generation sequencing (NGS), a mutation's status in a specific sample can be measured by the number of reads supporting mutant or wildtype alleles. When no mutant reads are detected, it could represent either a true negative mutation status or a false negative due to an insufficient number of reads, so-called "coverage". To minimize the chance of false-negative, we should consider the mutation status as "unknown" instead of "negative" when the coverage is inadequately low. There is no established method for determining the coverage threshold between negative and unknown statuses. A common solution is to apply a universal minimum coverage (UMC). However, this method relies on an arbitrarily chosen threshold, and it does not take into account the mutations' relative abundances, which can vary dramatically by the type of mutations. The result could be misclassification between negative and unknown statuses.MethodsWe propose an adaptive mutation-specific negative (MSN) method to improve the discrimination between negative and unknown mutation statuses. For a specific mutation, a non-positive sample is compared with every known positive sample to test the null hypothesis that they may contain the same frequency of mutant reads. The non-positive sample can only be claimed as “negative” when this null hypothesis is rejected with all known positive samples; otherwise, the status would be “unknown”.ResultsWe first compared the performance of MSN and UMC methods in a simulated dataset containing varying tumor cell fractions. Only the MSN methods appropriately assigned negative statuses for samples with both high- and low-tumor cell fractions. When evaluated on a real dual-platform single-cell sequencing dataset, the MSN method not only provided more accurate assessments of negative statuses but also yielded three times more available data after excluding the “unknown” statuses, compared with the UMC method.ConclusionsWe developed a new adaptive method for distinguishing unknown from negative statuses in multi-sample comparison NGS data. The method can provide more accurate negative statuses than the conventional UMC method and generate a remarkably higher amount of available data by reducing unnecessary “unknown” calls.

【授权许可】

CC BY

【预览】

附件列表
Files	Size	Format	View
RO202203043413888ZK.pdf	1272KB	PDF	download

BMC Medical Genomics
An adaptive method of defining negative mutation status for multi-sample comparison using next-generation sequencing

Mitsuko Murakami¹ Song Liu² Nicholas Hutson² James Graham² Lei Wei² Li Yan² Sujana Ganaparti² Qiang Hu² Fenglin Zhan³ Han Zhang⁴ Changxing Ma⁴ Jun Xie⁵
[1] Center for Personalized Medicine, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA;Department of Chemistry and Physics, Indiana State University, Terre Haute, IN, USA;Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA;Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA;PET/CT Center, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology, 230001, Hefei, China;Department of Biostatistics, University At Buffalo, Buffalo, NY, USA;Department of Statistics, Purdue University, West Lafayette, IN, USA;
关键词: Negative status; Tumor heterogeneity; Liquid biopsy; Next-generation sequencing; Genetic testing; Personalized medicine;
DOI : 10.1186/s12920-021-00880-8
来源: Springer
PDF


	文献评价指标
	下载次数：9次	浏览次数：1次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】