Journal Article Details
Pathogens
Performance of Five Metagenomic Classifiers for Virus Pathogen Detection Using Respiratory Samples from a Clinical Cohort
Ellen C. Carbo [1]; Aloys C. M. Kroes [1]; Eric C. J. Claas [1]; Sander van Boheemen [1]; Anneloes L. van Rijn-Klink [1]; Jutte J. C. de Vries [1]; Igor A. Sidorov [1]; Pieter S. Hiemstra [2]; Tomas M. Eagan [3]; Nikos Pappas [4]; Hailiang Mei [4]
[1] Department of Medical Microbiology, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands; [2] Department of Pulmonology, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands; [3] Department of Thoracic Medicine, Haukeland University Hospital, 5021 Bergen, Norway; [4] Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands
Keywords: viral metagenomics; bioinformatics; pathogen detection; next-generation sequencing
DOI: 10.3390/pathogens11030340
Source: DOAJ
【 Abstract 】

Viral metagenomics is increasingly applied in clinical diagnostic settings for detection of pathogenic viruses. While several benchmarking studies have been published on the use of metagenomic classifiers for abundance and diversity profiling of bacterial populations, studies on the comparative performance of the classifiers for virus pathogen detection are scarce. In this study, metagenomic data sets (n = 88) from a clinical cohort of patients with respiratory complaints were used for comparison of the performance of five taxonomic classifiers: Centrifuge, Clark, Kaiju, Kraken2, and Genome Detective. A total of 1144 positive and negative PCR results for a total of 13 respiratory viruses were used as gold standard. Sensitivity and specificity of these classifiers ranged from 83 to 100% and 90 to 99%, respectively, and were dependent on the classification level and data pre-processing. Exclusion of human reads generally resulted in increased specificity. Normalization of read counts for genome length had a minor effect on overall performance; however, it negatively affected the detection of targets with read counts around the detection level. Correlation of sequence read counts with PCR Ct-values varied per classifier, per data pre-processing approach (R² range 15.1–63.4%), and per virus, with outliers up to 3 log10 reads in magnitude beyond the predicted read count for viruses with high sequence diversity. In this benchmarking study, sensitivity and specificity were within the ranges of use for diagnostic practice when the cut-off for defining a positive result was considered per classifier.

【 License 】

Unknown   
