| BMC Genomics | |
| Identification of recent cases of hepatitis C virus infection using physical-chemical properties of hypervariable region 1 and a radial basis function neural network classifier | |
| Research | |
| Mahder Teka1  James Lara1  Yury Khudyakov1  | |
| [1] Division of Viral Hepatitis, National Center for HIV, Hepatitis, TB and STD Prevention, Centers for Disease Control and Prevention, 30333, Atlanta, GA, USA; | |
| 关键词: ; | |
| DOI : 10.1186/s12864-017-4269-2 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
BackgroundIdentification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identification of acute HCV infection. However, owing to complexity, both approaches are not efficient. Here, we describe a new sequence alignment-free method to discriminate between recent (R) and chronic (C) HCV infection using next-generation sequencing (NGS) data derived from the HCV hypervariable region 1 (HVR1).ResultsUsing dinucleotide auto correlation (DAC), we identified physical-chemical (PhyChem) features of HVR1 variants. Significant (p < 9.58 × 10−4) differences in the means and frequency distributions of PhyChem features were found between HVR1 variants sampled from patients with recent vs chronic (R/C) infection. Moreover, the R-associated variants were found to occupy distinct and discrete PhyChem spaces. A radial basis function neural network classifier trained on the PhyChem features of intra-host HVR1 variants accurately classified R/C-HVR1 variants (classification accuracy (CA) = 94.85%; area under the ROC curve, AUROC = 0.979), in 10-fold cross-validation). The classifier was accurate in assigning individual HVR1 variants to R/C-classes in the testing set (CA = 84.15%; AUROC = 0.912) and in detection of infection duration (R/C-class) in patients (CA = 88.45%). Statistical tests and evaluation of the classifier on randomly-labeled datasets indicate that classifiers’ CA is robust (p < 0.001) and unlikely due to random correlations (CA = 59.04% and AUROC = 0.50).ConclusionsThe PhyChem features of intra-host HVR1 variants are strongly associated with the duration of HCV infection. Application of the PhyChem biomarkers to models for detection of the R/C-state of HCV infection in patients offers a new opportunity for detection of outbreaks and for molecular surveillance. The method will be available at https://webappx.cdc.gov/GHOST/ to the authenticated users of Global Hepatitis Outbreak and Surveillance Technology (GHOST) for further testing and validation.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202311090956071ZK.pdf | 2108KB | ||
| 12864_2017_4269_Article_IEq1.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq2.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq3.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq4.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq5.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq6.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq7.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq8.gif | 1KB | Image | |
| 12864_2017_4269_Article_IEq9.gif | 1KB | Image |
【 图 表 】
12864_2017_4269_Article_IEq9.gif
12864_2017_4269_Article_IEq8.gif
12864_2017_4269_Article_IEq7.gif
12864_2017_4269_Article_IEq6.gif
12864_2017_4269_Article_IEq5.gif
12864_2017_4269_Article_IEq4.gif
12864_2017_4269_Article_IEq3.gif
12864_2017_4269_Article_IEq2.gif
12864_2017_4269_Article_IEq1.gif
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
PDF