| BMC Bioinformatics | |
| Comparison of different cell type correction methods for genome-scale epigenetics studies | |
| Research Article | |
| Akhilesh Kaushal1  Wilfried J. J. Karmaus1  Hongmei Zhang1  Meredith Ray1  Shu-Li Wang2  Alicia K. Smith3  Mylin A. Torres4  | |
| [1] Division of Epidemiology, Biostatistics, and Environmental Health, University of Memphis, 38152, Memphis, TN, USA;National Institute of Environmental Health Sciences, National Health Research Institutes, Miaoli, Taiwan;Winship Cancer Institute, Emory University, 1365 Clifton Rd. NE, 30322, Atlanta, GA, USA;Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, 101 Woodruff Circle, Suite 4000, 30322, Atlanta, GA, USA;Winship Cancer Institute, Emory University, 1365 Clifton Rd. NE, 30322, Atlanta, GA, USA;Department of Radiation Oncology, Emory University School of Medicine, 1365 Clifton Rd. NE, 30322, Atlanta, GA, USA; | |
| 关键词: Cell-type composition; CpG sites; Genome-scale DNA methylation; Surrogate variables; | |
| DOI : 10.1186/s12859-017-1611-2 | |
| received in 2016-12-08, accepted in 2017-03-24, 发布年份 2017 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
BackgroundWhole blood is frequently utilized in genome-wide association studies of DNA methylation patterns in relation to environmental exposures or clinical outcomes. These associations can be confounded by cellular heterogeneity. Algorithms have been developed to measure or adjust for this heterogeneity, and some have been compared in the literature. However, with new methods available, it is unknown whether the findings will be consistent, if not which method(s) perform better.ResultsMethods: We compared eight cell-type correction methods including the method in the minfi R package, the method by Houseman et al., the Removing unwanted variation (RUV) approach, the methods in FaST-LMM-EWASher, ReFACTor, RefFreeEWAS, and RefFreeCellMix R programs, along with one approach utilizing surrogate variables (SVAs). We first evaluated the association of DNA methylation at each CpG across the whole genome with prenatal arsenic exposure levels and with cancer status, adjusted for estimated cell-type information obtained from different methods. We then compared CpGs showing statistical significance from different approaches. For the methods implemented in minfi and proposed by Houseman et al., we utilized homogeneous data with composition of some blood cells available and compared them with the estimated cell compositions. Finally, for methods not explicitly estimating cell compositions, we evaluated their performance using simulated DNA methylation data with a set of latent variables representing “cell types”.Results: Results from the SVA-based method overall showed the highest agreement with all other methods except for FaST-LMM-EWASher. Using homogeneous data, minfi provided better estimations on cell types compared to the originally proposed method by Houseman et al. Further simulation studies on methods free of reference data revealed that SVA provided good sensitivities and specificities, RefFreeCellMix in general produced high sensitivities but specificities tended to be low when confounding is present, and FaST-LMM-EWASher gave the lowest sensitivity but highest specificity.ConclusionsResults from real data and simulations indicated that SVA is recommended when the focus is on the identification of informative CpGs. When appropriate reference data are available, the method implemented in the minfi package is recommended. However, if no such reference data are available or if the focus is not on estimating cell proportions, the SVA method is suggested.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202311102884445ZK.pdf | 584KB |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]
- [47]
- [48]
- [49]
PDF