Frontiers in Medicine | |
Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis | |
Medicine | |
Haibing Hua1  Ehsan Amiri-Ardekani2  Cong Wang3  Qingqing Xia3  Weilong Jiang3  Yufeng Zhang3  Yi Cheng4  Huizhe Zhang5  | |
[1] Department of Gastroenterology, Jiangyin Hospital of Traditional Chinese Medicine, Jiangyin Hospital Affiliated to Nanjing University of Chinese Medicine, Jiangyin, Jiangsu, China;Department of Phytopharmaceuticals (Traditional Pharmacy), Faculty of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran;Department of Pulmonary and Critical Care Medicine, Jiangyin Hospital of Traditional Chinese Medicine, Jiangyin Hospital Affiliated to Nanjing University of Chinese Medicine, Jiangyin, Jiangsu, China;Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China;Department of Respiratory Medicine, Yancheng Hospital of Traditional Chinese Medicine, Yancheng Hospital Affiliated to Nanjing University of Chinese Medicine, Yancheng, Jiangsu, China; | |
关键词: gene biomarker; immune infiltration; idiopathic pulmonary fibrosis; machine learning algorithm; CIBERSORT; | |
DOI : 10.3389/fmed.2023.1001813 | |
received in 2022-08-03, accepted in 2023-01-26, 发布年份 2023 | |
来源: Frontiers | |
【 摘 要 】
ObjectiveThis study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms.MethodsMicroarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differentially expressed genes (DEGs). The DEGs were subjected to enrichment analysis, and two machine learning algorithms were used to identify candidate genes associated with IPF. These genes were verified in a validation cohort from the GEO database. Receiver operating characteristic (ROC) curves were plotted to assess the predictive value of the IPF-associated genes. The cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to evaluate the proportion of immune cells in IPF and normal tissues. Additionally, the correlation between the expression of IPF-associated genes and the infiltration levels of immune cells was examined.ResultsA total of 302 upregulated and 192 downregulated genes were identified. Functional annotation, pathway enrichment, Disease Ontology and gene set enrichment analyses revealed that the DEGs were related to the extracellular matrix and immune responses. COL3A1, CDH3, CEBPD, and GPIHBP1 were identified as candidate biomarkers using machine learning algorithms, and their predictive value was verified in a validation cohort. Additionally, ROC analysis revealed that the four genes had high predictive accuracy. The infiltration levels of plasma cells, M0 macrophages and resting dendritic cells were higher and those of resting natural killer (NK) cells, M1 macrophages and eosinophils were lower in the lung tissues of patients with IPF than in those of healthy individuals. The expression of the abovementioned genes was correlated with the infiltration levels of plasma cells, M0 macrophages and eosinophils.ConclusionCOL3A1, CDH3, CEBPD, and GPIHBP1 are candidate biomarkers of IPF. Plasma cells, M0 macrophages and eosinophils may be involved in the development of IPF and may serve as immunotherapeutic targets in IPF.
【 授权许可】
Unknown
Copyright © 2023 Zhang, Wang, Xia, Jiang, Zhang, Amiri-Ardekani, Hua and Cheng.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202310105754308ZK.pdf | 6756KB | download |