Frontiers in Genetics | |
Identification of the diagnostic genes and immune cell infiltration characteristics of gastric cancer using bioinformatics analysis and machine learning | |
Genetics | |
Longfei Liu1  Xianzhou Lu1  Rongjun Xie2  Guoxin Li3  Chengjian He4  | |
[1] Department of General Surgery, Affiliated Nanhua Hospital, Hengyang Medical School, University of South China, Hengyang, China;Department of General Surgery, Nanfang Hospital, The First School of Clinical Medicine, Southern Medical University, Guangzhou, China;Department of General Surgery, Affiliated Nanhua Hospital, Hengyang Medical School, University of South China, Hengyang, China;Guangdong Provincial Key Laboratory of Precision Medicine for Gastrointestinal Tumor, Nanfang Hospital, The First School of Clinical Medicine, Southern Medical University, Guangzhou, China;Department of General Surgery, Nanfang Hospital, The First School of Clinical Medicine, Southern Medical University, Guangzhou, China;Guangdong Provincial Key Laboratory of Precision Medicine for Gastrointestinal Tumor, Nanfang Hospital, The First School of Clinical Medicine, Southern Medical University, Guangzhou, China;Department of Intensive Care Medicine, Affiliated Nanhua Hospital, Hengyang Medical School, University of South China, Hengyang, China; | |
关键词: gastric cancer; diagnostic gene; immune cell infiltration; bioinformatics analysis; machine learning; LASSO; SVM-RFE; | |
DOI : 10.3389/fgene.2022.1067524 | |
received in 2022-10-12, accepted in 2022-12-05, 发布年份 2023 | |
来源: Frontiers | |
【 摘 要 】
Background: Finding reliable diagnostic markers for gastric cancer (GC) is important. This work uses machine learning (ML) to identify GC diagnostic genes and investigate their connection with immune cell infiltration.Methods: We downloaded eight GC-related datasets from GEO, TCGA, and GTEx. GSE13911, GSE15459, GSE19826, GSE54129, and GSE79973 were used as the training set, GSE66229 as the validation set A, and TCGA & GTEx as the validation set B. First, the training set screened differentially expressed genes (DEGs), and gene ontology (GO), kyoto encyclopedia of genes and genomes (KEGG), disease Ontology (DO), and gene set enrichment analysis (GSEA) analyses were performed. Then, the candidate diagnostic genes were screened by LASSO and SVM-RFE algorithms, and receiver operating characteristic (ROC) curves evaluated the diagnostic efficacy. Then, the infiltration characteristics of immune cells in GC samples were analyzed by CIBERSORT, and correlation analysis was performed. Finally, mutation and survival analyses were performed for diagnostic genes.Results: We found 207 up-regulated genes and 349 down-regulated genes among 556 DEGs. gene ontology analysis significantly enriched 413 functional annotations, including 310 biological processes, 23 cellular components, and 80 molecular functions. Six of these biological processes are closely related to immunity. KEGG analysis significantly enriched 11 signaling pathways. 244 diseases were closely related to Ontology analysis. Multiple entries of the gene set enrichment analysis analysis were closely related to immunity. Machine learning screened eight candidate diagnostic genes and further validated them to identify ABCA8, COL4A1, FAP, LY6E, MAMDC2, and TMEM100 as diagnostic genes. Six diagnostic genes were mutated to some extent in GC. ABCA8, COL4A1, LY6E, MAMDC2, TMEM100 had prognostic value.Conclusion: We screened six diagnostic genes for gastric cancer through bioinformatic analysis and machine learning, which are intimately related to immune cell infiltration and have a definite prognostic value.
【 授权许可】
Unknown
Copyright © 2023 Xie, Liu, Lu, He and Li.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202310109527609ZK.pdf | 4035KB | download |