期刊论文详细信息
BMC Medical Genomics
c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
Research
Sherlyn Jemimah1  Aamna AlShehhi1 
[1] Department of Biomedical Engineering, Khalifa University, PO Box 127788, Abu Dhabi, United Arab Emirates;
关键词: Alzheimer’s disease;    Biomarkers;    Neural network;    Deep learning;    Binary classification;    Genomics;    Genetics;    Gene expression;   
DOI  :  10.1186/s12920-023-01675-9
 received in 2022-11-29, accepted in 2023-09-27,  发布年份 2023
来源: Springer
PDF
【 摘 要 】

BackgroundAlzheimer’s disease (AD) is an incurable, debilitating neurodegenerative disorder. Current biomarkers for AD diagnosis require expensive neuroimaging or invasive cerebrospinal fluid sampling, thus precluding early detection. Blood-based biomarker discovery in Alzheimer’s can facilitate less-invasive, routine diagnostic tests to aid early intervention. Therefore, we propose “c-Diadem” (constrained dual-input Alzheimer’s disease model), a novel deep learning classifier which incorporates KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway constraints on the input genotyping data to predict disease, i.e., mild cognitive impairment (MCI)/AD or cognitively normal (CN). SHAP (SHapley Additive exPlanations) was used to explain the model and identify novel, potential blood-based genetic markers of MCI/AD.MethodsWe developed a novel constrained deep learning neural network which utilizes SNPs (single nucleotide polymorphisms) and microarray data from ADNI (Alzheimer’s Disease Neuroimaging Initiative) to predict the disease status of participants, i.e., CN or with disease (MCI/AD), and identify potential blood-based biomarkers for diagnosis and intervention. The dataset contains samples from 626 participants, of which 212 are CN (average age 74.6 ± 5.4 years) and 414 patients have MCI/AD (average age 72.7 ± 7.6 years). KEGG pathway information was used to generate constraints applied to the input tensors, thus enhancing the interpretability of the model. SHAP scores were used to identify genes which could potentially serve as biomarkers for diagnosis and targets for drug development.ResultsOur model’s performance, with accuracy of 69% and AUC of 70% in the test dataset, is superior to previous models. The SHAP scores show that SNPs in PRKCZ, PLCB1 and ITPR2 as well as expression of HLA-DQB1, EIF1AY, HLA-DQA1, and ZFP57 have more impact on model predictions.ConclusionsIn addition to predicting MCI/AD, our model has been interrogated for potential genetic biomarkers using SHAP. From our analysis, we have identified blood-based genetic markers related to Ca2+ ion release in affected regions of the brain, as well as depression. The findings from our study provides insights into disease mechanisms, and can facilitate innovation in less-invasive, cost-effective diagnostics. To the best of our knowledge, our model is the first to use pathway constraints in a multimodal neural network to identify potential genetic markers for AD.

【 授权许可】

CC BY   
© The Author(s) 2023

【 预 览 】
附件列表
Files Size Format View
RO202311103407643ZK.pdf 2466KB PDF download
MediaObjects/12888_2023_5281_MOESM1_ESM.docx 14KB Other download
Fig. 3 379KB Image download
12951_2015_155_Article_IEq56.gif 1KB Image download
Fig. 1 1829KB Image download
【 图 表 】

Fig. 1

12951_2015_155_Article_IEq56.gif

Fig. 3

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  文献评价指标  
  下载次数:7次 浏览次数:3次