期刊论文详细信息
BMC Medical Informatics and Decision Making
Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
Research
Xiaoyang Lian1  Jingchen Zhong1  Mengqian Yuan1  Ming Wang2  Xiaojie Li3  Juanzhi Qi4  Tao Yang4  Gang Li4 
[1] Affiliated Hospital of Nanjing University of Chinese Medicine,Jiangsu Province Hospital of Chinese Medicine, 210029, Nanjing, Jiangsu, China;Geriatric Hospital of Nanjing Medical University, Jiangsu Province Official Hospital, 210036, Nanjing, Jiangsu, China;Jiangsu Health Vocational College, 210036, Nanjing, Jiangsu, China;School of Artificial Intelligence and Information Technology, Nanjing University of Chinese Medicine, 210023, Nanjing, Jiangsu, China;
关键词: Machine learning;    Data analysis;    Diabetes;    Diabetic peripheral neuropathy;   
DOI  :  10.1186/s12911-023-02232-1
 received in 2023-04-20, accepted in 2023-07-12,  发布年份 2023
来源: Springer
PDF
【 摘 要 】

BackgroundDiabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials.MethodsWe retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models’ discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model.ResultsThe XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors.ConclusionsThe machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN.

【 授权许可】

CC BY   
© The Author(s) 2023

【 预 览 】
附件列表
Files Size Format View
RO202309156529359ZK.pdf 1639KB PDF download
Fig. 2 2478KB Image download
Fig. 1 24KB Image download
42490_2023_74_Article_IEq3.gif 1KB Image download
Fig. 8 1446KB Image download
42490_2023_74_Article_IEq30.gif 1KB Image download
42490_2023_74_Article_IEq40.gif 1KB Image download
42490_2023_74_Article_IEq42.gif 1KB Image download
Fig. 2 349KB Image download
Fig. 1 87KB Image download
【 图 表 】

Fig. 1

Fig. 2

42490_2023_74_Article_IEq42.gif

42490_2023_74_Article_IEq40.gif

42490_2023_74_Article_IEq30.gif

Fig. 8

42490_2023_74_Article_IEq3.gif

Fig. 1

Fig. 2

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  文献评价指标  
  下载次数:17次 浏览次数:0次