期刊论文详细信息
BMC Medical Informatics and Decision Making
Improvement of APACHE II score system for disease severity based on XGBoost algorithm
Zhiyu Wang1  Cong Wang1  Yan Luo1 
[1] School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, 100876, Beijing, China;Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, 100876, Beijing, China;
关键词: APACHE II score system;    Machine learning;    Predictive modeling;    MIMIC III database;    Intensive care units treatment;   
DOI  :  10.1186/s12911-021-01591-x
来源: Springer
PDF
【 摘 要 】

BackgroundPrognostication is an essential tool for risk adjustment and decision making in the intensive care units (ICUs). In order to improve patient outcomes, we have been trying to develop a more effective model than Acute Physiology and Chronic Health Evaluation (APACHE) II to measure the severity of the patients in ICUs. The aim of the present study was to provide a mortality prediction model for ICUs patients, and to assess its performance relative to prediction based on the APACHE II scoring system.MethodsWe used the Medical Information Mart for Intensive Care version III (MIMIC-III) database to build our model. After comparing the APACHE II with 6 typical machine learning (ML) methods, the best performing model was screened for external validation on anther independent dataset. Performance measures were calculated using cross-validation to avoid making biased assessments. The primary outcome was hospital mortality. Finally, we used TreeSHAP algorithm to explain the variable relationships in the extreme gradient boosting algorithm (XGBoost) model.ResultsWe picked out 14 variables with 24,777 cases to form our basic data set. When the variables were the same as those contained in the APACHE II, the accuracy of XGBoost (accuracy: 0.858) was higher than that of APACHE II (accuracy: 0.742) and other algorithms. In addition, it exhibited better calibration properties than other methods, the result in the area under the ROC curve (AUC: 0.76). we then expand the variable set by adding five new variables to improve the performance of our model. The accuracy, precision, recall, F1, and AUC of the XGBoost model increased, and were still higher than other models (0.866, 0.853, 0.870, 0.845, and 0.81, respectively). On the external validation dataset, the AUC was 0.79 and calibration properties were good.ConclusionsAs compared to conventional severity scores APACHE II, our XGBoost proposal offers improved performance for predicting hospital mortality in ICUs patients. Furthermore, the TreeSHAP can help to enhance the understanding of our model by providing detailed insights into the impact of different features on the disease risk. In sum, our model could help clinicians determine prognosis and improve patient outcomes.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202109172212841ZK.pdf 1765KB PDF download
  文献评价指标  
  下载次数:1次 浏览次数:2次