期刊论文详细信息
BMC Medical Informatics and Decision Making
Utilizing imbalanced electronic health records to predict acute kidney injury by ensemble learning and time series model
Yake Wei1  Jingwei Li2  Yubo Zhou3  Yuan Wang4  Qin Wu5  Hao Yang5 
[1] Center for Cyber Security, University of Electronic Science and Technology of China, ChengDu, China;Center for Cyber Security, University of Electronic Science and Technology of China, ChengDu, China;State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China;College of Artificial Intelligence, Tianjin University of Science and Technology, 300222, TianJin, China;College of Artificial Intelligence, Tianjin University of Science and Technology, 300222, TianJin, China;Population and Precision Health Care (Tianjin), Ltd, TianJin, China;Department of Critical Care Medicine, West China Hospital, Sichuan University, ChengDu, China;
关键词: Acute kidney injury (AKI);    Prediction;    Ensemble learning;    ETSM;    Drug combination;   
DOI  :  10.1186/s12911-020-01245-4
来源: Springer
PDF
【 摘 要 】

BackgroundAcute Kidney Injury (AKI) is a shared complication among Intensive Care Unit (ICU), marked by high cost, high morbidity and high mortality. As the early prediction of AKI is critical for patients’ outcomes and data mining is such a powerful prediction tool, many AKI prediction models based on machine learning methods have been proposed. Our motivation is inspired by the fact that the incidence of AKI is a changing temporal sequence affected by the joint action of patients’ daily drug combinations and their physiological indexes. However, most existing models have not considered such a temporal correlation. Besides, due to great challenges caused by sparse, high-dimensional and highly imbalanced clinical data, it is hard to achieve ideal performance.MethodsWe develop a fast, simple and less-costly model based on an ensemble learning algorithm, named Ensemble Time Series Model (ETSM). Besides benefiting from vital signs and laboratory results as explicit indicators, ETSM explores the effect of drug combinations as possible implicit indicators for the AKI prediction. The model transforms temporal medication information into a multidimensional vector to consider and measure drug cumulative effects that may cause AKI.ResultsWe compare ETSM with state-of-the-art models on ICUC and MIMIC III datasets. On the basis of the experimental results, our model obtains satisfactory performance (ICUC: AUC 24 hours ahead: 0.81, 48 hours ahead: 0.78; MIMIC III: AUC 24 hours ahead: 0.95, 48 hours ahead: 0.95). Meanwhile, we compare the effects of different sampling and feature generation methods on the model performance. In the ablation study, we validate that medication information improves model performance (24 hours ahead: AUC increased from 0.74 to 0.81). We also find that the model’s performance is closely related to the balanced level of the derivation dataset. The optimal ratio of major class size to minor class size for the model is found for AKI prediction.ConclusionsETSM is an effective method for the early prediction of AKI. The model verifies that AKI incidence is related to the clinical medication. In comparison with other prediction methods, ETSM provides comparable performance results and better interpretability.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202104247896821ZK.pdf 1153KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:1次