期刊论文详细信息
BMC Medical Research Methodology
Multivariate longitudinal data for survival analysis of cardiovascular event prediction in young adults: insights from a comparative explainable study
Research
Hieu T. Nguyen1  Henrique D. Vasconcellos2  Kimberley Keck2  João A.C. Lima3  Eliseo Guallar4  Cora E. Lewis5  Donald M. Lloyd-Jones6  Bharath Ambale-Venkatesh7  Steven Sidney8  Colin O. Wu9  Jared P. Reis9  Pamela J. Schreiner1,10 
[1] Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA;Department of Cardiology, Johns Hopkins University, Baltimore, MD, USA;Department of Cardiology, Johns Hopkins University, Baltimore, MD, USA;Department of Radiology, Johns Hopkins University, Baltimore, MD, USA;Department of Epidemiology, Johns Hopkins University School of Public Health, Baltimore, MD, USA;Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA;Department of Preventive Medicine, Northwestern University, Chicago, IL, USA;Department of Radiology, Johns Hopkins University, Baltimore, MD, USA;Division of Research, Kaiser Permanente, Oakland, CA, USA;National Heart, Lung, and Blood Institute, Bethesda, MD, USA;School of Public Health, University of Minnesota, Minneapolis, MN, USA;
关键词: Longitudinal data;    Explainable AI;    Survival analysis;    Risk prediction;    Repeated measures;    Personalized medicine;    Time-varying covariates;    SHAP;    TIME;    CARDIA;   
DOI  :  10.1186/s12874-023-01845-4
 received in 2022-09-19, accepted in 2023-01-18,  发布年份 2023
来源: Springer
PDF
【 摘 要 】

BackgroundMultivariate longitudinal data are under-utilized for survival analysis compared to cross-sectional data (CS - data collected once across cohort). Particularly in cardiovascular risk prediction, despite available methods of longitudinal data analysis, the value of longitudinal information has not been established in terms of improved predictive accuracy and clinical applicability.MethodsWe investigated the value of longitudinal data over and above the use of cross-sectional data via 6 distinct modeling strategies from statistics, machine learning, and deep learning that incorporate repeated measures for survival analysis of the time-to-cardiovascular event in the Coronary Artery Risk Development in Young Adults (CARDIA) cohort. We then examined and compared the use of model-specific interpretability methods (Random Survival Forest Variable Importance) and model-agnostic methods (SHapley Additive exPlanation (SHAP) and Temporal Importance Model Explanation (TIME)) in cardiovascular risk prediction using the top-performing models.ResultsIn a cohort of 3539 participants, longitudinal information from 35 variables that were repeatedly collected in 6 exam visits over 15 years improved subsequent long-term (17 years after) risk prediction by up to 8.3% in C-index compared to using baseline data (0.78 vs. 0.72), and up to approximately 4% compared to using the last observed CS data (0.75). Time-varying AUC was also higher in models using longitudinal data (0.86–0.87 at 5 years, 0.79–0.81 at 10 years) than using baseline or last observed CS data (0.80–0.86 at 5 years, 0.73–0.77 at 10 years). Comparative model interpretability analysis revealed the impact of longitudinal variables on model prediction on both the individual and global scales among different modeling strategies, as well as identifying the best time windows and best timing within that window for event prediction. The best strategy to incorporate longitudinal data for accuracy was time series massive feature extraction, and the easiest interpretable strategy was trajectory clustering.ConclusionOur analysis demonstrates the added value of longitudinal data in predictive accuracy and epidemiological utility in cardiovascular risk survival analysis in young adults via a unified, scalable framework that compares model performance and explainability. The framework can be extended to a larger number of variables and other longitudinal modeling methods.Trial registrationClinicalTrials.gov Identifier: NCT00005130, Registration Date: 26/05/2000.

【 授权许可】

CC BY   
© The Author(s) 2023

【 预 览 】
附件列表
Files Size Format View
RO202305111488164ZK.pdf 2562KB PDF download
41116_2022_35_Article_IEq96.gif 1KB Image download
41116_2022_35_Article_IEq101.gif 1KB Image download
Fig. 3 110KB Image download
41116_2022_35_Article_IEq116.gif 1KB Image download
41116_2022_35_Article_IEq123.gif 1KB Image download
Fig. 4 622KB Image download
【 图 表 】

Fig. 4

41116_2022_35_Article_IEq123.gif

41116_2022_35_Article_IEq116.gif

Fig. 3

41116_2022_35_Article_IEq101.gif

41116_2022_35_Article_IEq96.gif

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  • [51]
  • [52]
  • [53]
  • [54]
  • [55]
  • [56]
  • [57]
  • [58]
  • [59]
  • [60]
  • [61]
  • [62]
  • [63]
  • [64]
  • [65]
  • [66]
  • [67]
  • [68]
  • [69]
  • [70]
  • [71]
  文献评价指标  
  下载次数:2次 浏览次数:2次