期刊论文详细信息
Cardiovascular Digital Health Journal 卷:2
Natural language processing of implantable cardioverter-defibrillator reports in hypertrophic cardiomyopathy: A paradigm for longitudinal device follow-up
Paul A. Friedman, MD, FHRS1  Adelaide M. Arruda-Olson, MD, PhD2  Huzefa Bhopalwala, MD3  Peter A. Noseworthy, MD, FHRS3  Nakeya Dewaswala, MD3  Michael J. Ackerman, MD, PhD3  Rick A. Nishimura, MD3  Konstantinos C. Siontis, MD3  Jeffrey B. Geske, MD3  Steve R. Ommen, MD3  Christopher G. Scott, MS4 
[1] Departments of Pediatric and Adolescent Medicine, and Molecular Pharmacology &
[2] Experimental Therapeutics, Divisions of Heart Rhythm Services and Pediatric Cardiology, Mayo Clinic, Rochester, Minnesota;
[3] Department of Cardiovascular Medicine, Mayo Clinic, Rochester, Minnesota;
[4] Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota;
关键词: Electronic health record;    Hypertrophic cardiomyopathy;    Implantable cardioverter-defibrillators;    Natural language processing;   
DOI  :  
来源: DOAJ
【 摘 要 】

Background: The follow-up of implantable cardioverter-defibrillators (ICDs) generates large amounts of valuable structured and unstructured data embedded in device interrogation reports. Objective: We aimed to build a natural language processing (NLP) model for automated capture of ICD-recorded events from device interrogation reports using a single-center cohort of patients with hypertrophic cardiomyopathy (HCM). Methods: A total of 687 ICD interrogation reports from 247 HCM patients were included. Using a derivation set of 480 reports, we developed a rule-based NLP algorithm based on unstructured (free-text) data from the interpretation field of the ICD reports to identify sustained atrial and ventricular arrhythmias, and ICD therapies. A separate model based on structured numerical tabulated data was also developed. Both models were tested in a separate set of the 207 remaining ICD reports. Diagnostic performance was determined in reference to arrhythmia and ICD therapy annotations generated by expert manual review of the same reports. Results: The NLP system achieved sensitivity 0.98 and 0.99, and F1-scores 0.98 and 0.92 for arrhythmia and ICD therapy events, respectively. In contrast, the performance of the structured data model was significantly lower with sensitivity 0.33 and 0.76, and F1-scores 0.45 and 0.78, for arrhythmia and ICD therapy events, respectively. Conclusion: An automated NLP system can capture arrhythmia events and ICD therapies from unstructured device interrogation reports with high accuracy in HCM. These findings demonstrate the feasibility of an NLP paradigm for the extraction of data for clinical care and research from ICD reports embedded in the electronic health record.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次