| Journal of Clinical Bioinformatics | |
| Extraction of echocardiographic data from the electronic medical record is a rapid and efficient method for study of cardiac structure and function | |
| Dana C Crawford1  Eric Farber-Eger2  Quinn S Wells3  | |
| [1] Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN 37232, USA;Center for Human Genetics Research, Vanderbilt University, Nashville, TN 37232, USA;Vanderbilt University Medical Center, 2525 West End Avenue, Suite 300, Nashville TN 37203, USA | |
| 关键词: Natural language processing; Echocardiography; Electronic health records; | |
| Others : 1133396 DOI : 10.1186/2043-9113-4-12 |
|
| received in 2014-06-23, accepted in 2014-09-11, 发布年份 2014 | |
PDF
|
|
【 摘 要 】
Background
Measures of cardiac structure and function are important human phenotypes that are associated with a range of clinical outcomes. Studying these traits in large populations can be time consuming and costly. Utilizing data from large electronic medical records (EMRs) is one possible solution to this problem. We describe the extraction and filtering of quantitative transthoracic echocardiographic data from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study, a large, racially diverse, EMR-based cohort (n = 15,863).
Results
There were 6,076 echocardiography reports for 2,834 unique adult subjects. Missing data were uncommon with over 90% of data points present. Data irregularities are primarily related to inconsistent use of measurement units and transcriptional errors. The reported filtering method requires manual review of very few data points (<1%), and filtered echocardiographic parameters are similar to published data from epidemiologic populations of similar ethnicity. Moreover, the cohort is comparable in size, and in some cases larger than community-based cohorts of similar race/ethnicity.
Conclusions
These results demonstrate that echocardiographic data can be efficiently extracted from EMRs, and suggest that EMR-based cohorts have the potential to make major contributions toward the study of epidemiologic and genotype-phenotype associations for cardiac structure and function in diverse populations.
【 授权许可】
2014 Wells et al.; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150304144227501.pdf | 889KB | ||
| Figure 6. | 30KB | Image | |
| Figure 5. | 63KB | Image | |
| Figure 4. | 55KB | Image | |
| Figure 3. | 64KB | Image | |
| Figure 2. | 81KB | Image | |
| Figure 1. | 81KB | Image |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
【 参考文献 】
- [1]Vasan RS, Larson MG, Levy D, Evans JC, Benjamin EJ: Distribution and categorization of echocardiographic measurements in relation to reference limits: the Framingham Heart Study: formulation of a height- and sex-specific classification and its prospective validation. Circulation 1997, 96:1863-1873.
- [2]Vasan RS, Larson MG, Benjamin EJ, Evans JC, Levy D: Left ventricular dilatation and the risk of congestive heart failure in people without myocardial infarction. N Engl J Med 1997, 336:1350-1355.
- [3]Benjamin EJ, D’Agostino RB, Belanger AJ, Wolf PA, Levy D: Left atrial size and the risk of stroke and death. The Framingham Heart Study. Circulation 1995, 92:835-841.
- [4]Gardin JM, Arnold AM, Polak J, Jackson S, Smith V, Gottdiener J: Usefulness of aortic root dimension in persons ≥65 years of age in predicting heart failure, stroke, cardiovascular mortality, all-cause mortality and acute myocardial infarction (from the cardiovascular health study). Am J Cardiol 2006, 97:270-275.
- [5]Wojczynski MK, Tiwari HK: Definition of phenotype. Adv Genet 2008, 60:75-105.
- [6]Denny JC, Crawford DC, Ritchie MD, Bielinski SJ, Basford MA, Bradford Y, Chai HS, Bastarache L, Zuvich R, Peissig P, Carrell D, Ramirez AH, Pathak J, Wilke RA, Rasmussen L, Wang X, Pacheco JA, Kho AN, Hayes MG, Weston N, Matsumoto M, Kopp PA, Newton KM, Jarvik GP, Li R, Manolio TA, Kullo IJ, Chute CG, Chisholm RL, Larson EB, et al.: Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet 2011, 89:529-542.
- [7]Gioli-Pereira L, Bernardez-Pereira S, Marcondes-Braga FG, Spina JMR, da Silva RMM, Ferreira NE, Bacal F, Fernandes FB, Mansur AJ, Krieger JE, Pereira AC: Genetic and electroNic medIcal records to predict outcomes in heart failure patients (GENIUS-HF) - design and rationale. BMC Cardiovasc Disord 2014, 14:1-5. BioMed Central Full Text
- [8]Kullo IJ, Ding K, Jouni H, Smith CY, Chute CG: A genome-wide association study of red blood cell traits using the electronic medical record. PLoS One 2010., 5(9) doi:10.1371/journal.pone.0013011
- [9]Crosslin DR, McDavid A, Weston N, Nelson SC, Zheng X, Hart E, de Andrade M, Kullo IJ, McCarty CA, Doheny KF, Pugh E, Kho A, Hayes MG, Pretel S, Saip A, Ritchie MD, Crawford DC, Crane PK, Newton K, Li R, Mirel DB, Crenshaw A, Larson EB, Carlson CS, Jarvik GP, Electronic Medical Records and Genomics (eMERGE) Network: Genetic variants associated with the white blood cell count in 13,923 subjects in the eMERGE Network. Hum Genet 2012, 131:639-652.
- [10]Denny JC, Ritchie MD, Crawford DC, Schildcrout JS, Ramirez AH, Pulley JM, Basford MA, Masys DR, Haines JL, Roden DM: Identification of genomic predictors of atrioventricular conduction: using electronic medical records as a tool for genome science. Circulation 2010, 122:2016-2021.
- [11]Kullo IJ, Ding K, Shameer K, McCarty CA, Jarvik GP, Denny JC, Ritchie MD, Ye Z, Crosslin DR, Chisholm RL, Manolio TA, Chute CG: Complement receptor 1 gene variants are associated with erythrocyte sedimentation rate. Am J Hum Genet 2011, 89:131-138.
- [12]Roden DM, Pulley JM, Basford MA, Bernard GR, Clayton EW, Balser JR, Masys DR: Development of a large-scale de-identified DNA biobank to enable personalized medicine. Clin Pharmacol Ther 2008, 84:362-369.
- [13]Ritchie MD, Denny JC, Crawford DC, Ramirez AH, Weiner JB, Pulley JM, Basford MA, Brown-Gentry K, Balser JR, Masys DR, Haines JL, Roden DM: Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet 2010, 86:560-572.
- [14]Matise TC, Ambite JL, Buyske S, Carlson CS, Cole SA, Crawford DC, Haiman CA, Heiss G, Kooperberg C, Marchand LL, Manolio TA, North KE, Peters U, Ritchie MD, Hindorff LA, Haines JL, for the PAGE Study: The Next PAGE in understanding complex traits: design for the analysis of population architecture using genetics and epidemiology (PAGE) study. Am J Epidemiol 2011, 174:849-859.
- [15]Garvin JH, DuVall SL, South BR, Bray BE, Bolton D, Heavirland J, Pickard S, Heidenreich P, Shen S, Weir C, Samore M, Goldstein MK: Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure. J Am Med Inform Assoc 2012, 19:859-866.
- [16]Chung J, Murphy S: Concept-value pair extraction from semi-structured clinical narrative: a case study using echocardiogram reports. AMIA Annu Symp Proc 2005, 2005:131-135.
- [17]Fox ER, Musani SK, Barbalic M, Lin H, Yu B, Ogunyankin KO, Smith NL, Kutlar A, Glazer NL, Post WS, Paltoo DN, Dries DL, Farlow DN, Duarte CW, Kardia SL, Meyers KJ, Sun YV, Arnett DK, Patki AA, Sha J, Cui X, Samdarshi TE, Penman AD, Bibbins-Domingo K, Buzkova P, Benjamin EJ, Bluemke DA, Morrison AC, Heiss G, Carr JJ, et al.: Genome-wide association study of cardiac structure and systolic function in African Americans: the candidate gene association resource (CARe) study. Circ Cardiovasc Genet 2013, 6:37-46.
PDF