期刊论文详细信息
BMC Medical Research Methodology
Optimising the use of electronic health records to estimate the incidence of rheumatoid arthritis in primary care: what information is hidden in free text?
Jackie A Cassell5  Tim Williams6  Irene Petersen2  Kevin A Davies1  Greta Rait2  Helen E Smith5  Lesley Axelrod4  John Carroll4  A Rosemary Tate4  Rob Koeling4  Amanda Nicholson3  Elizabeth Ford5 
[1]Division of Clinical Medicine, Brighton and Sussex Medical School, Brighton, UK
[2]Research Department of Primary Care and Population Health, University College London, London, UK
[3]Now at: Faculty of Health and Medicine, Lancaster University, Lancaster, UK
[4]Department of Informatics, University of Sussex, Brighton, UK
[5]Division of Primary Care and Public Health, Brighton and Sussex Medical School, Mayfield House, University of Brighton, Falmer, Brighton BN1 9PH, UK
[6]Clinical Practice Research Datalink, the Medicines and Healthcare products Regulatory Agency, London, UK
关键词: Coding;    Free text;    Rheumatoid arthritis;    Electronic medical records;    Electronic health records;   
Others  :  1091954
DOI  :  10.1186/1471-2288-13-105
 received in 2012-11-29, accepted in 2013-08-07,  发布年份 2013
PDF
【 摘 要 】

Background

Primary care databases are a major source of data for epidemiological and health services research. However, most studies are based on coded information, ignoring information stored in free text. Using the early presentation of rheumatoid arthritis (RA) as an exemplar, our objective was to estimate the extent of data hidden within free text, using a keyword search.

Methods

We examined the electronic health records (EHRs) of 6,387 patients from the UK, aged 30 years and older, with a first coded diagnosis of RA between 2005 and 2008. We listed indicators for RA which were present in coded format and ran keyword searches for similar information held in free text. The frequency of indicator code groups and keywords from one year before to 14 days after RA diagnosis were compared, and temporal relationships examined.

Results

One or more keyword for RA was found in the free text in 29% of patients prior to the RA diagnostic code. Keywords for inflammatory arthritis diagnoses were present for 14% of patients whereas only 11% had a diagnostic code. Codes for synovitis were found in 3% of patients, but keywords were identified in an additional 17%. In 13% of patients there was evidence of a positive rheumatoid factor test in text only, uncoded. No gender differences were found. Keywords generally occurred close in time to the coded diagnosis of rheumatoid arthritis. They were often found under codes indicating letters and communications.

Conclusions

Potential cases may be missed or wrongly dated when coded data alone are used to identify patients with RA, as diagnostic suspicions are frequently confined to text. The use of EHRs to create disease registers or assess quality of care will be misleading if free text information is not taken into account. Methods to facilitate the automated processing of text need to be developed and implemented.

【 授权许可】

   
2013 Ford et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150128175338223.pdf 437KB PDF download
Figure 1. 78KB Image download
【 图 表 】

Figure 1.

【 参考文献 】
  • [1]Blumenthal D: Launching HITECH. NEnglJ Med 2010, 362(5):382-385.
  • [2]Blumenthal D, Tavenner M: The “meaningful Use” regulation for electronic health records. NEnglJ Med 2010, 363(6):501-504.
  • [3]Department of Health: Liberating the NHS: An Information Revolution. A consultation on proposals. London, UK: Department of Health; 2010.
  • [4]Atreja A, Achkar J, Jain A, Harris C, Lashner B: Using technology to promote gastrointestinal outcomes research: a case for electronic health records. Am J Gastroenterol 2008, 103(9):2171-2178.
  • [5]van Staa T-P, Goldacre B, Gulliford M, Cassell J, Pirmohamed M, Taweel A, Delaney B, Smeeth L: Pragmatic randomised trials using routine electronic health records: putting them to the test. BMJ 2012, 344:e55.
  • [6]British Medical Association: Quality and Outcomes Framework guidance for GMS contract 2011/12. London, UK: British Medical Association; 2011.
  • [7]Nicholson A, Ford E, Davies K, Smith H, Rait G, Tate R, Peterson I, Cassell J: Optimising Use of electronic health records to describe the presentation of rheumatoid arthritis in primary care: a strategy for developing code lists. PLoS ONE 2013, 8(2):e54878.
  • [8]Dave S, Petersen I: Creating medical and drug code lists to identify cases in primary care databases. PharmacoepidemiolDrug Saf 2009.
  • [9]Jordan K, Porcheret M, Kadam UT, Croft P: The use of general practice consultation databases in rheumatology research. Rheumatology 2006, 45(2):126-128.
  • [10]Manuel DG, Rosella LC, Stukel TA: Importance of accurately identifying disease in studies using electronic health records. BMJ 2010, 341:c4226.
  • [11]Jordan K, Porcheret M, Croft P: Quality of morbidity coding in general practice computerized medical records: a systematic review. Fam Pract 2004, 21(4):396-412.
  • [12]Rhodes ET, Gonzalez TV, Laffel LMB, Ludwig DS: Accuracy of administrative coding for type 2 diabetes in children, adolescents, and young adults. Diabetes Care 2007, 30(1):141-143.
  • [13]Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF: Extracting information from textual documents in the electronic health record: a review of recent research. YearbMed Inform 2008, 128-144.
  • [14]Nice: Rheumatoid arthritis. The management of rheumatoid arthritis in adults. Clinical guideline 79. London: NICE; 2009.
  • [15]Tate AR, Martin AG, Murray-Thomas T, Anderson SR, Cassell JA: Determining the date of diagnosis–is it a simple matter? The impact of different approaches to dating diagnosis on estimates of delayed care for ovarian cancer in UK primary care. BMC Med Res Methodol 2009, 9:42. BioMed Central Full Text
  • [16]Pascoe SW, Neal RD, Heywood PL, Allgar VL, Miles JN, Stefoski-Mikeljevic J: Identifying patients with a cancer diagnosis using general practice medical records and cancer registry data. Fam Pract 2008, 25(4):215-220.
  • [17]Hillestad R, Bigelow J, Bower A, Girosi F, Meili R, Scoville R, Taylor R: Can ElectronicMedical record systems transform health care? Potential health benefits, savings. And costs. Health Aff 2005, 24(5):1103-1107.
  • [18]Carroll RJ, Eyler AE, Denny JC: Naïve electronic health record phenotype identification for rheumatoid arthritis. AMIA Annu Symp Proc 2011, 2011:189-196.
  • [19]Carroll RJ, Thompson WK, Eyler AE, Mandelin AM, Cai T, Zink RM, Pacheco JA, Boomershine CS, Lasko TA, Xu H, et al.: Portability of an algorithm to identify rheumatoid arthritis in electronic health records. J Am Med Inform Assoc 2012, 19:e162-e169.
  • [20]Fact Sheet UMLS® Metathesaurus®. http://www.nlm.nih.gov/pubs/factsheets/umlsmeta.html webcite
  • [21]Tate AR, Martin AGR, Ali A, Cassell JA: Using free text information to explore how and when GPs code a diagnosis of ovarian cancer: an observational study using primary care records of patients with ovarian cancer. BMJ open 2011, 1:e000025.
  • [22]Linsell L, Dawson J, Zondervan K, Randall T, Rose P, Carr A, Fitzpatrick R: Prospective study of elderly people comparing treatments following first primary care consultation for a symptomatic hip or knee. Fam Pract 2005, 22(1):118-125.
  • [23]Hanauer DA, Englesbe MJ, Cowan Jr JA, Campbell DA: Informatics and the american college of surgeons national surgical quality improvement program: automated processes could replace manual record review. J Am Coll Surg 2009, 208:37-41.
  • [24]Voorham J, Denig P: Computerized extraction of information on the quality of diabetes care from free text in electronic patient records of general practitioners. J Am Med Inform Assoc 2007, 14(3):349-354.
  • [25]Hanauer DA, Miela G, Chinnaiyan AM, Chang AE, Blayney DW: The registry case finding engine: an automated tool to identify cancer cases from unstructured, free-text pathology reports and clinical notes. J Am CollSurg 2007, 205(5):690-697.
  • [26]Jordan K, Jinks C, Croft P: Health care utilisation: measurement using primary care records and patient recall both showed bias. J Clin Epidemiol 2006, 59:791-797.
  • [27]Pakhomov SS, Hemingway H, Weston SA, Jacobsen SJ, Rodeheffer R, Roger VL: Epidemiology of angina pectoris: role of natural language processing of the medical record. Am Heart J 2007, 153(4):666-673.
  • [28]DeLisle S, South B, Anthony JA, Kalp E, Gundlapallli A, Curriero FC, Glass GE, Samore M, Perl TM: Combining free text and structured electronic medical record entries to detect acute respiratory infections. PLoS ONE 2010, 5(10):e13377.
  • [29]Liao KP, Cai T, Gainer V, Goryachev S, Zeng-Treitler Q, Raychaudhuri S, Szolovits P, Churchill S, Murphy S, Kohane I, et al.: Electronic medical records for discovery research in rheumatoid arthritis. Arthritis Care Res (Hoboken) 2010, 62(8):1120-1127.
  • [30]Koeling R, Tate AR, Carroll JA: Automatically estimating the incidence of symptoms recording in GP free text notes. In Proceedings of MIXHS’11 2011. Glasgow, Scotland, UK; 2011.
  • [31]Greenhalgh T, Swinglehurst D: Studying technology use as social practice: the untapped potential of ethnography. BMC Med 2011, 9(1):45. BioMed Central Full Text
  • [32]Swinglehurst D, Greenhalgh T, Myall M, Russell J: Ethnographic study of ICT-supported collaborative work routines in general practice. BMC Health Serv Res 2010, 10(1):348. BioMed Central Full Text
  • [33]Rosenbloom ST, Denny JC, Xu H, Lorenzi NM, Stead WW, Johnson KB: Data from clinical notes: a perspective on the tension between structure and flexible documentation. J Am Med Inform Assoc 2011, 18:181-186.
  • [34]Zheng K, Hanauer DA, Padman R, Johnson MP, Hussain AA, Ye W, Zhou X, Diamond HS: Handling anticipated exceptions in clinical care: investigating clinical use of ‘exit strategies’ in an electronic health records system. J Am Med Inform Assoc 2011, 18:883-889.
  • [35]Carroll J, Koeling R, Puri S: Lexical aquisition for clinical text mining using distributional similarity. In Proceedings of the 13th International Conference on Text Processing and Computational Linguistics (CICLing): 2012; IIT. Delhi, India: Springer Lecture Notes in Computer Science; 2012:232-246.
  文献评价指标  
  下载次数:14次 浏览次数:19次