期刊论文详细信息
BMC Medical Research Methodology
The promise of record linkage for assessing the uptake of health services in resource constrained settings: a pilot study from South Africa
Georges Reniers5  Jane Menken1  Stephen Tollman4  Francesc Xavier Gómez-Olivé3  Benjamin D Clark2  Chodziwadziwa W Kabudula3 
[1]Institute of Behavioral Science, University of Colorado, Boulder, Colorado, USA
[2]Department of Ecology, Evolution and Environmental Biology, Columbia University, New York, USA
[3]MRC/Wits Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
[4]INDEPTH Network, Accra, Ghana
[5]Department of Population Health, London School of Hygiene and Tropical Medicine, London, UK
关键词: Population surveillance;    South Africa;    Health facilities;    Record linkage;    Health and Demographic Surveillance System (HDSS);   
Others  :  865403
DOI  :  10.1186/1471-2288-14-71
 received in 2014-02-07, accepted in 2014-05-16,  发布年份 2014
PDF
【 摘 要 】

Background

Health and Demographic Surveillance Systems (HDSS) have been instrumental in advancing population and health research in low- and middle- income countries where vital registration systems are often weak. However, the utility of HDSS would be enhanced if their databases could be linked with those of local health facilities. We assess the feasibility of record linkage in rural South Africa using data from the Agincourt HDSS and a local health facility.

Methods

Using a gold standard dataset of 623 record pairs matched by means of fingerprints, we evaluate twenty record linkage scenarios (involving different identifiers, string comparison techniques and with and without clerical review) based on the Fellegi-Sunter probabilistic record linkage model. Matching rates and quality are measured by their sensitivity and positive predictive value (PPV). Background characteristics of matched and unmatched cases are compared to assess systematic bias in the resulting record-linked dataset.

Results

A hybrid approach of deterministic followed by probabilistic record linkage, and scenarios that use an extended set of identifiers including another household member’s first name yield the best results. The best fully automated record linkage scenario has a sensitivity of 83.6% and PPV of 95.1%. The sensitivity and PPV increase to 84.3% and 96.9%, respectively, when clerical review is undertaken on 10% of the record pairs. The likelihood of being linked is significantly lower for females, non-South Africans and the elderly.

Conclusion

Using records matched by means of fingerprints as the gold standard, we have demonstrated the feasibility of fully automated probabilistic record linkage using identifiers that are routinely collected in health facilities in South Africa. Our study also shows that matching statistics can be improved if other identifiers (e.g., another household member’s first name) are added to the set of matching variables, and, to a lesser extent, with clerical review. Matching success is, however, correlated with background characteristics that are indicative of the instability of personal attributes over time (e.g., surname in the case of women) or with misreporting (e.g., age).

【 授权许可】

   
2014 Kabudula et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20140726063254651.pdf 308KB PDF download
43KB Image download
【 图 表 】

【 参考文献 】
  • [1]Sankoh O: Global health estimates: stronger collaboration needed with low-and middle-income countries. PLoS Med 2010, 7(11):e1001005.
  • [2]Sankoh O, Byass P: The INDEPTH Network: filling vital gaps in global epidemiology. Int J Epidemiol 2012, 41(3):579-588.
  • [3]Ye Y, Wamukoya M, Ezeh A, Emina J, Sankoh O: Health and demographic surveillance systems: a step towards full civil registration and vital statistics system in sub-Sahara Africa? BMC Public Health 2012, 12(1):741. BioMed Central Full Text
  • [4]Jahn A, Floyd S, Crampin AC, Mwaungulu F, Mvula H, Munthali F, McGrath N, Mwafilaso J, Mwinuka V, Mangongo B, Fine PEM, Zaba B, Glynn JR: Population-level effect of HIV on adult mortality and early evidence of reversal after introduction of antiretroviral therapy in Malawi. Lancet 2008, 371(9624):1603-1611.
  • [5]Herbst AJ, Mafojane T, Newell ML: Verbal autopsy-based cause-specific mortality trends in rural KwaZulu-Natal, South Africa, 2000–2009. Popul Health Metrics 2011, 9(1):47. BioMed Central Full Text
  • [6]Bor J, Herbst AJ, Newell M-L, Bärnighausen T: Increases in adult life expectancy in rural South Africa: valuing the scale-up of HIV treatment. Science 2013, 339(6122):961-965.
  • [7]Jutte DP, Roos LL, Brownell MD: Administrative record linkage as a tool for public health research. Annu Rev Public Health 2011, 32:91-108.
  • [8]Holian J, Mallick MJ, Zaremba CM: Maternity and infant care, race and birth outcomes. J Health Soc Policy 2004, 18(4):1-11.
  • [9]Holian J: Live birth and infant death record linkage. J Health Soc Policy 2000, 12(1):1-10.
  • [10]Lyons R, Jones K, John G, Brooks C, Verplancke J-P, Ford D, Brown G, Leake K: The SAIL databank: linking multiple health and social care datasets. BMC Med Inform Decis Mak 2009, 9(1):3. BioMed Central Full Text
  • [11]Amin J, Law MG, Bartlett M, Kaldor JM, Dore GJ: Causes of death after diagnosis of hepatitis B or hepatitis C infection: a large community-based linkage study. Lancet 2006, 368(9539):938-945.
  • [12]Falster K, Wand H, Donovan B, Anderson J, Nolan D, Watson K, Watson J, Law MG: Hospitalizations in a cohort of HIV patients in Australia, 1999–2007. AIDS 2010, 24(9):1329.
  • [13]Holman CAJ, Bass AJ, Rouse IL, Hobbs MS: Population-based linkage of health records in Western Australia: development of a health services research linked database. Aust N Z J Public Health 2008, 23(5):453-459.
  • [14]Franceschi S, Dal Maso L, Arniani S, Crosignani P, Vercelli M, Simonato L, Falcini F, Zanetti R, Barchielli A, Serraino D, Rezza G: Risk of cancer other than Kaposi’s sarcoma and non-Hodgkin’s lymphoma in persons with AIDS in Italy. Cancer and AIDS Registry Linkage Study. Br J Cancer 1998, 78(7):966-970.
  • [15]Chen J, Fair M, Wilkins R, Cyr M: Maternal education and fetal and infant mortality in Quebec. Fetal and Infant Mortality Study Group of the Canadian Perinatal Surveillance System. Health Reports/Statistics Canada, Canadian Centre for Health Information 1998, 10(2):53-64.
  • [16]Tromp M, Meray N, Ravelli ACJ, Reitsma JB, Bonsel GJ: Medical record linkage of anonymous registries without validated sample linkage of the Dutch perinatal registries. Stud Health Tech Informat 2005, 116:125-130.
  • [17]Chard T, Penney G, Chalmers J: The risk of neonatal death in relation to birth weight and maternal hypertensive disease in infants born at 24–32 weeks. Eur J Obstet Gynecol Reprod Biol 2001, 95(1):114-118.
  • [18]Cole S: History of fingerprint pattern recognition. In Automatic Fingerprint Recognition Systems. Edited by Ratha N, Bollle R. New York: Springer; 2004:1-25.
  • [19]Kahn K, Collinson MA, Gómez-Olivé FX, Mokoena O, Twine R, Mee P, Afolabi SA, Clark BD, Kabudula CW, Khosa A, Khoza S, Shabangu MG, Silaule B, Tibane JB, Wagner RG, Garenne ML, Clark SJ, Tollman SM: Profile: Agincourt Health and Socio-demographic Surveillance System. Int J Epidemiol 2012, 41(4):988-1001.
  • [20]Kahn K, Tollman SM, Collinson MA, Clark SJ, Twine R, Clark BD, Shabangu M, Gomez-Olive FX, Mokoena O, Garenne ML: Research into health, population and social transitions in rural South Africa: Data and methods of the Agincourt Health and Demographic Surveillance System. Scand J Publ Health 2007, 35(69 suppl):8-20.
  • [21]Serwaa-Bonsu A, Herbst AJ, Reniers G, Ijaa W, Clark B, Kabudula C, Sankoh O: First experiences in the implementation of biometric technology to link data from Health and Demographic Surveillance Systems with health facility data. Glob Health Action 2010, 3:2120.
  • [22]CSIR: Using ICT to support rural clinics in managing chronic lifestyle diseases. In Sciencescope. Pretoria: CSIR; 2009:57-58.
  • [23]MorphoSmart Overview [https://www.yumpu.com/en/document/view/10855857/morphosmarttm-overview webcite]
  • [24]Li B, Quan H, Fong A, Lu M: Assessing record linkage between health care and Vital Statistics databases using deterministic methods. BMC Health Serv Res 2006, 6(1):48. BioMed Central Full Text
  • [25]Machado CJ: A literature review of record linkage procedures focusing on infant health outcomes. Cadernos de Saúde Pública 2004, 20:362-371.
  • [26]Maso LD, Braga C, Franceschi S: Methodology used for software for automated linkage in Italy (SALI). Comput Biomed Res 2001, 34(6):395.
  • [27]Victor TW, Mera RM: Record linkage of health care insurance claims. J Am Med Inform Assoc 2001, 8(3):281-288.
  • [28]Howe GR: Use of computerized record linkage in cohort studies. Epidemiol Rev 1998, 20(1):112-121.
  • [29]Beauchamp A, Tonkin AM, Kelsall H, Sundararajan V, English DR, Sundaresan L, Wolfe R, Turrell G, Giles GG, Peeters A: Validation of de-identified record linkage to ascertain hospital admissions in a cohort study. BMC Med Res Methodol 2011, 11(1):42. BioMed Central Full Text
  • [30]Cook L, Olson L, Dean J: Probabilistic record linkage: relationships between file sizes, identifiers, and match weights. Methods Inf Med 2001, 40(3):196-203.
  • [31]Jaro MA: Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. J Am Stat Assoc 1989, 84(406):414-420.
  • [32]Jaro MA: Probabilistic linkage of large public health data files. Stat Med 1995, 14(5–7):491-498.
  • [33]Nitsch D, Morton S, DeStavola BL, Clark H, Leon DA: How good is probabilistic record linkage to reconstruct reproductive histories? Results from the Aberdeen children of the 1950 s study. BMC Med Res Methodol 2006, 6(1):15. BioMed Central Full Text
  • [34]Clark D: Practical introduction to record linkage for injury research. Inj Prev 2004, 10(3):186.
  • [35]Pacheco AG, Saraceni V, Tuboi SH, Moulton LH, Chaisson RE, Cavalcante SC, Durovni B, Faulhaber JC, Golub JE, King B, Schechter M, Harrison LH: Validation of a hierarchical deterministic record-linkage algorithm using data from 2 different cohorts of human immunodeficiency virus-infected persons and mortality databases in Brazil. Am J Epidemiol 2008, 168(11):1326-1332.
  • [36]Rosman DL: The feasibility of linking hospital and police road crash casualty records without names. Accid Anal Prev 1996, 28(2):271-274.
  • [37]Winkler WE: String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. In Proceedings of the Section on Survey Research Methods. Alexandria: American Statistical Association; 1990:354-359.
  • [38]Philips L: The double metaphone search algorithm. C Plus Plus Users J 2000, 18(6):38-43.
  • [39]Méray N, Reitsma JB, Ravelli AC, Bonsel GJ: Probabilistic record linkage is a valid and transparent tool to combine databases without a patient identification number. J Clin Epidemiol 2007, 60(9):883-e881.
  • [40]Grannis SJ, Overhage JM, Hui S, McDonald CJ: Analysis of a probabilistic record linkage technique without human review. J Am Med Informat Assoc 2003, 2003:259-263.
  • [41]Herzog TN, Scheuren F, Winkler WE: Data quality and record linkage techniques. Heidelberg: Springer; 2007.
  • [42]Fellegi IP, Sunter AB: A theory for record linkage. J Am Stat Assoc 1969, 64(328):1183-1210.
  • [43]Tromp M, Ravelli AC, Bonsel GJ, Hasman A, Reitsma JB: Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage. J Clin Epidemiol 2011, 64(5):565-572.
  • [44]Karmel R, Rosman D: Linkage of health and aged care service events: comparing linkage and event selection methods. BMC Health Serv Res 2008, 8(1):149. BioMed Central Full Text
  • [45]Blakely T, Salmond C: Probabilistic record linkage and a method to calculate the positive predictive value. Int J Epidemiol 2002, 31(6):1246-1252.
  • [46]Bentley JP, Ford JB, Taylor LK, Irvine KA, Roberts CL: Investigating linkage rates among probabilistically linked birth and hospitalization records. BMC Med Res Methodol 2012, 12(1):149. BioMed Central Full Text
  • [47]Megan B, Damien J, Vijaya S, Sue E, David P, Ian S, Caroline B: Data Linkage: A powerful research tool with potential problems. BMC Health Serv Res 2010, 10:346. BioMed Central Full Text
  • [48]Kolenikov S, Angeles G: Socioeconomic status measurement with discrete proxy variables: Is principal component analysis a reliable answer? Rev Income Wealth 2009, 55(1):128-165.
  • [49]SimMetrics [http://sourceforge.net/projects/simmetrics webcite]
  • [50]Implement Phonetic ("Sounds-like") Name Searches with Double Metaphone Part V: NET Implementation. [http://www.codeproject.com/Articles/4624/Implement-Phonetic-quot-Sounds-like-quot-Name-Sear webcite]
  • [51]Snae C: A comparison and analysis of name matching algorithms. Int J Appl Sci Eng Technol 2007, 4(1):252-257.
  文献评价指标  
  下载次数:21次 浏览次数:29次