期刊论文详细信息
BMC Medical Informatics and Decision Making
Use of name recognition software, census data and multiple imputation to predict missing data on ethnicity: application to cancer registry records
Research Article
Sally Vernon1  Sue Wilson2  Ronan Ryan3  Gill Lawrence4 
[1] Eastern Cancer Registration and Information Centre, Public Health Building, Unit C - Magog Court, Shelford Bottom, Hinton Way, CB22 3AD, Cambridge, UK;Primary Care Clinical Sciences, University of Birmingham, B15 2TT, Birmingham, UK;Public Health, Epidemiology and Biostatistics, University of Birmingham, B15 2TT, Birmingham, UK;West Midlands Cancer Intelligence Unit, Public Health Building, University of Birmingham, B15 2TT, Birmingham, UK;
关键词: Ethnic Group;    Positive Predictive Value;    National Health Service;    Imputation Model;    Hospital Episode Statistics;   
DOI  :  10.1186/1472-6947-12-3
 received in 2011-05-18, accepted in 2012-01-23,  发布年份 2012
来源: Springer
PDF
【 摘 要 】

BackgroundInformation on ethnicity is commonly used by health services and researchers to plan services, ensure equality of access, and for epidemiological studies. In common with other important demographic and clinical data it is often incompletely recorded. This paper presents a method for imputing missing data on the ethnicity of cancer patients, developed for a regional cancer registry in the UK.MethodsRoutine records from cancer screening services, name recognition software (Nam Pehchan and Onomap), 2001 national Census data, and multiple imputation were used to predict the ethnicity of the 23% of cases that were still missing following linkage with self-reported ethnicity from inpatient hospital records.ResultsThe name recognition software were good predictors of ethnicity for South Asian cancer cases when compared with data on ethnicity derived from hospital inpatient records, especially when combined (sensitivity 90.5%; specificity 99.9%; PPV 93.3%). Onomap was a poor predictor of ethnicity for other minority ethnic groups (sensitivity 4.4% for Black cases and 0.0% for Chinese/Other ethnic groups). Area-based data derived from the national Census was also a poor predictor non-White ethnicity (sensitivity: South Asian 7.4%; Black 2.3%; Chinese/Other 0.0%; Mixed 0.0%).ConclusionsCurrently, neither method for assigning individuals to an ethnic group (name recognition and ethnic distribution of area of residence) performs well across all ethnic groups. We recommend further development of name recognition applications and the identification of additional methods for predicting ethnicity to improve their precision and accuracy for comparisons of health outcomes. However, real improvements can only come from better recording of ethnicity by health services.

【 授权许可】

CC BY   
© Ryan et al; licensee BioMed Central Ltd. 2012

【 预 览 】
附件列表
Files Size Format View
RO202311095464849ZK.pdf 302KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  文献评价指标  
  下载次数:1次 浏览次数:0次