期刊论文详细信息
International Journal of Health Geographics
Geographic bias related to geocoding in epidemiologic studies
Linda W Pickle1  Fern R Hauck2  Mir Siadaty2  Kevin A Matthews3  M Norman Oliver2 
[1] Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, USA;Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA;Department of Family Medicine, University of Virginia, Charlottesville, VA, USA
关键词: geographic information systems;    epidemiology;    confounding factors;    bias (epidemiology);   
Others  :  1148575
DOI  :  10.1186/1476-072X-4-29
 received in 2005-11-02, accepted in 2005-11-10,  发布年份 2005
PDF
【 摘 要 】

Background

This article describes geographic bias in GIS analyses with unrepresentative data owing to missing geocodes, using as an example a spatial analysis of prostate cancer incidence among whites and African Americans in Virginia, 1990–1999. Statistical tests for clustering were performed and such clusters mapped. The patterns of missing census tract identifiers for the cases were examined by generalized linear regression models.

Results

The county of residency for all cases was known, and 26,338 (74%) of these cases were geocoded successfully to census tracts. Cluster maps showed patterns that appeared markedly different, depending upon whether one used all cases or those geocoded to the census tract. Multivariate regression analysis showed that, in the most rural counties (where the missing data were concentrated), the percent of a county's population over age 64 and with less than a high school education were both independently associated with a higher percent of missing geocodes.

Conclusion

We found statistically significant pattern differences resulting from spatially non-random differences in geocoding completeness across Virginia. Appropriate interpretation of maps, therefore, requires an understanding of this phenomenon, which we call "cartographic confounding."

【 授权许可】

   
2005 Oliver et al; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150404170726317.pdf 1250KB PDF download
Figure 4. 52KB Image download
Figure 3. 94KB Image download
Figure 2. 48KB Image download
Figure 1. 94KB Image download
【 图 表 】

Figure 1.

Figure 2.

Figure 3.

Figure 4.

【 参考文献 】
  • [1]Ricketts TC: Geographic Information Systems and Public Health. Annu Rev Public Health 2003, 24:1-6.
  • [2]McLafferty SL: GIS and Health Care. Annu Rev Public Health 2003, 24:25-42.
  • [3]Rushton G: Public Health, GIS, and Spatial Analytic Tools. Annu Rev Public Health 2003, 24:43-56.
  • [4]Krieger N, Chen JT, Waterman PD, Soobader MJ, Subramanian SV, Carson R: Geocoding and monitoring of U.S. socioeconomic inequalities in mortality and cancer incidence: Does the choice of area-based measure and geographic level matter? The Public Health Disparities Geocoding Project. Am J Epidemiol 2002, 156:471-482.
  • [5]GK S, BA M, B H, Edwards BK: Area socioeconomic variations in U.S. cancer incidence, mortality, stage, treatment, and survival, 1975-1999. Volume NIH Pub No 03-5417. Bethesda, MD, National Cancer Institute; 2003. [NCI Cancer Surveillance Monograph Series, No. 4]
  • [6]Krieger N, Chen JT, Waterman PD, Rehkopf DH, Subramanian SV: Painting a truer picture of U.S. socioeconomic and racial/ethnic health inequalities: The Public Health Disparities Geocoding Project. Am J Public Health 2005, 95:312-323.
  • [7]Pickle LW, Feuer EJ, Edwards BK: Prediction of incident cancer cases in non-SEER counties. ASA Proceedings of the Biometrics Section 2000.
  • [8]Pickle LW, Feuer EJ, Edwards BK: U.S. predicted cancer incidence, 1999: Complete maps by county and state from spatial projection models. NCI Cancere Surveillance Monograph Series No. 5. Bethesda, MD, National Cancer Institute, NIH Publication No. 03-5435; 2003.
  • [9]McElroy JA, Remington PL, Trentham-Dietz A, Robert SA, Newcomb PA: Geocoding addresses from a large population-based study: lessons learned. Epidemiology 2003, 14:399-407.
  • [10]Boscoe FP, Kielb MS, Schymura MJ, Bolani TM: Assessing and improving census tract completeness. J Registry Management 2002, 29:117-123.
  • [11]Krieger N, Waterman PD, Lemieux K, Zierler S, Hogan JW: On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research. Am J Public Health 2001, 91:1114-1116.
  • [12]Krieger N, Waterman PD, Chen JT, Soobader MJ, Subramanian SV, Carson R: Zip code caveat: Bias due to spatitemporal mismatches between zip codes and U.S. census-defined geographic areas -- The Public Health Disparities Geocoding Project. Am J Public Health 2002, 92:1100-1102.
  • [13]Cayo MR, Talbot TO: Positional error in automated geocoding of residential addresses. Int J Health Geogr 2003, 2:10. BioMed Central Full Text
  • [14]Bonner MR, Han D, Nie J, Rogerson P, Vena JE, Freudenheim JL: Positional accuracy of geocoded addresses in epidemiologic research. Epidemiology 2003, 14:408-412.
  • [15]Rushton G: Selecting appropriate geocoding methods for cancer control and prevention program activities. [http:/ / www.uiowa.edu/ ~gishlth/ giswkshp/ GCD_Rushton_files/ frame.htm#slide0001.htm] webcite 2005.
  • [16]Whitsel EA, Rose KM, Wood JL, Henley AC, Liao D, Heiss G: Accuracy and repeatability of commercial geocoding. Am J Epidemiol 2004, 160:1023-1029.
  • [17]Rothman KJ, Greenland S: Modern epidemiology.. Second edition. Philadelphia, PA, Lippincott Williams & Wilkins; 1998.
  • [18]Vach W: Some issues in estimating the effect of prognostic factors from incomplete covariate data. Stat Med 1997, 16:57-72.
  • [19]EK C, McLafferty SL: GIS and public health.. New York, The Guilford Press; 2002.
  • [20]Waller LA, Gotway CA: Applied spatial statistics for public health data.. Hoboken, NJ, John Wiley & Sons, Inc.; 2004.
  • [21]Gregorio DI, Cromley E, Mrozinski R, Walsh SJ: Subject loss in spatial analysis of breast cancer. Health Place 1999, 5:173-177.
  • [22]Oliver MN, Smith E, Siadaty M, Hauck FR, Pickle LW: A spatial analysis of prostate cancer incidence and race in Virginia, 1990-1999. Am J Prev Med 2005.
  • [23]Gregorio DI, Dechello LM, Samociuk H, Kulldorff M: Lumping or splitting: seeking the preferred areal unit for health geography studies. Int J Health Geogr 2005, 4:6. BioMed Central Full Text
  • [24]Beyers N, Gie RP, Zietsman HL, Kunneke M, Hauman J, Tatley M, Donald PR: The use of a geographical information system (GIS) to evaluate the distribution of tuberculosis in a high-incidence community. S Afr Med J 1996, 86:40-1, 44.
  • [25]Miles-Doan R, Kelly S: Geographic concentration of violence between intimate partners. Public Health Rep 1997, 112:135-141.
  • [26]Rushton G, Lolonis P: Exploratory spatial analysis of birth defect rates in an urban population. Stat Med 1996, 15:717-726.
  • [27]Hurley SE, Saunders TM, Nivas R, Hertz A, Reynolds P: Post office box addresses: a challenge for geographic information system-based studies. Epidemiology 2003, 14:386-391.
  • [28]Kravets N, Hadden WC: The accuracy of address coding and the effects of coding errors. Health Place 2005.
  • [29]Klassen AC, Curriero FC, Hong JH, Williams C, Kulldorff M, Meissner HI, Alberg A, Ensminger M: The role of area-level influences on prostate cancer grade and stage at diagnosis. Prev Med 2004, 39:441-448.
  • [30]PA L, MF G, DJ M, DW R: Geographic information systems and science. New York, John Wiley & Sons; 2001.
  • [31]Centers for Disease Control and Prevention (CDC): Behavioral Risk Factor Surveillance System Survey Data.. Atlanta, GA, U.S. Department of Health and Human Services, Centers for Disease Control and Prevention; 2000.
  • [32]North American Association of Central Cancer Registries: Cancer in North America, 1988-2002. Appendix C: Indicators of data quality for all participating registries. [http:/ / www.naaccr.org/ index.asp?Col_Sectionkey=11&Col_Con tentID=50=12&Col_ContentID=54] webcite 2005.
  • [33]Fleiss JL: Statistical methods for rates and proportions.. New York, NY, John Wiley and Sons; 1981.
  • [34]Mungiole M, Pickle LW, Simonson KH: Application of a weighted headbanging algorithm to mortality data maps. Statistics in Medicine 1999, 18:3201-3209.
  • [35]LR C: Simultaneous statistical inference in the normal multiple linear regression model. JASA 1973, 68:457-461.
  • [36]Tango T: A test for spatial disease clustering adjusted for multiple testing. Statistics in Medicine 2000, 19:191-204.
  • [37]Kuldorff M, Nagarwalla N: Spatial disease clusters: detection and inference. Statistics in Medicine 1995, 14:799-810.
  • [38]Kuldorff M: A spatial scan statistic. Communications in Statistics Theory and Methods 1997, 26:1481-1496.
  文献评价指标  
  下载次数:26次 浏览次数:23次