期刊论文详细信息
BMC Medical Research Methodology
Estimating the size of hidden populations from register data
Peter Wennberg1  Anders Ledberg1 
[1] Centre for Social Research on Alcohol and Drugs, SoRAD, Stockholm University, SE-10691 Stockholm, Sweden
关键词: Mortality;    Heroin;    Opiates;    Truncated Poisson;    Capture-recapture;    Hidden population;    Prevalence;   
Others  :  866349
DOI  :  10.1186/1471-2288-14-58
 received in 2014-01-14, accepted in 2014-04-15,  发布年份 2014
PDF
【 摘 要 】

Background

Prevalence estimates of drug use, or of its consequences, are considered important in many contexts and may have substantial influence over public policy. However, it is rarely possible to simply count the relevant individuals, in particular when the defining characteristics might be illegal, as in the drug use case. Consequently methods are needed to estimate the size of such partly ‘hidden’ populations, and many such methods have been developed and used within epidemiology including studies of alcohol and drug use. Here we introduce a method appropriate for estimating the size of human populations given a single source of data, for example entries in a health-care registry.

Methods

The setup is the following: during a fixed time-period, e.g. a year, individuals belonging to the target population have a non-zero probability of being “registered”. Each individual might be registered multiple times and the time-points of the registrations are recorded. Assuming that the population is closed and that the probability of being registered at least once is constant, we derive a family of maximum likelihood (ML) estimators of total population size. We study the ML estimator using Monte Carlo simulations and delimit the range of cases where it is useful. In particular we investigate the effect of making the population heterogeneous with respect to probability of being registered.

Results

The new estimator is asymptotically unbiased and we show that high precision estimates can be obtained for samples covering as little as 25% of the total population size. However, if the total population size is small (say in the order of 500) a larger fraction needs to be sampled to achieve reliable estimates. Further we show that the estimator give reliable estimates even when individuals differ in the probability of being registered. We also compare the ML estimator to an estimator known as Chao’s estimator and show that the latter can have a substantial bias when applied to epidemiological data.

Conclusions

The population size estimator suggested herein complements existing methods and is less sensitive to certain types of dependencies typical in epidemiological data.

【 授权许可】

   
2014 Ledberg and Wennberg; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20140727071104852.pdf 751KB PDF download
25KB Image download
34KB Image download
35KB Image download
34KB Image download
36KB Image download
77KB Image download
【 图 表 】

【 参考文献 】
  • [1]UNO: Tungt Narkotikamissbruk: en Totalundersökning 1979 [Heavy Drug Abuse: A Complete Population Stud 1979]. Stockholm: Socialdepartementet; 1980.
  • [2]Olsson B, Adamsson Warhen C, Byqvist S: Det Tunga Narkotikabrukets Omfattning i Sverige 1998. Stockholm: CAN; 2001.
  • [3]European Monitoring Centre for Drugs and Drug Addiction: Annual Report 2012. The State of the Drug Problem in Europe. Luxembourg: Publications Office of the European Union; 2012.
  • [4]Hook E: Capture recapture methods in epidemiology: methods and limitations. Epidemiol Rev 1995, 17(2):243-264.
  • [5]Yip P, Bruno G, Tajima N, Seber G, Buckland S, Cormack R, Unwin N, Chang Y, Fienberg S, Junker B, LaPorte R, Libman I, McCarty D: Capture-recapture and multiple-record systems estimation II applications in human-diseases. Am J Epidemiol 1995, 142(10):1059-1068.
  • [6]Chao A, Tsay P, Lin S, Shau W, Chao D: The applications of capture-recapture models to epidemiological data. Stat Med 2001, 20(20):3123-3157.
  • [7]Frischer M, Leyland A, Cormack R, Goldberg D, Bloor M, Green S, Taylor A, Covell R, McKeganey N, Platt S: Estimating the population prevalence of injection-drug use and infection with human-immunodeficiency-virus among injection-drug users in Glasgow, Scotland. Am J Epidemiol 1993, 138(3):170-181.
  • [8]Domingo-Salvany A, Hartnoll R, Maguire A, Brugal M, Albertin P, Cayla J, Casabona J, Suelves J: Analytical considerations in the use of capture-recapture to estimate prevalence: case studies of the estimation of opiate use in the metropolitan area og Barcelona, Spain. Am J Epidemiol 1998, 148(8):732-740.
  • [9]Chao A: Estimating the population-size for capture recapture data with unequal catchability. Biometrics 1987, 43(4):783-791.
  • [10]Zelterman D: Robust estimation in truncated discrete-distributions with application to capture recapture experiments. J Stat Plan Infer 1988, 18(2):225-237.
  • [11]Hay G, Smit F: Estimating the number of drug injectors from needle exchange data. Addict Res Theory 2003, 11(4):235-243.
  • [12]Bohning D, Suppawattanabodee B, Kusolvisitkul W, Viwatwongkasem C: Estimating the number of drug users in Bangkok 2001 : a capture-recapture approach using repeated entries in one list. Eur J Epidemiol 2004, 19(12):1075-1083.
  • [13]Cormack R: Problems with using capture-recapture in epidemiology: an example of a measles epidemic. J Clin Epidemiol 1999, 52(10):909-914.
  • [14]Hayne D: Two methods for estimating population from trapping records. J Mammal 1949, 30(4):399-411.
  • [15]Moran P: A mathematical theory of animal trapping. Biometrika 1951, 38(3–4):307-311.
  • [16]Zippin C: An evaluation of the removal method of estimating animal populations. Biometrics 1956, 12(2):163-189.
  • [17]Chao A: Nonparametric estimation of the number of classes in a population. Scand J Statist 1984, 11:265-270.
  • [18]Seber G, Whale J: The removal method for two and three samples. Biometrics 1970, 26(3):393-400.
  • [19]Feller W: An Introduction to Probability Theory and Its Applications. Stockholm: Wiley; 1968.
  文献评价指标  
  下载次数:53次 浏览次数:38次