Hidden Population Estimation with Indirect Inference and Auxiliary Information

Published: 26 Apr 2024, Last Modified: 13 Jun 2024, UAI 2024 spotlight, CC BY 4.0
Keywords: Respondent Driven Sampling, Indirect Inference, Networks
TL;DR: We use indirect inference and auxiliary information to improve hidden population size estimation from samples taken using Respondent Driven Sampling.
Abstract: Many populations defined by illegal or stigmatized behavior are difficult to sample using conventional survey methodology. Respondent Driven Sampling (RDS) is a participant referral process frequently employed in this context to collect information. This sampling methodology can be modeled as a stochastic process that explores the graph of a social network, generating a partially observed subgraph between study participants. The methods currently used to impute the missing edges in this subgraph exhibit biased downstream estimation. We leverage auxiliary participant information and concepts from indirect inference to ameliorate these issues and improve estimation of the hidden population size. These advances result in smaller bias and higher precision in the estimation of the study participant arrival rate, the sample subgraph, and the population size. Lastly, we use our method to estimate the number of People Who Inject Drugs (PWID) in the Kohtla-Jarve region of Estonia.
Supplementary Material: zip
List Of Authors: Weltz, Justin and Laber, Eric and Volfovsky, Alexander
Submission Number: 583