Article Information

Title: Asymptotic seed bias in respondent-driven sampling
Authors: Yuling Yan; Bret Hanlon; Sebastien Roch; et al.
Journal: Electronic Journal of Statistics
Print ISSN: 1935-7524
Publication year: 2020
Volume: 14
Issue: 1
Pages: 1577-1610
DOI: 10.1214/20-EJS1698
Language: English
Publisher: Institute of Mathematical Statistics
Abstract: Respondent-driven sampling (RDS) collects a sample of individuals in a networked population by incentivizing the sampled individuals to refer their contacts into the sample. This iterative process is initialized from some seed node(s). Sometimes, this selection creates a large amount of seed bias. Other times, the seed bias is small. This paper gains a deeper understanding of this bias by characterizing its effect on the limiting distribution of various RDS estimators. Using classical tools and results from multi-type branching processes [12], we show that the seed bias is negligible for the Generalized Least Squares (GLS) estimator and non-negligible for both the inverse probability weighted and Volz-Heckathorn (VH) estimators. In particular, we show that (i) above a critical threshold, VH converges to a non-trivial mixture distribution, where the mixture component depends on the seed node, and the mixture distribution is possibly multi-modal. Moreover, (ii) GLS converges to a Gaussian distribution independent of the seed node, under a certain condition on the Markov process. Numerical experiments with both simulated data and empirical social networks suggest that these results appear to hold beyond the Markov conditions of the theorems.
Keywords: Limit distribution; Galton-Watson process; Volz-Heckathorn estimator