Abstract: With rapidly growing sizes of online social networks, computational challenges arise in analyzing the diffusion process over networks. Sampling methods are commonly used to study the cascade effect and estimate users’ influence. In this paper, we propose a brand-new sampling method, called 2-hop+ sampling for quickly and accurately estimating the cascade size generated by a set of seed users under the independent cascade model. Our method generates only samples with at least one 2-hop live path from the source to reduce the number of samples. We further enhance the sampling efficiency of our method by a <inline-formula><tex-math notation="LaTeX">$\mathsf{SkipEdge}$</tex-math></inline-formula> technique. Moreover, we improve the generalized stopping rule algorithm to obtain an <inline-formula><tex-math notation="LaTeX">$(\varepsilon,\delta)$</tex-math></inline-formula> -estimate of the mean of random variables with fewer samples needed. Extensive experiments with real-world datasets show that our techniques can significantly improve the estimation efficiency compared to the state-of-the-art methods.
0 Replies
Loading