WeChat Toxic Article Detection: A Data-Driven Machine Learning Approach

Published: 2018, Last Modified: 21 Jan 2026APSIPA 2018EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Recently, toxic information detection has attracted tremendous amounts of research interest because of the popularity of social networks and the widespread of toxic information which may have dire consequences to the public. Existing work extensively studies toxic article detection in open social networks from information diffusion perspective. However, in closed social networks as exemplified by WeChat Moments (WM), the diffusion process is uneasily visible. To tackle the toxic article detection problem in closed social networks, in this paper we empirically study the articles spread in WM which is based on the largest Chinese social platform WeChat. In particular, we systematically analyze users' behavior and text information of normal and toxic articles and identify a striking difference between them. Furthermore, we design a new model named MAT-LSTM which can well capture the impact of different kinds of text information. To improve the performance of automatic toxic article detection, we propose XMATL framework which is enhanced from MAT-LSTM and can utilize text information and users' behavior characteristics in a holistic manner. We conduct extensive experiments using two real-world datasets and demonstrate that our proposed model can effectively detect toxic articles in WM and achieve outstanding performance gain over the classic methods.
Loading