Towards Effective Counter-Responses: Aligning Human Preferences with Strategies to Combat Online Trolling

Towards Effective Counter-Responses: Aligning Human Preferences with Strategies to Combat Online Trolling

ACL ARR 2024 April Submission775 Authors

16 Apr 2024 (modified: 06 May 2024)ACL ARR 2024 April SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract: Trolling in online communities typically involves disruptive behaviors such as provoking anger and manipulating discussions, leading to a polarized atmosphere and emotional distress. Robust moderation is essential for mitigating these negative impacts and maintaining a healthy and constructive community atmosphere. However, effectively addressing trolls is difficult because their behaviors vary widely and require different response strategies (RSs) to counter them. This diversity makes it challenging to choose an appropriate RS for each specific situation. To address this challenge, our research investigates whether humans have preferred strategies tailored to different types of trolling behaviors. Our findings reveal a correlation between the types of trolling encountered and the preferred RS. In this paper, we introduce a methodology that recommends an appropriate RS for various trolling behaviors. This approach is supported by a dataset we constructed, which aligns these strategies with user preferences. This enables the generation of effective counter-responses by recommending the most appropriate strategies based on these preferences. The experimental results demonstrate that our proposed approach improves discussion quality and reduces the negative effects of trolls, thereby enhancing the online community environment.

Paper Type: Short

Research Area: Computational Social Science and Cultural Analytics

Research Area Keywords: Trolling, Counter Response Generation, Reddit, Social media, Online communities, Moderation

Contribution Types: NLP engineering experiment

Languages Studied: English

Submission Number: 775

Loading