Exploring Artificial Image Generation for Stance Detection

Exploring Artificial Image Generation for Stance Detection

ACL ARR 2025 February Submission832 Authors

11 Feb 2025 (modified: 09 May 2025)ACL ARR 2025 February SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract: Stance detection is a task aimed at identifying and analyzing the author's stance from textual data. Previous studies primarily rely on analyzing the text itself, which may not fully capture the implicit stance conveyed by the author. To address this limitation, we propose a novel approach that involves transforming the original text into an artificially generated image and using this visual representation to aid in stance detection. Our approach begins by employing a large vision-language model to generate potential images for a given text. These images are carefully crafted to adhere to three specific criteria: relevance to the text, consistency with the target of the stance, and consistency with the stance itself. Next, we introduce a comprehensive evaluation framework to select the optimal image from the set of generated candidates. Once the optimal image has been selected, we introduce a multimodal stance detection model that leverages both the original textual content and the generated image to identify the author's stance. The experimental results demonstrate the effectiveness of our proposed approach, and also indicates the importance of artificially generated images for stance detection.

Paper Type: Long

Research Area: Sentiment Analysis, Stylistic Analysis, and Argument Mining

Research Area Keywords: Stance Detection, Artificial Image Generation, Multimodal Learning

Contribution Types: NLP engineering experiment

Languages Studied: English

Submission Number: 832

Loading