A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models

A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models

ACL ARR 2024 June Submission2919 Authors

15 Jun 2024 (modified: 24 Jul 2024)ACL ARR 2024 June SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract: Vision-language models (VLMs) have gained widespread adoption in both industry and academia. In this study, we propose a unified framework for systematically evaluating gender, race, and age biases in VLMs with respect to professions. Our evaluation encompasses all supported inference modes of the recent VLMs, including image-to-text, text-to-text, text-to-image, and image-to-image. We create a synthetic, high-quality dataset comprising text and images that intentionally obscure gender, race, and age distinctions across various professions. The dataset includes action-based descriptions of each profession and serves as a benchmark for evaluating societal biases in vision-language models (VLMs). In our benchmarking of popular vision-language models (VLMs), we observe that different input-output modalities result in distinct bias magnitudes and directions. We hope our work will help guide future progress in improving VLMs to learn socially unbiased representations. We will release our data and code.

Paper Type: Long

Research Area: Ethics, Bias, and Fairness

Research Area Keywords: social bias benchmark, vision-language-models, unified framework for evaluation

Contribution Types: NLP engineering experiment, Data resources

Languages Studied: English

Submission Number: 2919

Loading