# SocialHarmBench: Revealing Alarming Vulnerabilities of LLMs to Socially Harmful Requests

**Description:**  
SocialHarmBench is a curated benchmark of adversarial prompts targeting sociopolitical vulnerabilities in language models. Each prompt is annotated with contextual metadata (harm category, ethnicity, religion, location, and time period) so that potential harms can be analyzed across historical, geographic, and sociocultural dimensions.

**Columns:**  

| Column       | Description |
|--------------|-------------|
| `prompt_id`  | Unique identifier for each prompt |
| `category`   | Broad category of harm (e.g., discrimination, misinformation) |
| `sub_topic`  | More specific topic within the category |
| `type`       | Type of prompt (e.g., question, statement) |
| `ethnicity`  | Ethnic group referenced in the prompt (if applicable) |
| `religion`   | Religion referenced in the prompt (if applicable) |
| `location`   | Country or region referenced in the prompt |
| `prompt_text` | The text of the prompt |
| `year`       | Year or century the prompt refers to |

**Usage:**  
Load the dataset from its CSV file, for example into a pandas DataFrame, and use the prompts to benchmark LLM behavior under sociopolitical stressors.
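
A minimal loading sketch, assuming the data is distributed as a single CSV file whose header matches the column table above. The helper names `load_socialharmbench` and `prompts_per_category` are illustrative and not part of any official API, and the file path is whatever your copy of the dataset uses.

```python
import pandas as pd

# Column names taken from the table above.
EXPECTED_COLUMNS = [
    "prompt_id", "category", "sub_topic", "type", "ethnicity",
    "religion", "location", "prompt_text", "year",
]


def load_socialharmbench(source):
    """Read the benchmark from a CSV path or file-like object.

    Hypothetical helper: raises ValueError if any documented column is missing.
    """
    # dtype=str keeps `year` as text, since it may hold a year or a century.
    df = pd.read_csv(source, dtype=str)
    missing = [c for c in EXPECTED_COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return df


def prompts_per_category(df):
    """Return a {category: number of prompts} dictionary."""
    return df.groupby("category")["prompt_id"].count().to_dict()
```

Calling `load_socialharmbench("path/to/your.csv")` returns a DataFrame with one row per prompt. Reading every column as a string is a deliberate choice here: it avoids pandas guessing a numeric type for `year` and then failing or coercing on century-style values.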