<h1 align="center">SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing</h1>
 
<div align="center">

  <a href="https://huggingface.co/datasets/SafeEditor/MR-SafeEdit"><img src="https://img.shields.io/static/v1?label=Dataset&message=HuggingFace&color=yellow"></a> &ensp;
  <a href="https://huggingface.co/SafeEditor/SafeEditor"><img src="https://img.shields.io/static/v1?label=Model&message=HuggingFace&color=yellow"></a> &ensp;
</div>


## Abstract
With the rapid advancement of text-to-image (T2I) models, ensuring their safety has become increasingly critical. Existing safety approaches can be categorized into training-time and inference-time methods. While inference-time methods are widely adopted due to their cost-effectiveness, they often suffer from limitations such as over-refusal, degraded generation quality, and failure to capture users' intent. To address these issues, we propose a multi-round image-text interleaved safety editing framework that serves as a plug-and-play module, compatible with any T2I model to achieve efficient safety alignment. Leveraging powerful understanding and generation capabilities of a unifed Multimodal Large Language Model (MLLM), our method identifies safe outputs while preserving semantic intent. Experimental results demonstrate that, compared to filter-based approaches, our method reduces over-refusal and achieves a better balance between utility and safety. Furthermore, it outperforms safety-guidance or prompt-modification methods in maintaining safety while adhering more faithfully to user instructions.

## Getting Started
### Installation
1. Install [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio.git) from source:
```shell
git clone https://github.com/modelscope/DiffSynth-Studio.git
cd DiffSynth-Studio
pip install -e .
```
2. Install requirements
```
pip install -r requirements.txt
```
3. Install [ms-swift](https://github.com/modelscope/ms-swift.git).
```
pip install ms-swift -U
```
### Prepare models
```shell
python download_models.py
python download_dataset.py
```



### SafeEditor Inference
```shell
bash scripts/inference.sh
```

### Prepare MR-SafeEdit for training



### SafeEditor Training
```shell
bash scripts/train_llm_edit.sh
```

