Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models

Qin Liu, Chao Shang, Ling Liu, Nikolaos Pappas, Jie Ma, Neha Anna John, Srikanth Doss, Lluis Marquez, Miguel Ballesteros, Yassine Benajiba

Published: 01 Jan 2025, Last Modified: 26 Jan 2026University of California Publication Management SystemEveryoneRevisionsBibTeXCC BY-SA 4.0
Loading