BiasEdit: Debiasing Stereotyped Language Models via Model Editing

Published: 22 Sept 2025, Last Modified: 03 Jan 2026
Venue: WiML @ NeurIPS 2025
License: CC BY 4.0
Keywords: debias, large language model, social bias
Submission Number: 155