Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning
Nathaniel Li
,
Alexander Pan
,
Anjali Gopal
,
Summer Yue
,
Daniel Berrios
,
Alice Gatti
,
Justin D. Li
,
Ann-Kathrin Dombrowski
,
Shashwat Goel
,
Gabriel Mukobi
,
Nathan Helm-Burger
,
Rassin Lababidi
,
Lennart Justen
,
Andrew B. Liu
,
Michael Chen
,
Isabelle Barrass
,
Oliver Zhang
,
Xiaoyuan Zhu
,
Rishub Tamirisa
,
Bhrugu Bharathi
et al. (26 additional authors not shown)
Published: 01 Jan 2024, Last Modified: 14 May 2025
ICML 2024
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading