Red Teaming Language Models with Language Models

2022 (modified: 24 Apr 2023), EMNLP 2022, Readers: Everyone
Ethan Perez, Saffron Huang, Francis Song, Trevor Cai, Roman Ring, John Aslanides, Amelia Glaese, Nat McAleese, Geoffrey Irving. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022.