OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Luke Bailey
PhD student, Computer Science, Stanford University
Joined
May 2023
Names
Luke Bailey
(Preferred)
,
Luke James Bailey
Emails
****@college.harvard.edu
(Confirmed)
,
****@stanford.edu
(Confirmed)
Personal Links
Google Scholar
DBLP
Career & Education History
PhD student
Computer Science,
Stanford University
(stanford.edu)
2024
–
2029
Undergrad student
Computer Science,
Harvard University
(harvard.edu)
2019
–
2024
Intern
University of California, Berkeley
(berkeley.edu)
2023
–
2023
Advisors, Relations & Conflicts
PhD Advisor
Carlos Guestrin
Present
Coauthor
Weiwei Pan
Present
Coauthor
Siddharth Swaroop
Present
Coauthor
Stuart Russell
Present
Coauthor
Scott Emmons
Present
Coauthor
Euan Ong
Present
Coauthor
Glenn Ko
Present
Coauthor
Yuji Chai
Present
Coauthor
Yaniv Yacoby
Present
Coauthor
Finale Doshi-Velez
Present
Coauthor
Gustaf Ahdritz
Present
Coauthor
Anat Kleiman
Present
Coauthor
Sam Toyer
Present
Coauthor
Justin Svegliato
Present
Coauthor
Olivia Watkins
Present
Expertise
No areas of expertise listed
Publications
Neural Chameleons: Language Models Can Learn to Hide Their Thoughts from Unseen Activation Monitors
Max McGuinness
,
Alex Serrano
,
Luke Bailey
,
Scott Emmons
ICLR 2026 Trustworthy AI
Readers:
Everyone
Neural Chameleons: Language Models Can Learn to Hide Their Thoughts from Unseen Activation Monitors
Max McGuinness
,
Alex Serrano
,
Luke Bailey
,
Scott Emmons
ICLR 2026 Conference Desk Rejected Submission
Readers:
Everyone
Obfuscated Activations Bypass LLM Latent-Space Defenses
Luke Bailey
,
Alex Serrano
,
Abhay Sheshadri
,
Mikhail Seleznyov
,
Jordan Taylor
,
Erik Jenner
,
Jacob Hilton
,
Stephen Casper
,
Carlos Guestrin
,
Scott Emmons
ICLR 2026 Poster
Readers:
Everyone
Practical Principles for AI Cost and Compute Accounting
Stephen Casper
,
Luke Bailey
,
Tim Schreier
ICML 2025 Workshop TAIG Poster
Readers:
Everyone
The AI Agent Index
Stephen Casper
,
Luke Bailey
,
Rosco Hunter
,
Carson Ezell
,
Emma Cabalé
,
Michael Gerovitch
,
Stewart Slocum
,
Kevin Wei
,
Nikola Jurkovic
,
Ariba Khan
,
Phillip J. K. Christoffersen
,
A. Pinar Ozisik
,
Rakshit Trivedi
,
Dylan Hadfield-Menell
,
Noam Kolt
CoRR 2025
Readers:
Everyone
Practical Principles for AI Cost and Compute Accounting
Stephen Casper
,
Luke Bailey
,
Tim Schreier
CoRR 2025
Readers:
Everyone
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer
,
Dan Valentine
,
Luke Bailey
,
James Chua
,
Cristobal Eyzaguirre
,
Zane Durante
,
Joe Benton
,
Brando Miranda
,
Henry Sleight
,
Tony Tong Wang
,
John Hughes
,
Rajashree Agrawal
,
Mrinank Sharma
,
Scott Emmons
,
Sanmi Koyejo
,
Ethan Perez
ICLR 2025 Poster
Readers:
Everyone
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer
,
Dan Valentine
,
Luke Bailey
,
James Chua
,
Zane Durante
,
Cristobal Eyzaguirre
,
Joe Benton
,
Brando Miranda
,
Henry Sleight
,
Tony Tong Wang
,
John Hughes
,
Rajashree Agrawal
,
Mrinank Sharma
,
Scott Emmons
,
Sanmi Koyejo
,
Ethan Perez
Red Teaming GenAI Workshop @ NeurIPS'24 Oral
Readers:
Everyone
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer
,
Dan Valentine
,
Luke Bailey
,
James Chua
,
Zane Durante
,
Cristobal Eyzaguirre
,
Joe Benton
,
Brando Miranda
,
Henry Sleight
,
Tony Tong Wang
,
John Hughes
,
Rajashree Agrawal
,
Mrinank Sharma
,
Scott Emmons
,
Sanmi Koyejo
,
Ethan Perez
SoLaR Spotlight
Readers:
Everyone
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Rylan Schaeffer
,
Dan Valentine
,
Luke Bailey
,
James Chua
,
Cristobal Eyzaguirre
,
Zane Durante
,
Joe Benton
,
Brando Miranda
,
Henry Sleight
,
Tony Tong Wang
,
John Hughes
,
Rajashree Agrawal
,
Mrinank Sharma
,
Scott Emmons
,
Sanmi Koyejo
,
Ethan Perez
NeurIPS 2024 Workshop RBFM Oral
Readers:
Everyone
View all 26 publications
Co-Authors
A. Pinar Ozisik
Abhay Sheshadri
Alan Ritter
Alex Serrano
Anat Kleiman
Ariba Khan
Brando Miranda
Carlos Guestrin
Carson Ezell
Cristobal Eyzaguirre
Cristóbal Eyzaguirre
Dan Valentine
David Brooks
Dylan Hadfield-Menell
Emma Cabalé
Erik Jenner
Ethan Adrian Mendes
Ethan Perez
Euan Ong
Finale Doshi-Velez
Glenn G. Ko
Glenn Ko
Gu-Yeon Wei
Gustaf Ahdritz
H. Kung
View all 65 co-authors