OpenReview.net
  • Login

Nikhil Saxena

Researcher, Safeguards, Anthropic

  • Joined January 2025

Names

Nikhil Saxena (Preferred)
  • Suggest Name

Emails

****@anthropic.com (Confirmed)
,
****@gmail.com (Confirmed)
,
****@stripe.com
,
****@yelp.com
,
****@alumni.duke.edu (Confirmed)
  • Suggest Email

Personal Links

LinkedIn
  • Suggest URL

Career & Education History

Researcher
Safeguards, Anthropic (anthropic.com)
2024 – Present
 
  • Suggest Position

Advisors, Relations & Conflicts

No relations added

  • Suggest Relation

Expertise

AI Safety
Present
 
  • Suggest Expertise

Publications

  • Constitutional Classifiers++: Production-Grade Defenses against Universal Jailbreaks

    Hoagy Cunningham, Jerry Wei, Zihan Wang, Andrew Persic, Alwin Peng, Jordan Abderrachid, Raj Agarwal, Bobby Chen, Andy Dau, Alek Dimitriev, Logan Howard, Yijin Hua, Rob Gilson, Mu Lin, Christopher Liu, Vladimir Mikulik, Rohit Mittapalli, Clare O'Hara, Jin Pan, Nikhil Saxena et al. (7 additional authors not shown)
    • ICLR 2026 Poster
    • Readers: Everyone

Co-Authors

  • Alek Dimitriev
  • Alex Silverstein
  • Alwin Peng
  • Andrew Persic
  • Andy Dau
  • Bobby Chen
  • Christopher Liu
  • Clare O'Hara
  • Ethan Perez
  • Giulio Zhou
  • Hoagy Cunningham
  • Jan Leike
  • Jared Kaplan
  • Jerry Wei
  • Jin Pan
  • Jordan Abderrachid
  • Logan Howard
  • Mrinank Sharma
  • Mu Lin
  • Raj Agarwal
  • Rob Gilson
  • Rohit Mittapalli
  • Vladimir Mikulik
  • Yijin Hua
  • Yue Song
View all 26 co-authors
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Sponsors
  • Donate
  • FAQ
  • Terms of Use / Privacy Policy
  • News
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • News
  • FAQ
  • Contact
  • Donate
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2026 OpenReview