OpenReview.net
  • Login

Martín Soto

Pronouns: he/him

  • Joined September 2024

Names

Martín Soto (Preferred)
  • Suggest Name

Emails

****@gmail.com (Confirmed)
  • Suggest Email

Personal Links

Google Scholar
  • Suggest URL

Career & Education History

MS student
Mathematics, Universitat de Barcelona (ub.edu)
2022 – 2024
 
  • Suggest Position

Advisors, Relations & Conflicts

No relations added

  • Suggest Relation

Expertise

No areas of expertise listed

  • Suggest Expertise

Publications

  • Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

    Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
    • ICLR 2025 FM-Wild Workshop
    • Readers: Everyone
  • Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

    Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
    • ICML 2025 oral
    • Readers: Everyone
  • Language Models Can Articulate Their Implicit Goals

    Jan Betley, Xuchan Bao, Martín Soto, Anna Sztyber-Betley, James Chua, Owain Evans
    • SafeGenAi Poster
    • Readers: Everyone
  • Tell me about yourself: LLMs are aware of their learned behaviors

    Jan Betley, Xuchan Bao, Martín Soto, Anna Sztyber-Betley, James Chua, Owain Evans
    • ICLR 2025 Spotlight
    • Readers: Everyone

Co-Authors

  • Anna Sztyber-Betley
  • Daniel Chee Hian Tan
  • James Chua
  • Jan Betley
  • Nathan Labenz
  • Niels Warncke
  • Owain Evans
  • Xuchan Bao
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Terms of Use
  • Privacy Policy
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Contact
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview