Lawrence Chan (Preferred)
Lawrence Chan
Researcher, Model Evaluation and Threat Research
Joined
September 2019
Names
Emails
****@berkeley.edu (Confirmed)
****@metr.org (Confirmed)
Personal Links
Career & Education History
Researcher
Model Evaluation and Threat Research (metr.org)
2022 – Present
PhD student
University of California Berkeley (berkeley.edu)
2018 – 2024
Undergrad student
Department of Computer and Information Science, School of Engineering and Applied Science (cis.upenn.edu)
2014 – 2018
Advisors, Relations & Conflicts
Expertise
Mechanistic interpretability
, reverse engineering
, interpretability
2022 – Present
Adversarial training
, language models
2021 – Present
Inverse reinforcement learning
, value learning
, human robot interaction
2018 – Present
Publications
Co-Authors
- Aaron Ho
- Adam Scherlis
- Alex Gibson
- Amy Deng
- Anca D. Dragan
- Anca Dragan
- Andrew Critch
- Aron Lajko
- Avik Jain
- Ben Weinstein-Raun
- Ben West
- Benjamin Weinstein-Raun
- Bilal Chughtai
- Brian Goodrich
- Buck Shlegeris
- Chun Hei Yip
- Daniel M Ziegler
- Daniel M. Ziegler
- Daniel S. Brown
- Daniel Ziegler
- Daniel de Haas
- David Rein
- Dmitrii Krasheninnikov
- Dmitry Vaintrob
- Dylan Hadfield-Menell