OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Andrew Gritsevskiy
PhD student, Department of Computer Science, University of Wisconsin - Madison
Principal Researcher, Contramont Research
Principal Researcher, Cavendish Labs, Co.
Joined
May 2023
Names
Andrew Gritsevskiy
(Preferred)
,
Andrew George Gritsevskiy
Emails
****@cavendishlabs.org
(Confirmed)
,
****@cs.wisc.edu
(Confirmed)
,
****@contramont.org
(Confirmed)
,
****@promontorylabs.com
(Confirmed)
,
****@mit.edu
(Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
ORCID
LinkedIn
Semantic Scholar
Career & Education History
PhD student
Department of Computer Science, University of Wisconsin - Madison
(cs.wisc.edu)
2024
–
Present
Principal Researcher
Contramont Research
(contramont.org)
2024
–
Present
Principal Researcher
Cavendish Labs, Co.
(cavendishlabs.org)
2022
–
Present
Undergrad student
University of Toronto
(utoronto.ca)
2019
–
2023
Intern
Hospital for Sick Children
(sickkids.ca)
2022
–
2022
Researcher
Institute for Advanced Research in Artificial Intelligence
(iarai.ac.at)
2021
–
2021
Undergrad student
University of California, Los Angeles
(ucla.edu)
2018
–
2019
Intern
Massachusetts Institute of Technology
(mit.edu)
2016
–
2018
Intern
Harvard University
(harvard.edu)
2016
–
2017
Advisors, Relations & Conflicts
Coauthor
Andis Draguns
2024
–
Present
Coauthor
Sumeet Ramesh Motwani
2024
–
Present
Coauthor
Jeffrey Ladish
2024
–
Present
Coauthor
Derik Kauffman
2022
–
Present
Coauthor
Joe Cavanagh
2022
–
Present
Coauthor
Aaron Kirtland
2022
–
Present
Coworker
Michael Kopp
2021
–
2021
Expertise
Reinforcement learning
2016
–
Present
Machine learning
2015
–
Present
Artificial intelligence
2015
–
Present
Publications
Unprobeable Backdoors: Evading Runtime Detection in Transformers
Andis Draguns
,
Andrew Gritsevskiy
,
Erik Jenner
ICLR 2026 Conference Withdrawn Submission
Readers:
Everyone
SmileyLlama: Modifying Large Language Models \\for Directed Chemical Space Exploration
Joe Cavanagh
,
Kunyang Sun
,
Andrew Gritsevskiy
,
Dorian Bagni
,
Teresa Head-Gordon
,
Thomas D. Bannister
AIDrugX Poster
Readers:
Everyone
Unelicitable Backdoors via Cryptographic Transformer Circuits
Andis Draguns
,
Andrew Gritsevskiy
,
Sumeet Ramesh Motwani
,
Christian Schroeder de Witt
NeurIPS 2024 poster
Readers:
Everyone
SmileyLlama: Modifying Large Language Models for Directed Chemical Space Exploration
Joseph M. Cavanagh
,
Kunyang Sun
,
Andrew Gritsevskiy
,
Dorian Bagni
,
Thomas D. Bannister
,
Teresa Head-Gordon
CoRR 2024
Readers:
Everyone
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
Andrew Gritsevskiy
,
Arjun Panickssery
,
Aaron Kirtland
,
Derik Kauffman
,
Hans Gundlach
,
Irina Gritsevskaya
,
Joe Cavanagh
,
Jonathan Chiang
,
Lydia La Roux
,
Michelle Hung
CoRR 2024
Readers:
Everyone
Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Andis Draguns
,
Andrew Gritsevskiy
,
Sumeet Ramesh Motwani
,
Charlie Rogers-Smith
,
Jeffrey Ladish
,
Christian Schröder de Witt
CoRR 2024
Readers:
Everyone
Forecasting the future of artificial intelligence with machine learning-based link prediction in an exponentially growing knowledge network
Mario Krenn
,
Lorenzo Buffoni
,
Bruno C. Coutinho
,
Sagi Eppel
,
Jacob Gates Foster
,
Andrew Gritsevskiy
,
Harlin Lee
,
Yichao Lu
,
João P. Moutinho
,
Nima Sanjabi
,
Rishi Sonthalia
,
Ngoc Mai Tran
,
Francisco Valente
,
Yangxinyu Xie
,
Rose Yu
,
Michael Kopp
Nat. Mac. Intell. 2023
Readers:
Everyone
Inverse Scaling: When Bigger Isn't Better
Ian R. McKenzie
,
Alexander Lyzhov
,
Michael Martin Pieler
,
Alicia Parrish
,
Aaron Mueller
,
Ameya Prabhu
,
Euan McLean
,
Xudong Shen
,
Joe Cavanagh
,
Andrew George Gritsevskiy
,
Derik Kauffman
,
Aaron T. Kirtland
,
Zhengping Zhou
,
Yuhui Zhang
,
Sicong Huang
,
Daniel Wurgaft
,
Max Weiss
,
Alexis Ross
,
Gabriel Recchia
,
Alisa Liu
et al. (6 additional authors not shown)
Accepted by TMLR
Readers:
Everyone
Inverse Scaling: When Bigger Isn't Better
Ian R. McKenzie
,
Alexander Lyzhov
,
Michael Pieler
,
Alicia Parrish
,
Aaron Mueller
,
Ameya Prabhu
,
Euan McLean
,
Aaron Kirtland
,
Alexis Ross
,
Alisa Liu
,
Andrew Gritsevskiy
,
Daniel Wurgaft
,
Derik Kauffman
,
Gabriel Recchia
,
Jiacheng Liu
,
Joe Cavanagh
,
Max Weiss
,
Sicong Huang
,
The Floating Droid
,
Tom Tseng
et al. (7 additional authors not shown)
Trans. Mach. Learn. Res. 2023
Readers:
Everyone
Capsule networks for low-data transfer learning
Andrew Gritsevskiy
,
Maksym Korablyov
CoRR 2018
Readers:
Everyone
Co-Authors
Aaron Kirtland
Aaron Mueller
Aaron T. Kirtland
Alexander Lyzhov
Alexis Ross
Alicia Parrish
Alisa Liu
Ameya Prabhu
Andis Draguns
Arjun Panickssery
Bruno C. Coutinho
Charlie Rogers-Smith
Christian Schroeder de Witt
Christian Schröder de Witt
Daniel Wurgaft
Derik Kauffman
Dorian Bagni
Erik Jenner
Ethan Perez
Euan McLean
Francisco Valente
Gabriel Recchia
Hans Gundlach
Harlin Lee
Ian R. McKenzie
View all 62 co-authors