OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Carlos E. Jimenez
Researcher, Anthropic
Joined
November 2021
Names
Carlos E. Jimenez
(Preferred)
,
Carlos E Jimenez
Emails
****@princeton.edu
(Confirmed)
,
****@gmail.com
(Confirmed)
,
****@icloud.com
(Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
ORCID
Semantic Scholar
Career & Education History
Researcher
Anthropic
(anthropic.com)
2025
–
Present
PhD student
Computer Science,
Princeton University
(princeton.edu)
2020
–
2025
Advisors, Relations & Conflicts
PhD Advisor
Karthik Narasimhan
2020
–
2025
Expertise
multi-modal learning
,
deep learning
,
machine learning
,
natural language processing
,
computer vision
,
transformers
,
computer use
,
cua
,
computer use agents
,
browser use
Present
Publications
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
,
Carlos E Jimenez
,
Shunyu Yao
,
Nick Haber
,
Diyi Yang
,
Karthik R Narasimhan
XLLM-Reason-Plan
Readers:
Everyone
IMPersona: Evaluating Individual Level LM Impersonation
Quan Shi
,
Carlos E Jimenez
,
Stephen Dong
,
Brian Seo
,
Caden Yao
,
Adam Kelch
,
Karthik R Narasimhan
COLM 2025 Workshop SoLaR Poster
Readers:
Everyone
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
,
Carlos E Jimenez
,
Shunyu Yao
,
Nick Haber
,
Diyi Yang
,
Karthik R Narasimhan
NeurIPS 2025 poster
Readers:
Everyone
SWE-smith: Scaling Data for Software Engineering Agents
John Yang
,
Kilian Lieret
,
Carlos E Jimenez
,
Alexander Wettig
,
Kabir Khandpur
,
Yanzhe Zhang
,
Binyuan Hui
,
Ofir Press
,
Ludwig Schmidt
,
Diyi Yang
NeurIPS 2025 Datasets and Benchmarks Track spotlight
Readers:
Everyone
IMPersona: Evaluating Individual Level LLM Impersonation
Quan Shi
,
Carlos E Jimenez
,
Stephen Dong
,
Brian Seo
,
Caden Yao
,
Adam Kelch
,
Karthik R Narasimhan
COLM 2025
Readers:
Everyone
EnIGMA: Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities
Talor Abramovich
,
Meet Udeshi
,
Minghao Shao
,
Kilian Lieret
,
Haoran Xi
,
Kimberly Milner
,
Sofija Jancheska
,
John Yang
,
Carlos E Jimenez
,
Farshad Khorrami
,
Prashanth Krishnamurthy
,
Brendan Dolan-Gavitt
,
Muhammad Shafique
,
Karthik R Narasimhan
,
Ramesh Karri
,
Ofir Press
ICML 2025 poster
Readers:
Everyone
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
John Yang
,
Carlos E Jimenez
,
Alex L Zhang
,
Kilian Lieret
,
Joyce Yang
,
Xindi Wu
,
Ori Press
,
Niklas Muennighoff
,
Gabriel Synnaeve
,
Karthik R Narasimhan
,
Diyi Yang
,
Sida Wang
,
Ofir Press
ICLR 2025 Poster
Readers:
Everyone
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
John Yang
,
Carlos E Jimenez
,
Alexander Wettig
,
Kilian Lieret
,
Shunyu Yao
,
Karthik R Narasimhan
,
Ofir Press
NeurIPS 2024 poster
Readers:
Everyone
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
John Yang
,
Carlos E. Jimenez
,
Alexander Wettig
,
Kilian Lieret
,
Shunyu Yao
,
Karthik Narasimhan
,
Ofir Press
CoRR 2024
Readers:
Everyone
SWE-bench: Can Language Models Resolve Real-world Github Issues?
Carlos E. Jimenez
,
John Yang
,
Alexander Wettig
,
Shunyu Yao
,
Kexin Pei
,
Ofir Press
,
Karthik R. Narasimhan
ICLR 2024
Readers:
Everyone
View all 24 publications
Co-Authors
Adam Kelch
Alex L Zhang
Alexander Wettig
Ameet Deshpande
Ashwin Kalyan
Binyuan Hui
Brendan Dolan-Gavitt
Brian Seo
Caden Yao
Danqi Chen
Diyi Yang
Farshad Khorrami
Gabriel Synnaeve
Haoran Xi
Howard Chen
Izhak Shafran
John Yang
Joyce Yang
Kabir Khandpur
Karthik Narasimhan
Karthik R Narasimhan
Karthik R. Narasimhan
Kexin Pei
Kilian Lieret
Kimberly Milner
View all 50 co-authors