OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Ramón Fernandez Astudillo
Researcher, International Business Machines
Joined
November 2018
Names
Ramón Fernandez Astudillo
(Preferred)
,
Ramon Fernandez Astudillo
Emails
****@astudillo.com
(Confirmed)
,
****@ibm.com
(Confirmed)
Personal Links
Homepage
Google Scholar
Semantic Scholar
ACL Anthology
Career & Education History
Researcher
International Business Machines
(ibm.com)
2019
–
Present
Researcher
Unbabel
(unbabel.com)
2016
–
2018
Postdoc
INESC-ID
(inesc-id.pt)
2010
–
2018
PhD student
TU Berlin
(tu-berlin.de)
2006
–
2010
Advisors, Relations & Conflicts
Coworker
Andre Martins
2016
–
2018
Postdoc Advisor
Isabel Trancoso
2010
–
2016
PhD Advisor
Dorothea Kolossa
2006
–
2010
Expertise
large language models
,
retrieve and generate
,
reinforcement learning
,
distillation
2006
–
2023
Publications
Do LLMs Benefit From Their Own Words?
Jenny Y. Huang
,
Leshem Choshen
,
Ramón Fernandez Astudillo
,
Tamara Broderick
,
Jacob Andreas
ICLR 2026 Workshop MemAgents
Readers:
Everyone
Learning Efficient Latent Reasoning with Abstract Chain-of-Thought
Keshav Ramji
,
Tahira Naseem
,
Ramón Fernandez Astudillo
LIT Workshop @ ICLR 2026
Readers:
Everyone
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
,
Md Arafat Sultan
,
Aashka Trivedi
,
Yousef El-Kurdi
,
Tahira Naseem
,
Radu Florian
,
Salim Roukos
Submitted to ICLR 2026
Readers:
Everyone
Improved Sampling from Masked Diffusion Models with Position Contrastive Guidance
Dhruvesh Patel
,
Tahira Naseem
,
Gaurav Pandey
,
Md Arafat Sultan
,
Andrew McCallum
,
Ramón Fernandez Astudillo
SPIGM @ NeurIPS
Readers:
Everyone
Latent Principle Discovery for Language Model Self-Improvement
Keshav Ramji
,
Tahira Naseem
,
Ramón Fernandez Astudillo
NeurIPS 2025 poster
Readers:
Everyone
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
,
Md Arafat Sultan
,
Aashka Trivedi
,
Yousef El-Kurdi
,
Tahira Naseem
,
Radu Florian
,
Salim Roukos
Submitted to NeurIPS 2025
Readers:
Everyone
A Codespace Autoencoder for Language Tasks
Celine Lee
,
Md Arafat Sultan
,
Tahira Naseem
,
Alexander M Rush
,
Ramón Fernandez Astudillo
ICLR 2025 Conference Withdrawn Submission
Readers:
Everyone
Sampling Language from Latent System 2 Reasoning
Celine Lee
,
Md Arafat Sultan
,
Tahira Naseem
,
Alexander M Rush
,
Ramón Fernandez Astudillo
Sys2-Reasoning Poster
Readers:
Everyone
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
Gaurav Pandey
,
Yatin Nandwani
,
Tahira Naseem
,
Mayank Mishra
,
Guangxuan Xu
,
Dinesh Raghu
,
Sachindra Joshi
,
Asim Munawar
,
Ramón Fernandez Astudillo
ICML 2024 Poster
Readers:
Everyone
Ensemble-Instruct: Instruction Tuning Data Generation with a Heterogeneous Mixture of LMs
Young-Suk Lee
,
Md Arafat Sultan
,
Yousef El-Kurdi
,
Tahira Naseem
,
Asim Munawar
,
Radu Florian
,
Salim Roukos
,
Ramón Fernandez Astudillo
EMNLP 2023 Findings
Readers:
Everyone
View all 16 publications
Co-Authors
Aashka Trivedi
Alexander M Rush
Andrew McCallum
Asim Munawar
Celine Lee
Dhruvesh Patel
Dinesh Raghu
Dzung T. Phan
Gabriele Picco
Gaurav Pandey
Guangxuan Xu
Hoang Thanh Lam
Jacob Andreas
Janaki Sheth
Jenny Y. Huang
Jiawei Zhou
Keshav Ramji
Lam M. Nguyen
Leshem Choshen
Manuel Mager
Mayank Mishra
Md Arafat Sultan
Radu Florian
Revanth Gangi Reddy
Sachindra Joshi
View all 35 co-authors