OpenReview.net
Jose Hernandez-Orallo
Full Professor, Universitat Politecnica de Valencia
Director of Research, University of Cambridge
Joined September 2018
Names
Jose Hernandez-Orallo (Preferred), José Hernández-Orallo
Emails
****@dsic.upv.es (Confirmed), ****@upv.es (Confirmed), ****@gmail.com (Confirmed), ****@cam.ac.uk (Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
ORCID
Semantic Scholar
ACL Anthology
Career & Education History
Full Professor, Universitat Politecnica de Valencia (upv.es), 2017 – Present
Director of Research, University of Cambridge (cam.ac.uk), 2017 – Present
Advisors, Relations & Conflicts
Coauthor: Peter Flach, 1999 – Present
Coauthor: Karina Vold, 1997 – Present
Coauthor: Lexin Zhou, 2020 – 2026
Postdoc Advisee: John Burden, 2020 – 2025
Postdoc Advisee: Ryan Burnell, 2022 – 2024
Expertise
Large language models, 2020 – Present
data science, 2004 – Present
evaluation metrics, 2000 – Present
machine learning, 1999 – Present
AI evaluation, 1997 – Present
philosophy of AI, 1995 – Present
Publications
Relative Drawing Identification Complexity Is Invariant to Modality in Vision-Language Models
Diogo Freitas, Brigt Håvardstun, Darío Garigliotti, Jan Arne Telle, Cèsar Ferri, José Hernández-Orallo
Crossref
Readers: Everyone
LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games
Paulina Morillo, Alex Terreros, Cèsar Ferri, José Hernández-Orallo
Crossref
Readers: Everyone
AI Impact on Human Proof Formalization Workflows
Katherine M. Collins, Simon Frieder, Jonas Bayer, Jacob Loader, Jeck Lim, Peiyang Song, Fabian Zaiser, Lexin Zhou, Shanda Li, Shi-Zhuo Looi, Jose Hernandez-Orallo, Joshua B. Tenenbaum, Cameron Freer, Umang Bhatt, Adrian Weller, Valerie Chen, Ilia Sucholutsky
MATH-AI 2025 Poster
Readers: Everyone
From Human-Level AI Tales to AI Levelling Human Scales
Peter Romero, Zachary R. Tidler, Fernando Martínez-Plumed, Matthieu Tehenan, Sipeng Chen, Álvaro David Gómez Antón, Luning Sun, Manuel Cebrian, Lexin Zhou, Yael Moros-Daval, Daniel Romero-Alvarado, Felix Marti-Perez, Kevin Wei, Jose Hernandez-Orallo
Submitted to ICLR 2026
Readers: Everyone
Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models
Stephen Fitz, Peter Romero, Steven Basart, Sipeng Chen, Jose Hernandez-Orallo
ICLR 2026 Conference Withdrawn Submission
Readers: Everyone
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden, Konstantinos Voudouris, Ryan Burnell, Danaja Rutar, Lucy G Cheke, Jose Hernandez-Orallo
Submitted to ICLR 2026
Readers: Everyone
11Plus-Bench: Demystifying Multimodal LLM Spatial Reasoning with Cognitive-Inspired Analysis
Chengzu Li, Wenshan Wu, Huanyu Zhang, Qingtao Li, Zeyu Gao, Yan Xia, Jose Hernandez-Orallo, Ivan Vulić, Furu Wei
Submitted to ICLR 2026
Readers: Everyone
A Framework for the Categorisation of General-Purpose AI Models under the EU AI Act
Lorenzo Pacchiardi, John Burden, Fernando Martínez-Plumed, Jose Hernandez-Orallo, Emilia Gomez, David Fernández-Llorca
RegML 2025 Poster
Readers: Everyone
Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents
Irene Testini, Lorenzo Pacchiardi, Jose Hernandez-Orallo
Accepted by TMLR
Readers: Everyone
Beyond Benchmarks: Evaluating Generalist Medical Artificial Intelligence With Psychometrics
Luning Sun, Christopher Gibbons, José Hernández-Orallo, Xiting Wang, Liming Jiang, David Stillwell, Fang Luo, Xing Xie
Journal of Medical Internet Research
Readers: Everyone
Co-Authors
Aarohi Srivastava
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
Abulhair Saparov
Adam Fisch
Adam R. Brown
Adam Santoro
Aditya Gupta
Adolfo Martínez Usó
Adrian Weller
Adrià Garriga-Alonso
Agnieszka Kluska
Aitor Lewkowycz
Akash Kundu
Akbir Khan
Akshat Agarwal
Alan Chan
Alan F. T. Winfield
Alan Winfield
Aleksandar Petrov
Alethea Power
Alex Ray
Alex Terreros