Revisiting the Calibration of Modern Neural Networks

Matthias Minderer; Josip Djolonga; Rob Romijnders; Frances Ann Hubis; Xiaohua Zhai; Neil Houlsby; Dustin Tran; Mario Lucic

Revisiting the Calibration of Modern Neural Networks

Matthias Minderer, Josip Djolonga, Rob Romijnders, Frances Ann Hubis, Xiaohua Zhai, Neil Houlsby, Dustin Tran, Mario Lucic

Published: 09 Nov 2021, Last Modified: 04 May 2025NeurIPS 2021 PosterReaders: Everyone

Keywords: uncertainty, calibration, image classification

TL;DR: We study how model size, architecture and training affect calibration and show that current SOTA models do not follow past trends.

Abstract: Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

Supplementary Material: pdf

Code: https://github.com/google-research/robustness_metrics/tree/master/robustness_metrics/projects/revisiting_calibration

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 3 code implementations](https://www.catalyzex.com/paper/revisiting-the-calibration-of-modern-neural/code)

8 Replies

Loading