Improving Neural Network Accuracy and Calibration Under Distributional Shift with Prior Augmented Data

Jeffrey Ryan Willette; Juho Lee; Sung Ju Hwang

28 Sept 2020 (modified: 05 May 2023) | ICLR 2021 Conference Blind Submission | Readers: Everyone

Keywords: Bayesian, Calibration

Abstract: Neural networks have proven successful at learning from complex data distributions by acting as universal function approximators. However, neural networks are often overconfident in their predictions, which leads to inaccurate and miscalibrated probabilistic predictions. The problem of overconfidence becomes especially apparent in cases where the test-time data distribution differs from that which was seen during training. We propose a solution to this problem by seeking out regions in arbitrary feature space where the model is unjustifiably overconfident, and conditionally raising the entropy of those predictions towards that of the Bayesian prior on the distribution of the labels. Our method results in a better calibrated network and is agnostic to the underlying model structure, so it can be applied to any neural network which produces a probability density as an output. We demonstrate the effectiveness of our method and validate its performance on both classification and regression problems by applying it to the training of recent state-of-the-art neural network models.

One-sentence Summary: We propose a method of training existing neural network models which results in better calibrated probabilistic outputs.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Reviewed Version (pdf): https://openreview.net/references/pdf?id=HwfXPwNTXi
