Detecting Misclassification Errors in Neural Networks with a Gaussian Process Model

28 Sept 2020 (modified: 05 May 2023) · ICLR 2021 Conference Blind Submission · Readers: Everyone
Keywords: Neural Network Classifier, Error Detection, AI safety
Abstract: As neural network classifiers are deployed in real-world applications, it is crucial that their predictions are not just accurate, but trustworthy as well. One practical solution is to assign confidence scores to each prediction and then filter out low-confidence predictions. However, existing confidence metrics are not yet sufficiently reliable for this role. This paper presents a new framework that produces more reliable confidence scores for detecting misclassification errors. This framework, RED, calibrates the classifier's inherent confidence indicators and estimates the uncertainty of the calibrated confidence scores using Gaussian Processes. Empirical comparisons with other confidence estimation methods on 125 UCI datasets demonstrate that this approach is effective. An experiment on a vision task with a large deep learning architecture further confirms that the method can scale up, and a case study involving out-of-distribution and adversarial samples shows the potential of the proposed method to improve the robustness of neural network classifiers more broadly in the future.
One-sentence Summary: This paper presents a Gaussian-Process-based framework that generates more reliable confidence scores for detecting misclassification errors in neural network classifiers.
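To illustrate the general idea described in the abstract, the sketch below fits a Gaussian Process to the residuals between a classifier's raw confidence (max softmax probability) and its actual correctness, then uses the GP's predictive mean to calibrate the confidence and its predictive standard deviation as an uncertainty estimate. This is a minimal, hypothetical sketch, not the authors' implementation: the function names, the use of penultimate-layer embeddings as GP inputs, and the kernel choice are illustrative assumptions.

```python
# Illustrative sketch (assumptions noted above), using scikit-learn's GP regressor.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

def fit_error_detector(features, softmax_conf, correct):
    """Fit a GP on residuals (correctness - raw confidence) over input features.

    features:     (n, d) array of inputs, e.g. penultimate-layer embeddings (assumed)
    softmax_conf: (n,) max softmax probability for each prediction
    correct:      (n,) 1.0 if the prediction was correct, else 0.0
    """
    residuals = correct - softmax_conf              # gap between raw confidence and truth
    kernel = RBF(length_scale=1.0) + WhiteKernel(noise_level=0.1)
    gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
    gp.fit(features, residuals)
    return gp

def calibrated_confidence(gp, features, softmax_conf):
    """Return calibrated confidence (raw + predicted residual) and GP uncertainty."""
    mean, std = gp.predict(features, return_std=True)
    return softmax_conf + mean, std
```

In this sketch, predictions whose calibrated confidence is low, or whose GP standard deviation is high, would be flagged as likely misclassifications and filtered out.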
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
Reviewed Version (pdf): https://openreview.net/references/pdf?id=ZKafB3QhUT