On Calibration of Modern Neural Networks

Chuan Guo; Geoff Pleiss; Yu Sun; Kilian Q. Weinberger

doi:10.48550/arxiv.1706.04599

On Calibration of Modern Neural Networks

Chuan Guo(Cornell University), Geoff Pleiss(Cornell University), Yu Sun(Cornell University), Kilian Q. Weinberger(Cornell University)

arXiv (Cornell University)

June 14, 2017

10.48550/arxiv.1706.04599

Cited by 1,727Open Access

Full Text

Abstract

Confidence calibration -- the problem of predicting probability estimates representative of the true correctness likelihood -- is important for classification models in many applications. We discover that modern neural networks, unlike those from a decade ago, are poorly calibrated. Through extensive experiments, we observe that depth, width, weight decay, and Batch Normalization are important factors influencing calibration. We evaluate the performance of various post-processing calibration methods on state-of-the-art architectures with image and document classification datasets. Our analysis and experiments not only offer insights into neural network learning, but also provide a simple and straightforward recipe for practical settings: on most datasets, temperature scaling -- a single-parameter variant of Platt Scaling -- is surprisingly effective at calibrating predictions.

Related Papers

No related papers found

Powered by citation graph analysis