Low-Cost Bayesian Methods for Fixing Neural Networks' Overconfidence

URI: http://hdl.handle.net/10900/135535
http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-1355355
http://dx.doi.org/10.15496/publikation-76886
Document type: PhD thesis
Date: 2023-01-20
Language: English
Faculty: 7 Mathematisch-Naturwissenschaftliche Fakultät
Department: Informatik
Advisor: Hennig, Philipp (Prof. Dr.)
Day of Oral Examination: 2023-01-13
DDC Classification: 004 - Data processing and computer science
Keywords: Machine Learning, Neural Network
Other Keywords:
Neural Network
Bayesian Deep Learning
Uncertainty Quantification
Laplace Approximations
License: http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de (German), http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en (English)

Abstract:

Well-calibrated predictive uncertainty of neural networks—essentially making them know when they do not know—is paramount in safety-critical applications. However, deep neural networks are overconfident both far away from and near the training data. In this thesis, we study Bayesian neural networks and their extensions to mitigate this issue. First, we show that being Bayesian, even just at the last layer and in a post-hoc manner via Laplace approximations, helps mitigate overconfidence in deep ReLU classifiers. Then, we provide a cost-effective Gaussian-process extension to ReLU Bayesian neural networks that guarantees ReLU nets will never be overconfident in regions far from the data. Furthermore, we propose three ways of improving the calibration of general Bayesian neural networks in regions near the data by (i) refining parametric approximations to the Bayesian neural networks' posteriors with normalizing flows, (ii) training the uncertainty of Laplace approximations, and (iii) leveraging out-of-distribution data during training. Finally, we provide an easy-to-use library, laplace-torch, to facilitate the modern use of Laplace approximations in deep learning. It gives users a way to turn a standard pre-trained deep net into a Bayesian neural network in a cost-efficient manner.
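
The central tool throughout is the Laplace approximation: given a network trained to a MAP estimate, the weight posterior is approximated by a Gaussian centered at the trained weights, with covariance given by the inverse Hessian of the training objective. A minimal statement of the approximation (notation mine, not taken from the abstract):

\[
p(\theta \mid \mathcal{D}) \approx \mathcal{N}\!\bigl(\theta;\, \theta_{\mathrm{MAP}},\, \Sigma\bigr),
\qquad
\Sigma = \Bigl(\nabla^{2}_{\theta}\,\mathcal{L}(\theta)\big|_{\theta = \theta_{\mathrm{MAP}}}\Bigr)^{-1},
\]

where \(\mathcal{L}\) is the negative log joint (training loss plus regularizer).

As a rough illustration of the post-hoc workflow the abstract describes, the sketch below follows the laplace-torch library's documented interface; `model`, `train_loader`, and `x` are placeholders for a pre-trained torch module, a training DataLoader, and a test batch, and the exact argument names should be checked against the library's current documentation:

    from laplace import Laplace

    # Wrap a pre-trained classifier; only the last layer is treated as
    # Bayesian, with a Kronecker-factored Hessian approximation.
    la = Laplace(model, 'classification',
                 subset_of_weights='last_layer',
                 hessian_structure='kron')

    la.fit(train_loader)                           # post-hoc: estimates curvature, weights stay fixed
    la.optimize_prior_precision(method='marglik')  # tunes the prior via the marginal likelihood

    probs = la(x, link_approx='probit')            # uncertainty-aware predictive probabilities

Because the fit is post-hoc, the original network's predictions are unchanged at the mode; only the predictive uncertainty is added on top, which is what makes the approach cost-efficient.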
